Publications
Addressing Janus Issue in Text-to-3D via Orientation-Controlled Diffusion Models
Abstract
In the evolving landscape of text-to-3D technology, DreamFusion [18] has showcased its proficiency by utilizing Score Distillation Sampling (SDS) to optimize implicit representations such as NeRF. This process is achieved through the distillation of pretrained large-scale text-to-image diffusion models. However, DreamFusion encounters fidelity and efficiency constraints: it faces the multi-head Janus issue and exhibits a relatively slow optimization process. To circumvent these challenges, we introduce OrientDream, a camera-orientation-conditioned framework designed for efficient and multi-view-consistent 3D generation from textual prompts. Our strategy emphasizes the implementation of an explicit camera-orientation-conditioned feature in the pre-training of a 2D text-to-image diffusion module. This feature effectively utilizes data from MVImgNet, an extensive external multi-view dataset, to refine and bolster its …
- Date: July 15, 2025
- Authors: Yuzhong Huang, Fred Morstatter
- Conference: International Conference on Pattern Recognition
- Pages: 187-201
- Publisher: Springer, Cham