Publications

Addressing Janus Issue in Text-to-3D via Orientation-Controlled Diffusion Models

Abstract

In the evolving landscape of text-to-3D technology, Dreamfusion [18] has showcased its proficiency by utilizing Score Distillation Sampling (SDS) to optimize implicit representations such as NeRF. This process is achieved through the distillation of pretrained large-scale text-to-image diffusion models. However, Dreamfusion encounters fidelity and efficiency constraints: it faces the multi-head Janus issue and exhibits a relatively slow optimization process. To circumvent these challenges, we introduce OrientDream, a camera orientation conditioned framework designed for efficient and multi-view consistent 3D generation from textual prompts. Our strategy emphasizes the implementation of an explicit camera orientation conditioned feature in the pre-training of a 2D text-to-image diffusion module. This feature effectively utilizes data from MVImgNet, an extensive external multi-view dataset, to refine and bolster its …

Date
July 15, 2025
Authors
Yuzhong Huang, Fred Morstatter
Conference
International Conference on Pattern Recognition
Pages
187-201
Publisher
Springer, Cham