Publications

OrientDream: Streamlining Text-to-3D Generation with Explicit Orientation Control

Abstract

In the evolving landscape of text-to-3D technology, DreamFusion [9] optimizes implicit representations such as NeRF using Score Distillation Sampling (SDS), but it is limited in both fidelity and speed: it suffers from the multi-head Janus issue and optimizes relatively slowly. We present OrientDream, a camera-orientation-conditioned framework for efficient, multi-view-consistent 3D generation from text prompts. OrientDream achieves this by pre-training a 2D text-to-image diffusion module with camera orientation features and using data from MVImgNet. To shorten training time, we introduce a decoupled back-propagation technique that allows multiple updates of the implicit parameters per optimization cycle. Our experiments show that our method not only produces high-quality NeRF models with consistent multi-view properties but also achieves an optimization speed significantly greater …

Date
April 6, 2025
Authors
Yuzhong Huang, Fred Morstatter
Conference
ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Pages
1-5
Publisher
IEEE