Publications
OrientDream: Streamlining text-to-3D generation with explicit orientation control
Abstract
In the evolving landscape of text-to-3D technology, DreamFusion [9] optimizes implicit representations such as NeRF using Score Distillation Sampling (SDS), but it is limited in both fidelity and speed. Specifically, it suffers from the multi-head Janus issue and a relatively slow optimization process. We present OrientDream, a camera-orientation-conditioned framework for efficient, multi-view-consistent 3D generation from text prompts. OrientDream achieves this by pre-training a 2D text-to-image diffusion module with camera orientation features, using data from MVImgNet. To shorten training time, we introduce a decoupled back-propagation technique that allows multiple updates of the implicit parameters per optimization cycle. Our experiments reveal that our method not only produces high-quality NeRF models with consistent multi-view properties but also achieves an optimization speed significantly greater …
- Date: April 6, 2025
- Authors: Yuzhong Huang, Fred Morstatter
- Conference: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Pages: 1-5
- Publisher: IEEE