← Back to Computer Vision
cs.CV

Can generative models reconstruct entire 3D scenes from photos?

Katharina Schmid, Nicolas von Lützow, Jozef Hladký, Angela Dai, Matthias Nießner

May 22, 2026

GenRecon reconstructs detailed 3D scenes from multi-view photos by breaking the scene into overlapping chunks and applying a strong generative model (Trellis.2) to each. A projection-based conditioning mechanism aligns multi-view image features with the generative model, ensuring consistency across views and generating high-fidelity PBR meshes. Results outperform state-of-the-art reconstruction by 16% on indoor scenes.
Published as GenRecon: Bridging Generative Priors for Multi-View 3D Scene Reconstruction arXiv:2605.23888
Read the original paper →