← Back to Computer Vision cs.CV
Can generative models reconstruct entire 3D scenes from photos?
Katharina Schmid, Nicolas von Lützow, Jozef Hladký, Angela Dai, Matthias Nießner
May 22, 2026
GenRecon reconstructs detailed 3D scenes from multi-view photos by breaking the scene into overlapping chunks and applying a strong generative model (Trellis.2) to each. A projection-based conditioning mechanism aligns multi-view image features with the generative model, ensuring consistency across views and generating high-fidelity PBR meshes. Results outperform state-of-the-art reconstruction by 16% on indoor scenes.
Read the original paper →