← Back to Computer Vision
cs.CV

Separating appearance from geometry in AI view synthesis

Yihang Wu, Yihang Sun, Shaofeng Zhang, Zuxuan Wu, Junchi Yan, Xiaosong Jia, Yu-gang Jiang

May 18, 2026

Current feedforward view synthesis models blend semantic (color, texture) and spatial (ray position) information into one representation, which causes spatial structure to interfere with appearance quality. This paper separates them into distinct branches that still communicate through shared attention, adding optional branch-specific supervision and bidirectional modulation to improve their interaction. The approach works with both decoder-only and encoder-decoder architectures and adds virtually no inference cost.
Published as Resolving Representation Ambiguity in Feedforward Novel View Synthesis Transformer via Semantic-Spatial Decoupling arXiv:2605.18599
Read the original paper →