← Back to Robotics
cs.RO

What if city navigation models learned invisible geometry instead of what things look like?

Xuhui Lin, Stephen Law, Nanjiang Chen, Kunyao Li, Tao Yang

June 2, 2026

Navigation agents need to predict where they can move, not what buildings look like. This work replaces flat occupancy grids with 3D isovists—360° depth maps capturing free space around an agent. Trained on Manhattan and Paris data, a single city-blind model unexpectedly encodes city identity in its learned dynamics rather than visual appearance, making the spatial signature linearly decodable from latent trajectories. Open dataset and code included.
Published as A 3D Isovist World Model -- Revealing a City's Unseen Geometry and Its Emergent Cross-City Signature arXiv:2606.03609
Read the original paper →