cs.CV

Can AI understand 3D scenes as well as their individual parts?

Shaohui Dai, Yansong Qu, You Shen, Shengchuan Zhang, Liujuan Cao

June 4, 2026

Existing 3D AI models recognize whole objects but miss fine-grained details, such as distinguishing a chair's leg from its seat. PAR3D adds part-awareness by learning hierarchical object-part relationships and grounding both objects and parts in 3D space. A new dataset, ScenePart, provides part-level annotations for training and evaluation. The method substantially improves performance on part-focused tasks while maintaining strong performance on standard object-level benchmarks.
Published as "PAR3D: A Unified 3D-MLLM with Part-Aware Representation for Scene Understanding" (arXiv:2606.06485)