← Back to Computer Vision
cs.CV

Measuring whether AI videos obey physics

Jiaxin Wu, Yihao Pi, Yinling Zhang, Yuheng Li, Xueyan Zou

May 14, 2026

Evaluating whether generative video models produce physically coherent 3D worlds has relied on subjective human judgment or learned metrics. PDI-Bench introduces an objective framework that reconstructs 3D coordinates from generated videos using segmentation and point tracking, then measures geometric failures across three dimensions: scale-depth alignment, motion consistency, and structural rigidity. Testing on state-of-the-art video generators reveals consistent geometry-specific failure modes invisible to standard perceptual metrics like LPIPS or FVD. The authors release both code and PDI-Dataset, a diverse benchmark designed to stress physical constraints.
Published as Quantitative Video World Model Evaluation for Geometric-Consistency arXiv:2605.15185
Read the original paper →