Can you animate a scene just by describing it?
Mannat Khurana, Sanyam Jain, Rishav Agarwal
May 26, 2026
Designers currently plot motion paths and timing curves for animations by hand. This system chains language models that interpret the text prompt with visual grounding tools, and generates animations automatically that respect scene geometry, depth, and perspective. Three demos show it handling orbital motion, contour-following, and perspective-aligned movement on transformed objects.
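To make "orbital motion" concrete, here is a minimal sketch of the kind of motion path such a system would have to produce. It is not taken from the paper: the function name orbit_keyframes, its parameters, and the tilt factor (a crude stand-in for perspective, squashing the vertical axis of a circular orbit) are all assumptions for illustration.

```python
import math


def orbit_keyframes(center, radius, n_frames, tilt=1.0):
    """Return n_frames (x, y) positions on an ellipse around `center`.

    Hypothetical helper, not the paper's method. `tilt` in (0, 1]
    scales the vertical radius, approximating a circular orbit
    viewed at an angle.
    """
    cx, cy = center
    frames = []
    for i in range(n_frames):
        # Evenly spaced angles over one full revolution.
        theta = 2 * math.pi * i / n_frames
        x = cx + radius * math.cos(theta)
        y = cy + radius * tilt * math.sin(theta)
        frames.append((x, y))
    return frames


# Four keyframes orbiting the point (100, 50), flattened to half height.
keyframes = orbit_keyframes(center=(100.0, 50.0), radius=20.0, n_frames=4, tilt=0.5)
```

In the system described above, the center and radius would presumably come from the visual grounding step rather than being typed in by hand; here they are hard-coded.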