← Back to Computer Vision
cs.CV

Recognizing actions from any camera angle without retraining

Yannick Porto, Renato Martins, Thomas Chalumeau, Cedric Demonceaux

May 21, 2026

Action recognition systems fail when camera angles or body orientations differ from training data—a major problem for real-world deployment. This work combines motion cues from multiple viewpoints with text descriptions to train models that recognize both seen and unseen actions across camera angles. The approach uses an orientation-aware motion encoder and adaptive text prompts that adjust to different body positions at test time, improving performance across four major benchmarks (NTU-RGB+D, BABEL, NW-UCLA, and surveillance datasets) while outperforming recent zero-shot methods. Code and models are released.
Published as Cross-Domain Human Action Recognition from Multiview Motion and Textual Descriptions arXiv:2605.22697
Read the original paper →