Can a single transformer track any human motion without retraining?
Zekun Qi, Xuchuan Chen, Dairu Liu, Chenghuai Lin, Yunrui Lian, Sikai Liang, Zhikai Zhang, Yu Guan, Jilong Wang, Wenyao Zhang, Xinqiang Yu, He Wang, Li Yi
June 2, 2026
Humanoid-GPT trains a GPT-style transformer on 2 billion motion frames from unified mocap datasets to control full-body humanoid movement. Unlike older MLP-based trackers that struggle with either agility or generalization, this single model tracks complex dynamic behaviors and transfers zero-shot to entirely new motions and tasks, a capability that previously required task-specific retraining.