← Back to Robotics
cs.RO

Teaching robots two hands and ten fingers at once

Zongzheng Zhang, Jingrui Pang, Zhuo Yang, Kun Li, Minwen Liao, Saining Zhang, Guoxuan Chi, Jinbang Guo, Huan-ang Gao, Modi Shi, Dongyun Ge, Yao Mu, Jiayuan Gu, Rui Chen, Hao Dong, Huazhe Xu, Li Yi, Yixin Zhu, Hang Zhao, Pengwei Wang, Shanghang Zhang, Guocai Yao, Jianyu Chen, Hongyang Li, Hao Zhao

May 18, 2026

Controlling robots with five fingers per hand is harder than simple gripper manipulation, but existing vision-language-action models skip it. Dexora combines a custom exoskeleton for arm control and Apple Vision Pro for finger tracking to collect 10K teleoperated episodes, then trains a diffusion-transformer policy that learns which demonstrations are trustworthy via an offline discriminator. The result: 66.7% success on dexterous manipulation—25% better than baseline VLAs—plus code, models, and datasets released openly.
Published as Dexora: Open-source VLA for High-DoF Bimanual Dexterity arXiv:2605.18722
Read the original paper →