← Back to Computer Vision cs.CV
One toolkit for egocentric video across any device
Liuchuan Yu, Erdem Murat, Beichen Wang, Yan Zeng, Tingting Luo, Huizhen Zhou, Shanghao Li, Huining Feng, Zhigen Zhao, Ning Yang, Ke Jing, Yunhao Liu, Ruoya Sheng
May 16, 2026
Egocentric video collection for robotics, activity recognition, and embodied AI is hampered by incompatible SDKs, camera access policies, and proprietary platforms across different devices. EgoKit provides a single toolkit that exposes the same recording workflow on six heterogeneous host devices (Android phones, iPhones, iPads, smart glasses, XR headsets) and produces locally stored video in a uniform log format. On XR headsets, it captures synchronized head pose and 26-joint hand tracking aligned to video. Companion accessories—two wrist cameras with mounts, head strap, and USB-C hub—add multi-view capability without custom fabrication. Code and design specifications available at egokit.chuange.org.
Read the original paper →