← Back to Computer Vision
cs.CV

One toolkit for egocentric video across any device

Liuchuan Yu, Erdem Murat, Beichen Wang, Yan Zeng, Tingting Luo, Huizhen Zhou, Shanghao Li, Huining Feng, Zhigen Zhao, Ning Yang, Ke Jing, Yunhao Liu, Ruoya Sheng

May 16, 2026

Egocentric video collection for robotics, activity recognition, and embodied AI is hampered by incompatible SDKs, camera access policies, and proprietary platforms across different devices. EgoKit provides a single toolkit that exposes the same recording workflow on six heterogeneous host devices (Android phones, iPhones, iPads, smart glasses, XR headsets) and produces locally stored video in a uniform log format. On XR headsets, it captures synchronized head pose and 26-joint hand tracking aligned to video. Companion accessories—two wrist cameras with mounts, head strap, and USB-C hub—add multi-view capability without custom fabrication. Code and design specifications available at egokit.chuange.org.
Published as EgoKit: Towards Unified Low-Cost Egocentric Data Collection with Heterogeneous Devices arXiv:2605.16797
Read the original paper →