← Back to Robotics
cs.RO

Can robots learn from their own work?

Anlan Yu, Zaishu Chen, Zhiqing Hong, Daqing Zhang

June 4, 2026

Scaling robot learning in warehouses requires more than lab data—it needs continuous feedback loops. This work proposes a data flywheel that converts real logistics operations into reusable training assets. A world model generates supervision for tricky parcels robots rarely encounter, while deployment feedback improves policies over time. WM-DAgger combines world-model-based data synthesis with imitation learning to handle out-of-distribution scenarios.
Published as Towards a Data Flywheel for Embodied Intelligence in Logistics arXiv:2606.05960
Read the original paper →