← Back to Robotics cs.RO
Can robots learn from their own work?
Anlan Yu, Zaishu Chen, Zhiqing Hong, Daqing Zhang
June 4, 2026
Scaling robot learning in warehouses requires more than lab data—it needs continuous feedback loops. This work proposes a data flywheel that converts real logistics operations into reusable training assets. A world model generates supervision for tricky parcels robots rarely encounter, while deployment feedback improves policies over time. WM-DAgger combines world-model-based data synthesis with imitation learning to handle out-of-distribution scenarios.
Read the original paper →