← Back to Machine Learning
cs.LG

Teaching drones to race safely by competing with each other

Ismail Geles, Leonard Bauersfeld, Markus Wulfmeier, Davide Scaramuzza

May 21, 2026

Autonomous systems trained in isolation fail in the real world because they ignore other actors. This paper trains quadrotor drones using multi-agent reinforcement learning through competitive racing, where agents learn to anticipate collisions, overtake strategically, and handle aerodynamic interactions with multiple competitors. The trained agents outperform elite human pilots in multi-drone races and—surprisingly—generalize to safer interaction with untrained human partners without any fine-tuning, suggesting that learning through realistic multi-agent competition builds robustness better than explicit safety constraints.
Published as Superhuman Safe and Agile Racing through Multi-Agent Reinforcement Learning arXiv:2605.22748
Read the original paper →