← Back to Artificial Intelligence
cs.AI

Why GUI agents fail at dragging and how to fix it

Nathan Bout, Maxime Langevin, Ronan Riochet

June 4, 2026

GUI agents can click but struggle with drag operations (highlighting, resizing, slider manipulation)—a gap in training data of an order of magnitude. DragOn provides 3.5M drag tasks across four interaction types to address this. Fine-tuning Qwen on the dataset improves performance on complex drag interactions, suggesting the bottleneck was simply missing examples rather than fundamental model limits.
Published as DragOn: A Benchmark and Dataset for Drag-Based GUI Interactions arXiv:2606.06322
Read the original paper →