cs.CL

Learning from what users fix: turning agent mistakes into training data

Hande Dong, Xiaoyun Liang, Jiarui Yu, Jiayi Lin, Changqing Ai, Feng Liu, Wenjun Zhang, Rongbi Wei, Chaofan Zhu, Linjie Che, Feng Wu, Xin Shen, Dexu Kong, Xiaotian Wang, Qiuyuan Chen, Bingxu An, Yueting Lei, Qiang Lin

May 21, 2026

Raw interaction logs between AI agents and users are noisy and inefficient for training. Echo captures the high-quality signal hidden in user refinements—when people fix flawed AI outputs—and feeds those corrections back into model training. Tested on a production code completion system, this approach broke performance plateaus by learning directly from what users actually need, rather than relying on static curated datasets.

Published as Echo: Learning from Experience Data via User-Driven Refinement arXiv:2605.21984

Read the original paper →