← Back to Computation and Language cs.CL
Learning from what users fix: turning agent mistakes into training data
Hande Dong, Xiaoyun Liang, Jiarui Yu, Jiayi Lin, Changqing Ai, Feng Liu, Wenjun Zhang, Rongbi Wei, Chaofan Zhu, Linjie Che, Feng Wu, Xin Shen, Dexu Kong, Xiaotian Wang, Qiuyuan Chen, Bingxu An, Yueting Lei, Qiang Lin
May 21, 2026
Raw interaction logs between AI agents and users are noisy and inefficient for training. Echo captures the high-quality signal hidden in user refinements—when people fix flawed AI outputs—and feeds those corrections back into model training. Tested on a production code completion system, this approach broke performance plateaus by learning directly from what users actually need, rather than relying on static curated datasets.
Read the original paper →