← Back to Computer Vision
cs.CV

Training web agents to learn from live websites, not just recorded demos

Rui Yang, Qianhui Wu, Yuxi Chen, Hao Bai, Wenlin Yao, Hao Cheng, Baolin Peng, Huan Zhang, Tong Zhang, Jianfeng Gao

June 1, 2026

Building capable web agents usually requires huge collections of hand-curated website interaction recordings—expensive and incomplete. OpenWebRL trains visual agents directly on live websites using reinforcement learning instead, letting them learn from real-world feedback. The framework achieved 67% success on Online-Mind2Web and 64% on DeepShop with minimal supervised data, matching OpenAI's and Google's proprietary systems. Code and models will be released.
Published as OpenWebRL: Demystifying Online Multi-turn Reinforcement Learning for Visual Web Agents arXiv:2606.02031
Read the original paper →