← Back to Computation and Language cs.CL
Do AI agents actually remember what you told them?
Adril Putra Merin, David Anugraha, Ayu Purwarianti, Genta Indra Winata
May 30, 2026
Existing agent benchmarks test single sessions, but real users interact with AI assistants across weeks or months with evolving needs. Momento evaluates whether agents can remember past actions, stated preferences, and context while handling tool use across multiple sessions. Current agents struggle—they treat old session history as current fact rather than outdated information needing re-validation, revealing a gap between lab performance and realistic long-horizon interaction.
Read the original paper →