Why you can't trust a language model agent's reputation
Botao Amber Hu, Helena Rong, Max Van Kleek
May 28, 2026
Language model agents lack persistent identity. Unlike humans, who internalize sanctions, agents change behavior with prompt changes, module updates, or adversarial attacks. Traditional reputation mechanisms assume stable identity and behavioral continuity, two properties agents cannot provide. The authors argue that reputation-based governance will fail for AI agents and propose observable, protocol-based safeguards instead.