Why you can't trust a language model agent's reputation

Language model agents lack persistent identity: their behavior shifts with prompt changes, module updates, or adversarial attacks, unlike humans who internalize sanctions. Traditional reputation mechanisms assume stable identity and behavioral continuity—properties agents can't provide. The authors argue reputation-based governance will fail for AI agents and propose observable, protocol-based safeguards instead.