← Back to Artificial Intelligence
cs.AI

Why you can't trust a language model agent's reputation

Botao Amber Hu, Helena Rong, Max Van Kleek

May 28, 2026

Language model agents lack persistent identity: their behavior shifts with prompt changes, module updates, or adversarial attacks, unlike humans who internalize sanctions. Traditional reputation mechanisms assume stable identity and behavioral continuity—properties agents can't provide. The authors argue reputation-based governance will fail for AI agents and propose observable, protocol-based safeguards instead.
Published as Dissociative Identity: Language Model Agents Lack Grounding for Reputation Mechanisms arXiv:2605.30169
Read the original paper →