cs.CL

Why superintelligent AI trained alone won't cooperate with us

Rakshit S Trivedi, Natasha Jaques, Logan Cross, Alexander Sasha Vezhnevets, Joel Z Leibo

June 2, 2026

AI trained to maximize performance on fixed benchmarks faces a fatal flaw: deployment changes the world, breaking the assumptions the system learned from. The authors argue that building cooperative superintelligence requires abandoning isolated optimization entirely and instead designing AI as a participant in a multi-agent equilibrium from the start, with adaptive counterparties and institutions as core design features rather than afterthoughts.
Published as "Solipsistic Superintelligence is Unlikely to be Cooperative", arXiv:2606.03237