← Back to General Economics
econ.GN

How AI detectors fool us about who's actually using them

Shang Wu, Randol Yao

May 26, 2026

Researchers tested how AI-detection benchmarks measure language tool use in journal abstracts across countries and academic fields. A pooled detection method systematically misidentified natural stylistic differences as AI-generated text, overestimating adoption in some regions by large margins. Using field-specific and country-specific baselines instead produced far more accurate results, suggesting that crude, one-size-fits-all AI detection distorts reality and creates unfair comparisons between nations and disciplines.
Published as AI evaluation may bias perceptions: The importance of context in interpreting academic writing arXiv:2605.26662
Read the original paper →