← Back to Machine Learning (Statistics)
stat.ML

Can you make frozen language models smarter without retraining?

Lizhang Chen, Jonathan Li, Chen Liang, Ni Lao, Qiang Liu

May 22, 2026

Researchers treat repeated application of transformer layers as refinement steps in an ODE solver, allowing inference-time looping without modifying or retraining frozen models. By damping sub-steps rather than naively reapplying blocks, the method improves performance across multiple model families: +2.64 pp on MMLU-Pro (Qwen3-4B), +1.14 pp on CommonsenseQA (30B), and +1.20 pp on OpenBookQA (16B).
Published as Training-Free Looped Transformers arXiv:2605.23872
Read the original paper →