← Back to Computation and Language
cs.CL

Can swapping English words for translations speed up multilingual learning?

Anastasiia Sedova, Natalie Schluter, Skyler Seto, Maartje ter Hoeve

May 22, 2026

Building effective language models for low-resource languages usually demands expensive parallel data or translation systems. LINK replaces random English words with their translations during pretraining—nothing fancy, just bilingual dictionary lookups. Tested across eight languages and five model sizes, the method cuts training time in half to reach the same downstream task performance, with gains in reasoning and world knowledge tasks.
Published as Multilingual Knowledge Transfer under Data Constraints via Lexical Interventions arXiv:2605.23885
Read the original paper →