← Back to Computation and Language
cs.CL

Forgetting on command: unlearning for parallel text generation

Georu Lee, Seungwon Jeong, Hoki Kim, Jinseong Park, Woojin Lee

May 18, 2026

Masked diffusion language models like LLaDA generate text by denoising all positions in parallel rather than sequentially, but nobody has figured out how to make them forget specific information. This paper proposes Masked Diffusion Unlearning (MDU), which pushes the model back toward its unconditional state while keeping other knowledge intact. Testing shows MDU matches or beats existing unlearning methods on standard benchmarks. Code is released.
Published as Machine Unlearning for Masked Diffusion Language Models arXiv:2605.18253
Read the original paper →