← Back to Computer Vision
cs.CV

Making slow image generators 100× faster with one-step distillation

Chaoyang Wang, Yunhai Tong

May 20, 2026

Discrete diffusion models generate high-quality images but require slow iterative decoding. Fixed-Point Distillation (FPD) compresses this into one inference step by training a student model to mimic a teacher's corrections. The key trick: corrupt the student's draft slightly, let the teacher refine it once, then train the student to predict that refined version—all while routing gradients through discrete tokens via a straight-through estimator. On class-conditional and text-to-image tasks, FPD matches multi-step teachers while being orders of magnitude faster at inference.
Published as One-Step Distillation of Discrete Diffusion Image Generators via Fixed-Point Iteration arXiv:2605.21484
Read the original paper →