← Back to Computer Vision
cs.CV

How to skip unnecessary computation in image generation without losing quality?

Changwang Mei, Peisong Wang, Zekun Li, Changsheng Li, Shuang Qiu, Qinghao Hu, Gang Li, Yifan Zhang, Zhihui Wei, Jian Cheng

May 29, 2026

Visual autoregressive models generate images one token at a time, but most tokens are redundant computation. Instead of guessing which ones to skip using shallow heuristics, this work measures each token's true impact on the final image by tracking changes in the model's internal state. On the Infinity-8B model, the method achieves 2.35× speedup while preserving generation quality, and requires no retraining.
Published as Where to Refine, When to Stop: Rethinking Redundancy via Latent Discrepancy for Efficient Visual Autoregressive Generation arXiv:2606.00310
Read the original paper →