← Back to Artificial Intelligence
cs.AI

Making Python AI pipelines 6× faster by parallelizing automatically

Stephen Mell, David Mell, Konstantinos Kallas, Steve Zdancewic, Osbert Bastani

May 18, 2026

Compound AI applications—Python code that chains together calls to language models and other ML services—are bottlenecked by waiting for external services sequentially. PopPy analyzes Python code to find parallelization opportunities, then runs independent operations concurrently while preserving the program's original behavior. On real-world applications, it delivers up to 6.4× speedups with minimal developer effort.
Published as PopPy: Opportunistically Exploiting Parallelism in Python Compound AI Applications arXiv:2605.18697
Read the original paper →