← Back to Robotics
cs.RO

Can language models safely handle delicate robot touch tasks?

Haofan Cao, Zhaoyang Li, Zhichao You, Liang Guo, Tianrui Li

May 30, 2026

Vision-language models excel at understanding what a robot should do, but they output commands too slowly for precise contact tasks like inserting connectors. PaCo-VLA wraps the VLA in a high-frequency "passivity shield" that treats the model's outputs as suggestions—compliance targets, not direct motor commands—and enforces energy conservation to prevent damage. In real connector-insertion trials, this framework achieved zero safety violations while outperforming unshielded baselines, proving language models can handle delicate manipulation when wrapped in physics-aware guards.
Published as PaCo-VLA: Passivity-Shielded Compliance Prior for Contact-Rich Vision-Language-Action Manipulation arXiv:2606.00515
Read the original paper →