VLA-ATTC: Adaptive Test-Time Compute for VLA Models with Relative Action Critic Model
This paper introduces VLA-ATTC, a framework that adds adaptive test-time compute to Vision-Language-Action (VLA) models. An uncertainty-based "cognitive clutch" determines when extra compute is spent, and a novel Relative Action Critic selects among candidate actions by pairwise comparison, significantly reducing failure rates in complex manipulation tasks without manual annotation.
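
To make the control flow concrete, the following is a minimal sketch of uncertainty-gated action selection with a pairwise critic. It is an illustration under stated assumptions, not the paper's implementation: this summary does not specify the uncertainty measure, the threshold, or the critic architecture, so the variance-based `action_uncertainty`, the `threshold` value, and the `sample_action` and `prefers` callables are all hypothetical stand-ins.

```python
from typing import Callable, List, Sequence, Tuple

Action = Sequence[float]


def action_uncertainty(samples: List[Action]) -> float:
    """Mean per-dimension variance across sampled actions.

    Assumption: sample disagreement is used as the uncertainty signal;
    the paper may use a different measure.
    """
    n = len(samples)
    dims = len(samples[0])
    total = 0.0
    for d in range(dims):
        vals = [s[d] for s in samples]
        mean = sum(vals) / n
        total += sum((v - mean) ** 2 for v in vals) / n
    return total / dims


def select_pairwise(candidates: List[Action],
                    prefers: Callable[[Action, Action], bool]) -> Action:
    """Keep a running champion; replace it whenever the critic prefers the challenger."""
    best = candidates[0]
    for challenger in candidates[1:]:
        if prefers(challenger, best):
            best = challenger
    return best


def act(sample_action: Callable[[], Action],
        prefers: Callable[[Action, Action], bool],
        threshold: float = 0.01,   # hypothetical value
        n_probe: int = 2,          # hypothetical value
        n_candidates: int = 8      # hypothetical value
        ) -> Tuple[Action, bool]:
    """Return (action, extra_compute_used).

    Low uncertainty: return the first probe directly (cheap path).
    High uncertainty: sample more candidates and let the critic choose.
    """
    probes = [sample_action() for _ in range(n_probe)]
    if action_uncertainty(probes) <= threshold:
        return probes[0], False
    extra = [sample_action() for _ in range(n_candidates - n_probe)]
    return select_pairwise(probes + extra, prefers), True
```

In this sketch the critic only has to answer "is action A better than action B?", so selection needs `n_candidates - 1` comparisons and no absolute score for any single action. A real system would pass the observation and instruction to both the policy and the critic; they are omitted here to keep the example self-contained.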