Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning
Fast-ThinkAct is an efficient Vision-Language-Action framework that utilizes preference-guided distillation of verbalizable latent reasoning to significantly reduce inference latency while maintaining strong performance in long-horizon planning, few-shot adaptation, and failure recovery.