DexHiL: A Human-in-the-Loop Framework for Vision-Language-Action Model Post-Training in Dexterous Manipulation
DexHiL is the first integrated human-in-the-loop framework for dexterous Vision-Language-Action models that combines coordinated arm-hand teleoperation with intervention-aware data sampling to significantly improve post-training performance and reliability in complex manipulation tasks.