NoRD: A Data-Efficient Vision-Language-Action Model that Drives without Reasoning
NoRD is a data-efficient Vision-Language-Action (VLA) model for autonomous driving. It achieves competitive performance on the Waymo and NAVSIM benchmarks using less than 60% of the training data and no reasoning annotations. It does so by addressing the difficulty bias in standard Group Relative Policy Optimization (GRPO) through the Dr. GRPO algorithm.
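
To make the difficulty bias concrete, here is a minimal sketch of the group-relative advantage as it is commonly described for GRPO and for Dr. GRPO. It is an illustration under stated assumptions, not the NoRD implementation: the function names and reward values are hypothetical, and the reward design used in the paper is not specified in this summary. Standard GRPO divides each centered reward by the group's standard deviation, so a group whose rollouts score almost identically still produces large advantages. Dr. GRPO, as generally described, drops that division (it also drops a length-normalization term, omitted here).

```python
from statistics import fmean, pstdev


def grpo_advantages(rewards, eps=1e-6):
    """Standard GRPO: center by the group mean, then divide by the group std."""
    mean = fmean(rewards)
    std = pstdev(rewards)  # population std over the sampled group
    return [(r - mean) / (std + eps) for r in rewards]


def dr_grpo_advantages(rewards):
    """Dr. GRPO (as commonly described): center by the group mean only."""
    mean = fmean(rewards)
    return [r - mean for r in rewards]


# Hypothetical reward groups, each holding four rollouts for one scenario.
low_variance = [0.90, 0.91, 0.90, 0.90]   # rollouts score almost identically
high_variance = [0.10, 0.90, 0.20, 0.80]  # rollouts differ substantially


def max_abs(values):
    return max(abs(v) for v in values)


if __name__ == "__main__":
    for name, group in [("low", low_variance), ("high", high_variance)]:
        print(name,
              "GRPO:", round(max_abs(grpo_advantages(group)), 3),
              "Dr. GRPO:", round(max_abs(dr_grpo_advantages(group)), 3))
```

With std normalization, the near-identical group receives the larger maximum advantage even though its rewards differ by only 0.01. With mean-centering alone, the advantage magnitude tracks the actual reward spread.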