From Flow to One Step: Real-Time Multi-Modal Trajectory Policies via Implicit Maximum Likelihood Estimation-based Distribution Distillation
This paper proposes a real-time multi-modal trajectory policy framework that distills a Conditional Flow Matching expert into a single-step student using Implicit Maximum Likelihood Estimation and a bi-directional Chamfer distance, thereby eliminating the latency of iterative ODE integration while preserving multi-modal action diversity for high-frequency robotic control.