Act, Think or Abstain: Complexity-Aware Adaptive Inference for Vision-Language-Action Models

This paper proposes a complexity-aware adaptive inference framework for Vision-Language-Action models that dynamically routes execution to "Act," "Think," or "Abstain" based on task complexity, leveraging a vision-only detector to optimize resource allocation and prevent failures while achieving high accuracy with minimal training data.

Riccardo Andrea Izzo, Gianluca Bardaro, Matteo Matteucci2026-03-06💻 cs

Critic in the Loop: A Tri-System VLA Framework for Robust Long-Horizon Manipulation

This paper introduces "Critic in the Loop," a tri-system framework that dynamically coordinates a high-level Vision-Language Model for global reasoning and a fast Vision-Language-Action model for reactive execution via a lightweight visual critic, thereby achieving robust, state-of-the-art performance in long-horizon robotic manipulation by balancing semantic depth with real-time control.

Pengfei Yi, Yingjie Ma, Wenjiang Xu + 4 more2026-03-06💻 cs

Curve-Induced Dynamical Systems on Riemannian Manifolds and Lie Groups

This paper introduces Curve-induced Dynamical Systems on Smooth Manifolds (CDSM), a real-time framework that generates stable and adaptable robotic behaviors on Riemannian manifolds and Lie groups by constructing dynamical systems with tangential and normal components relative to a nominal curve, demonstrating superior accuracy and efficiency in both benchmarks and practical robotic applications.

Saray Bakker, Martin Schonger, Tobias Löw + 2 more2026-03-06💻 cs

From Code to Road: A Vehicle-in-the-Loop and Digital Twin-Based Framework for Central Car Server Testing in Autonomous Driving

This paper presents a Vehicle-in-the-Loop and digital twin-based framework that integrates a physical test vehicle on a dynamometer with a synchronized virtual environment to enable safe, cost-effective, and realistic validation of autonomous driving algorithms on centralized E/E architectures without requiring individual ECU testing or intermediate software layers.

Chengdong Wu, Sven Kirchner, Nils Purschke + 9 more2026-03-06💻 cs

Iterative On-Policy Refinement of Hierarchical Diffusion Policies for Language-Conditioned Manipulation

The paper proposes HD-ExpIt, a framework that iteratively refines hierarchical diffusion policies for language-conditioned manipulation by leveraging environment feedback to autonomously discover and distill successful behaviors, thereby aligning the planner with the controller's capabilities and achieving state-of-the-art performance on the CALVIN benchmark.

Clemence Grislain, Olivier Sigaud, Mohamed Chetouani2026-03-06💻 cs

Latent Policy Steering through One-Step Flow Policies

The paper proposes Latent Policy Steering (LPS), a robust offline reinforcement learning method that achieves state-of-the-art performance by using a differentiable one-step MeanFlow policy to backpropagate original-action-space Q-gradients directly to a latent actor, thereby eliminating the need for proxy latent critics and sensitive hyperparameter tuning while ensuring policies remain within dataset support.

Hokyun Im, Andrey Kolobov, Jianlong Fu + 1 more2026-03-06🤖 cs.LG

Omni-Manip: Beyond-FOV Large-Workspace Humanoid Manipulation with Omnidirectional 3D Perception

This paper presents Omni-Manip, an end-to-end LiDAR-driven visuomotor policy that leverages a Time-Aware Attention Pooling mechanism to process 360° panoramic point clouds, enabling humanoid robots to perform robust dexterous manipulation in large, cluttered workspaces without the need for frequent repositioning or reliance on narrow-field-of-view RGB-D cameras.

Pei Qu, Zheng Li, Yufei Jia + 5 more2026-03-06💻 cs