ROSflight 2.0: Lean ROS 2-Based Autopilot for Unmanned Aerial Vehicles

This paper introduces ROSflight 2.0, a modular, open-source ROS 2-based autopilot ecosystem designed to lower barriers for UAV research and accelerate the transition from simulation to hardware, featuring a lean architecture that successfully controls multirotors at 400 Hz with all loops running on a companion computer.

Jacob Moore, Phil Tokumaru, Ian Reid, Brandon Sutherland, Joseph Ritchie, Gabe Snow, Tim McLain2026-03-09💻 cs

Push Anything: Single- and Multi-Object Pushing From First Sight with Contact-Implicit MPC

This paper introduces Consensus Complementarity Control Plus (C3+), an enhanced contact-implicit model predictive control algorithm that enables a complete robotic pipeline to robustly and efficiently push diverse single and multi-object configurations to target poses in real-time, achieving a 98% success rate on hardware.

Hien Bui, Yufeiyang Gao, Haoran Yang, Eric Cui, Siddhant Mody, Brian Acosta, Thomas Stephen Felix, Bibit Bianchini, Michael Posa2026-03-09💻 cs

AURASeg: Attention-guided Upsampling with Residual-Assistive Boundary Refinement for Onboard Robot Drivable-Area Segmentation

This paper introduces AURASeg, an attention-guided segmentation framework featuring a Residual Boundary Refinement Module and an Attention Progressive Upsampling Decoder to enhance drivable-area boundary precision and multi-scale feature representation for onboard robot navigation, demonstrating superior performance on multiple datasets and successful deployment on a Jetson Nano.

Narendhiran Vijayakumar, Sridevi. M2026-03-09💻 cs

Real-Time Learning of Predictive Dynamic Obstacle Models for Robotic Motion Planning

This paper presents a real-time online framework that utilizes modified sliding-window Hankel Dynamic Mode Decomposition with singular-value hard thresholding and Cadzow projection to denoise partial measurements and construct predictive models for dynamic obstacle motion, enabling stable, variance-aware forecasting suitable for robotic motion planning.

Stella Kombo, Masih Haseli, Skylar X. Wei, Joel W. Burdick2026-03-09🤖 cs.LG

ExpReS-VLA: Specializing Vision-Language-Action Models Through Experience Replay and Retrieval

ExpReS-VLA is a specialized Vision-Language-Action model that enables rapid, memory-efficient on-device adaptation to specific robotic tasks by combining compressed experience replay, retrieval-augmented generation, and a novel contrastive loss to prevent catastrophic forgetting while significantly improving performance on both spatial and long-horizon benchmarks.

Shahram Najam Syed, Yatharth Ahuja, Arthur Jakobsson, Jeff Ichnowski2026-03-09💻 cs

EchoVLA: Synergistic Declarative Memory for VLA-Driven Mobile Manipulation

EchoVLA is a memory-enhanced Vision-Language-Action model for mobile manipulation that synergizes scene and episodic declarative memories to improve navigation and task performance, validated by the new MoMani benchmark and demonstrating significant gains over existing baselines in both simulation and real-world settings.

Min Lin, Xiwen Liang, Bingqian Lin, Liu Jingzhi, Zijian Jiao, Kehan Li, Yu Sun, Weijia Liufu, Yuhan Ma, Yuecheng Liu, Shen Zhao, Yuzheng Zhuang, Xiaodan Liang2026-03-09💻 cs

XR-DT: Extended Reality-Enhanced Digital Twin for Safe Motion Planning via Human-Aware Model Predictive Path Integral Control

This paper introduces XR-DT, an Extended Reality-enhanced Digital Twin framework that integrates a novel Human-Aware Model Predictive Path Integral (HA-MPPI) controller with an attention-based trajectory prediction model to enable safe, efficient, and interpretable motion planning for mobile robots operating alongside humans.

Tianyi Wang, Jiseop Byeon, Ahmad Yehia, Yiming Xu, Jihyung Park, Tianyi Zeng, Sikai Chen, Ziran Wang, Junfeng Jiao, Christian Claudel2026-03-09🤖 cs.AI

Safe Model Predictive Diffusion with Shielding

This paper introduces Safe Model Predictive Diffusion (Safe MPD), a training-free planning framework that integrates a safety shield directly into the diffusion denoising process to generate kinodynamically feasible and safe trajectories in real-time, outperforming existing methods in success rate and safety without requiring post-processing corrections.

Taekyung Kim, Keyvan Majd, Hideki Okamoto, Bardh Hoxha, Dimitra Panagou, Georgios Fainekos2026-03-09💻 cs

VISO: Robust Underwater Visual-Inertial-Sonar SLAM with Photometric Rendering for Dense 3D Reconstruction

This paper presents VISO, a robust underwater SLAM system that fuses stereo cameras, IMUs, and 3D sonar with novel calibration and photometric rendering techniques to achieve accurate 6-DoF localization and real-time, high-fidelity dense 3D reconstruction in challenging aquatic environments.

Shu Pan, Simon Archieri, Ahmet Cinar, Jonatan Scharff Willners, Ignacio Carlucho, Yvan Petillot2026-03-09💻 cs