cs.RO papers | Gist.Science

ROSflight 2.0: Lean ROS 2-Based Autopilot for Unmanned Aerial Vehicles

This paper introduces ROSflight 2.0, a modular, open-source ROS 2-based autopilot ecosystem designed to lower barriers for UAV research and accelerate the transition from simulation to hardware, featuring a lean architecture that successfully controls multirotors at 400 Hz with all loops running on a companion computer.

Jacob Moore, Phil Tokumaru, Ian Reid, Brandon Sutherland, Joseph Ritchie, Gabe Snow, Tim McLain | 2026-03-09 | cs

ROSplane 2.0: A Fixed-Wing Autopilot for Research

ROSplane 2.0 is an open-source, ROS 2-based fixed-wing autopilot framework designed by researchers to accelerate UAV experimentation through a lean, modular architecture, enhanced control algorithms, and a streamlined aerodynamic modeling pipeline that simplifies the transition from simulation to real-world testing.

Ian Reid, Joseph Ritchie, Jacob Moore, Brandon Sutherland, Gabe Snow, Phillip Tokumaru, Tim McLain | 2026-03-09 | cs

Phys2Real: Fusing VLM Priors with Interactive Online Adaptation for Uncertainty-Aware Sim-to-Real Manipulation

Phys2Real is a real-to-sim-to-real reinforcement learning framework that enhances sim-to-real transfer for precise robotic manipulation by fusing vision-language model-inferred physical parameter priors with online interactive adaptation through uncertainty-aware ensemble estimation.

Maggie Wang, Stephen Tian, Aiden Swann, Ola Shorinwa, Jiajun Wu, Mac Schwager | 2026-03-09 | cs.AI

Sample-Based Hybrid Mode Control: Asymptotically Optimal Switching of Algorithmic and Non-Differentiable Control Modes

This paper presents a sample-based hybrid mode control framework that formulates mode selection, switching timing, and duration as an integer optimization problem to achieve asymptotically optimal, reactive switching between non-differentiable and algorithmic control strategies for complex robotic tasks.

Yilang Liu, Haoxiang You, Ian Abraham | 2026-03-09 | cs

Push Anything: Single- and Multi-Object Pushing From First Sight with Contact-Implicit MPC

This paper introduces Consensus Complementarity Control Plus (C3+), an enhanced contact-implicit model predictive control algorithm that enables a complete robotic pipeline to robustly and efficiently push diverse single- and multi-object configurations to target poses in real time, achieving a 98% success rate on hardware.

Hien Bui, Yufeiyang Gao, Haoran Yang, Eric Cui, Siddhant Mody, Brian Acosta, Thomas Stephen Felix, Bibit Bianchini, Michael Posa | 2026-03-09 | cs

AURASeg: Attention-guided Upsampling with Residual-Assistive Boundary Refinement for Onboard Robot Drivable-Area Segmentation

This paper introduces AURASeg, an attention-guided segmentation framework featuring a Residual Boundary Refinement Module and an Attention Progressive Upsampling Decoder to enhance drivable-area boundary precision and multi-scale feature representation for onboard robot navigation, demonstrating superior performance on multiple datasets and successful deployment on a Jetson Nano.

Narendhiran Vijayakumar, Sridevi. M | 2026-03-09 | cs

Real-Time Learning of Predictive Dynamic Obstacle Models for Robotic Motion Planning

This paper presents a real-time online framework that utilizes modified sliding-window Hankel Dynamic Mode Decomposition with singular-value hard thresholding and Cadzow projection to denoise partial measurements and construct predictive models for dynamic obstacle motion, enabling stable, variance-aware forecasting suitable for robotic motion planning.

Stella Kombo, Masih Haseli, Skylar X. Wei, Joel W. Burdick | 2026-03-09 | cs.LG

Indicating Robot Vision Capabilities with Augmented Reality

This paper proposes and evaluates four augmented reality indicators designed to visualize a robot's field of view, finding that allocentric indicators placed in the task space and egocentric indicators modifying the robot's eyes effectively improve human accuracy in collaborative tasks while maintaining high confidence and low cognitive load.

Hong Wang, Ridhima Phatak, James Ocampo, Zhao Han | 2026-03-09 | cs

ExpReS-VLA: Specializing Vision-Language-Action Models Through Experience Replay and Retrieval

ExpReS-VLA is a specialized Vision-Language-Action model that enables rapid, memory-efficient on-device adaptation to specific robotic tasks by combining compressed experience replay, retrieval-augmented generation, and a novel contrastive loss to prevent catastrophic forgetting while significantly improving performance on both spatial and long-horizon benchmarks.

Shahram Najam Syed, Yatharth Ahuja, Arthur Jakobsson, Jeff Ichnowski | 2026-03-09 | cs

CAVER: Curious Audiovisual Exploring Robot

The paper introduces CAVER, a novel robot equipped with a specialized audio-exciting end-effector, a combined audiovisual representation, and a curiosity-driven exploration algorithm that efficiently learns object correlations to improve material classification and audio-only imitation tasks.

Luca Macesanu, Boueny Folefack, Samik Singh, Ruchira Ray, Ben Abbatematteo, Roberto Martín-Martín | 2026-03-09 | cs

Contact-Safe Reinforcement Learning with ProMP Reparameterization and Energy Awareness

This paper proposes a contact-safe reinforcement learning framework that combines Proximal Policy Optimization with movement primitives and an energy-aware Cartesian impedance controller to generate robust, safe, and energy-efficient task-space trajectories for complex contact-rich manipulation in 3D environments.

Bingkun Huang, Yuhe Gong, Zewen Yang, Tianyu Ren, Luis Figueredo | 2026-03-09 | cs

Symmetry-Breaking in Multi-Agent Navigation: Winding Number-Aware MPC with a Learned Topological Strategy

This paper introduces WNumMPC, a hierarchical multi-agent navigation framework that combines a reinforcement learning-based planner and a model-based controller to resolve symmetry-induced deadlocks in dense environments by leveraging topological winding numbers for robust, communication-free coordination.

Tomoki Nakao, Kazumi Kasaura, Tadashi Kozuno | 2026-03-09 | cs

Bi-AQUA: Bilateral Control-Based Imitation Learning for Underwater Robot Arms via Lighting-Aware Action Chunking with Transformers

Bi-AQUA is a novel bilateral control-based imitation learning framework for underwater robot arms that integrates transformer-based action chunking with explicit lighting modeling to achieve robust performance in challenging, variable illumination conditions.

Takeru Tsunoori, Masato Kobayashi, Yuki Uranishi | 2026-03-09 | cs

EchoVLA: Synergistic Declarative Memory for VLA-Driven Mobile Manipulation

EchoVLA is a memory-enhanced Vision-Language-Action model for mobile manipulation that combines scene and episodic declarative memories to improve navigation and task performance; it is validated on the new MoMani benchmark and shows significant gains over existing baselines in both simulation and real-world settings.

Min Lin, Xiwen Liang, Bingqian Lin, Liu Jingzhi, Zijian Jiao, Kehan Li, Yu Sun, Weijia Liufu, Yuhan Ma, Yuecheng Liu, Shen Zhao, Yuzheng Zhuang, Xiaodan Liang | 2026-03-09 | cs

Safe Autonomous Lane Changing: Planning with Dynamic Risk Fields and Time-Varying Convex Space Generation

This paper proposes a novel autonomous lane-changing planning framework that integrates dynamic risk fields with time-varying convex feasible spaces and a constrained iLQR solver to achieve safe, efficient, and comfortable trajectories that outperform traditional methods in complex traffic scenarios.

Yijun Lu, Zhihao Lin, Zhen Tian | 2026-03-09 | cs

Dependent Reachable Sets for the Constant Bearing Pursuit Strategy

This paper introduces the concept of dependent reachable sets for two-agent pursuit scenarios, characterizing their geometric bounds and shape through theoretical analysis and simulations of the constant bearing pursuit strategy.

Venkata Ramana Makkapati, Tulasi Ram Vechalapu, Vinodhini Comandur, Seth Hutchinson | 2026-03-09 | math

XR-DT: Extended Reality-Enhanced Digital Twin for Safe Motion Planning via Human-Aware Model Predictive Path Integral Control

This paper introduces XR-DT, an Extended Reality-enhanced Digital Twin framework that integrates a novel Human-Aware Model Predictive Path Integral (HA-MPPI) controller with an attention-based trajectory prediction model to enable safe, efficient, and interpretable motion planning for mobile robots operating alongside humans.

Tianyi Wang, Jiseop Byeon, Ahmad Yehia, Yiming Xu, Jihyung Park, Tianyi Zeng, Sikai Chen, Ziran Wang, Junfeng Jiao, Christian Claudel | 2026-03-09 | cs.AI

Safe Model Predictive Diffusion with Shielding

This paper introduces Safe Model Predictive Diffusion (Safe MPD), a training-free planning framework that integrates a safety shield directly into the diffusion denoising process to generate kinodynamically feasible and safe trajectories in real time, outperforming existing methods in success rate and safety without requiring post-processing corrections.

Taekyung Kim, Keyvan Majd, Hideki Okamoto, Bardh Hoxha, Dimitra Panagou, Georgios Fainekos | 2026-03-09 | cs

SORS: A Modular, High-Fidelity Simulator for Soft Robots

This paper introduces SORS, a modular, high-fidelity simulator based on the finite element method and constrained nonlinear optimization that accurately models complex soft robot dynamics and contact interactions, effectively bridging the sim-to-real gap for prototyping and control optimization.

Manuel Mekkattu, Mike Y. Michelis, Robert K. Katzschmann | 2026-03-09 | cs

VISO: Robust Underwater Visual-Inertial-Sonar SLAM with Photometric Rendering for Dense 3D Reconstruction

This paper presents VISO, a robust underwater SLAM system that fuses stereo cameras, IMUs, and 3D sonar with novel calibration and photometric rendering techniques to achieve accurate 6-DoF localization and real-time, high-fidelity dense 3D reconstruction in challenging aquatic environments.

Shu Pan, Simon Archieri, Ahmet Cinar, Jonatan Scharff Willners, Ignacio Carlucho, Yvan Petillot | 2026-03-09 | cs
