Seed2Scale: A Self-Evolving Data Engine for Embodied AI via Small to Large Model Synergy and Multimodal Evaluation

Seed2Scale is a self-evolving data engine that overcomes data bottlenecks in embodied AI by synergizing a lightweight "SuperTiny" model for robust data collection with a large Vision-Language Model for autonomous quality verification, enabling a target model to achieve a 131.2% performance improvement starting from just four seed demonstrations.

Cong Tai, Zhaoyu Zheng, Haixu Long, Hansheng Wu, Zhengbin Long, Haodong Xiang, Rong Shi, Zhuo Cui, Shizhuang Zhang, Gang Qiu, He Wang, Ruifeng Li, Biao Liu, Zhenzhe Sun, Tao ShenTue, 10 Ma💻 cs

Hierarchical Multi-Modal Planning for Fixed-Altitude Sparse Target Search and Sampling

This paper introduces HIMoS, a hierarchical multi-modal planning framework that enables Autonomous Underwater Vehicles to efficiently search for and sample sparse benthic targets like coral colonies at a fixed altitude by integrating a global topological route optimizer with a local differentiable belief propagation planner, thereby outperforming traditional exhaustive and adaptive sampling strategies in high-fidelity simulations.

Lingpeng Chen, Yuchen Zheng, Apple Pui-Yi Chui, Junfeng Wu, Ziyang HongTue, 10 Ma💻 cs

PhaForce: Phase-Scheduled Visual-Force Policy Learning with Slow Planning and Fast Correction for Contact-Rich Manipulation

PhaForce is a phase-scheduled visuomotor policy that enhances contact-rich manipulation by coordinating a slow, vision-dominant diffusion planner with a fast, force-driven corrector to enable high-frequency, phase-aware residual corrections, achieving an 86% success rate and superior adaptability compared to existing baselines.

Mingxin Wang, Zhirun Yue, Renhao Lu, Yizhe Li, Zihan Wang, Guoping Pan, Kangkang Dong, Jun Cheng, Yi Cheng, Houde LiuTue, 10 Ma💻 cs

MoMaStage: Skill-State Graph Guided Planning and Closed-Loop Execution for Long-Horizon Indoor Mobile Manipulation

MoMaStage is a structured vision-language framework that enables robust long-horizon indoor mobile manipulation by guiding task planning through a topology-aware Skill-State Graph and ensuring execution reliability via a closed-loop mechanism that triggers semantic replanning upon detecting physical deviations, all without requiring explicit scene mapping.

Chenxu Li, Zixuan Chen, Yetao Li, Jiapeng Xu, Hongyu Ding, Jieqi Shi, Jing Huo, Yang GaoTue, 10 Ma💻 cs

Human-Aware Robot Behaviour in Self-Driving Labs

This paper proposes an AI-driven perception method with hierarchical human intention prediction to enable mobile robot chemists in self-driving laboratories to proactively distinguish between human preparatory actions and transient interactions, thereby overcoming the inefficiencies of passive obstruction detection and streamlining human-robot coordination in shared-access scenarios.

Satheeshkumar Veeramani, Anna Kisil, Abigail Bentley, Hatem Fakhruldeen, Gabriella Pizzuto, Andrew I. CooperTue, 10 Ma💻 cs

Tactile Recognition of Both Shapes and Materials with Automatic Feature Optimization-Enabled Meta Learning

This paper proposes the AFOP-ML framework, an automatic feature optimization-enabled prototypical network that achieves rapid few-shot tactile recognition of both shapes and materials with high accuracy and robustness against perturbations, effectively addressing the challenges of data scarcity and time-consuming training in robotic applications.

Hongliang Zhao, Wenhui Yang, Yang Chen, Zhuorui Wang, Baiheng Liu, Longhui QinTue, 10 Ma💻 cs

FoMo: A Multi-Season Dataset for Robot Navigation in Forêt Montmorency

The FoMo dataset presents a comprehensive, multi-season collection of over 64 km of diverse robot navigation data from a boreal forest, featuring significant environmental changes like heavy snow and vegetation growth to challenge and evaluate the robustness of state-of-the-art odometry and SLAM systems.

Matej Boxan, Gabriel Jeanson, Alexander Krawciw, Effie Daum, Xinyuan Qiao, Sven Lilge, Timothy D. Barfoot, François PomerleauTue, 10 Ma💻 cs

R2F: Repurposing Ray Frontiers for LLM-free Object Navigation

The paper proposes R2F, an LLM-free framework for zero-shot open-vocabulary object navigation that repurposes ray frontiers as direction-conditioned semantic hypotheses to achieve competitive performance with real-time execution, eliminating the latency and computational overhead of iterative large-model queries.

Francesco Argenziano, John Mark Alexis Marcelo, Michele Brienza, Abdel Hakim Drid, Emanuele Musumeci, Daniele Nardi, Domenico D. Bloisi, Vincenzo SurianiTue, 10 Ma💻 cs

LAR-MoE: Latent-Aligned Routing for Mixture of Experts in Robotic Imitation Learning

LAR-MoE is a two-stage framework that decouples unsupervised skill discovery from policy learning by regularizing expert routing to align with a learned latent representation, enabling robots to achieve high success rates in heterogeneous manipulation tasks without requiring manual skill annotations.

Ariel Rodriguez, Chenpan Li, Lorenzo Mazza, Rayan Younis, Ortrun Hellig, Sebastian Bodenstedt, Martin Wagner, Stefanie SpeidelTue, 10 Ma💻 cs

STRIDE: Structured Lagrangian and Stochastic Residual Dynamics via Flow Matching

The paper proposes STRIDE, a hybrid dynamics learning framework that combines a Lagrangian Neural Network for energy-consistent rigid-body mechanics with Conditional Flow Matching for stochastic residual interaction forces, achieving significant improvements in long-horizon prediction and contact force accuracy for robotic systems in unstructured environments.

Prakrut Kotecha, Ganga Nair B, Shishir KolathayaTue, 10 Ma🤖 cs.LG

An Open-Source Robotics Research Platform for Autonomous Laparoscopic Surgery

This paper introduces an open-source, robot-agnostic surgical robotics platform featuring a deterministic, closed-form RCM controller and full-stack ROS integration, which achieves sub-millimeter precision and expert-level trajectory smoothness in autonomous laparoscopic tasks across phantom, ex vivo, and in vivo porcine models.

Ariel Rodriguez, Lorenzo Mazza, Martin Lelis, Rayan Younis, Sebastian Bodenstedt, Martin Wagner, Stefanie SpeidelTue, 10 Ma💻 cs