ReconDrive: Fast Feed-Forward 4D Gaussian Splatting for Autonomous Driving Scene Reconstruction

ReconDrive is a fast, feed-forward framework that adapts the VGGT foundation model with hybrid prediction heads and static-dynamic composition to achieve high-fidelity, scalable 4D Gaussian Splatting for autonomous driving scenes, outperforming existing feed-forward methods while matching the quality of slower optimization-based approaches.

Haibao Yu, Kuntao Xiao, Jiahang Wang, Ruiyang Hao, Yuxin Huang, Guoran Hu, Haifang Qin, Bowen Jing, Yuntian Bo, Ping LuoTue, 10 Ma💻 cs

Approximate Imitation Learning for Event-based Quadrotor Flight in Cluttered Environments

This paper proposes an Approximate Imitation Learning framework that enables a quadrotor to fly at high speeds through cluttered environments using only a single event camera by training an end-to-end neural network with a large offline dataset and lightweight state simulations, thereby avoiding the computational cost of rendering synthetic event data while achieving robust real-world performance.

Nico Messikommer, Jiaxu Xing, Leonard Bauersfeld, Marco Cannici, Elie Aljalbout, Davide ScaramuzzaTue, 10 Ma💻 cs

GeoLoco: Leveraging 3D Geometric Priors from Visual Foundation Model for Robust RGB-Only Humanoid Locomotion

GeoLoco is a robust, RGB-only humanoid locomotion framework that leverages geometric priors from a frozen Visual Foundation Model and a specialized cross-attention mechanism to achieve zero-shot sim-to-real transfer on the Unitree G1 without relying on active depth sensors.

Yufei Liu, Xieyuanli Chen, Hainan Pan, Chenghao Shi, Yanjie Chen, Kaihong Huang, Zhiwen Zeng, Huimin LuTue, 10 Ma💻 cs

Exoskeleton Control through Learning to Reduce Biological Joint Moments in Simulations

This paper presents a reinforcement learning framework for training exoskeleton controllers to reduce biological joint moments and establishes a quantitative validation pipeline that demonstrates strong simulation-to-data consistency in torque predictions, particularly at the hip, while identifying specific challenges in timing and power injection for sim-to-real transfer.

Zihang You, Xianlian ZhouTue, 10 Ma🤖 cs.LG

TempoFit: Plug-and-Play Layer-Wise Temporal KV Memory for Long-Horizon Vision-Language-Action Manipulation

TempoFit is a training-free, plug-and-play method that enhances frozen Vision-Language-Action policies for long-horizon manipulation by retrieving and injecting layer-wise temporal key-value memory from previous timesteps, thereby improving success rates in non-Markovian environments without increasing inference latency or requiring model retraining.

Jun Sun, Boyu Yang, Jiahao Zhang, Ning Ma, Chencheng Wu, Siqing Zhang, Yiou Huang, Qiufeng Wang, Shan Liang, Yaran ChenTue, 10 Ma💻 cs

AtomicVLA: Unlocking the Potential of Atomic Skill Learning in Robots

The paper proposes AtomicVLA, a unified planning-and-execution framework that utilizes a Skill-Guided Mixture-of-Experts architecture to dynamically compose atomic skill abstractions, thereby significantly improving scalability and performance in long-horizon robotic manipulation and continual learning tasks compared to existing monolithic VLA models.

Likui Zhang, Tao Tang, Zhihao Zhan, Xiuwei Chen, Zisheng Chen, Jianhua Han, Jiangtong Zhu, Pei Xu, Hang Xu, Hefeng Wu, Liang Lin, Xiaodan LiangTue, 10 Ma💻 cs

Multi-Agent Off-World Exploration for Sparse Evidence Discovery via Gaussian Belief Mapping and Dual-Domain Coverage

This paper proposes a multi-agent informative path planning framework for off-world exploration that utilizes Gaussian belief mapping and dual-domain coverage to effectively discover sparse, visually ambiguous evidence while balancing information gain with operational safety in hazardous, communication-constrained environments.

Zhuoran Qiao, Tianxin Hu, Thien-Minh Nguyen, Shenghai YuanTue, 10 Ma💻 cs

UniUncer: Unified Dynamic Static Uncertainty for End to End Driving

UniUncer is a lightweight, unified framework for end-to-end autonomous driving that jointly estimates and leverages uncertainty for both static map elements and dynamic agents through probabilistic regression, uncertainty-aware query fusion, and adaptive gating, thereby significantly improving trajectory accuracy and planning robustness with minimal computational overhead.

Yu Gao, Jijun Wang, Zongzheng Zhang, Anqing Jiang, Yiru Wang, Yuwen Heng, Shuo Wang, Hao Sun, Zhangfeng Hu, Hao ZhaoTue, 10 Ma💻 cs

C2^2-Explorer: Contiguity-Driven Task Allocation with Connectivity-Aware Task Representation for Decentralized Multi-UAV Exploration

C2^2-Explorer is a decentralized framework for multi-UAV exploration that addresses communication limitations and inefficient traversal by utilizing connectivity-aware task representation and a contiguity-driven allocation strategy, achieving significant reductions in exploration time and path length compared to state-of-the-art methods.

Xinlu Yan, Mingjie Zhang, Yuhao Fang, Yanke Sun, Jun Ma, Youmin Gong, Boyu Zhou, Jie MeiTue, 10 Ma💻 cs

AeroPlace-Flow: Language-Grounded Object Placement for Aerial Manipulators via Visual Foresight and Object Flow

This paper introduces AeroPlace-Flow, a training-free framework that enables aerial manipulators to perform precise, language-grounded object placement by synthesizing goal images, grounding them into 3D space, and generating collision-aware object flows without requiring predefined poses or task-specific training.

Sarthak Mishra, Rishabh Dev Yadav, Naveen Nair, Wei Pan, Spandan RoyTue, 10 Ma💻 cs

Residual Control for Fast Recovery from Dynamics Shifts

This paper proposes a stability-aligned residual control architecture that enables robotic systems to rapidly recover from mid-episode dynamics shifts by keeping the nominal policy frozen while using a bounded, gated additive residual channel to adaptively compensate for unobserved disturbances, achieving up to an 87% reduction in recovery time across various robotic platforms.

Nethmi Jayasinghe, Diana Gontero, Francesco Migliarba, Spencer T. Brown, Vinod K. Sangwan, Mark C. Hersam, Amit Ranjan TrivediTue, 10 Ma💻 cs

A Robust Antenna Provides Tactile Feedback in a Multi-legged Robot

This paper presents a multi-legged robot equipped with biomimetic, gradient-compliant tactile antennae that enable robust navigation and recovery in confined, obstacle-rich environments by mapping antenna deformation to collision states for real-time steering without relying on global environmental information or vision.

Zhaochen J. Xu, Juntao He, Delfin Aydan, Malaika Taylor, Tianyu Wang, Jianfeng Lin, Wesley Dyer, Daniel I. GoldmanTue, 10 Ma💻 cs