UrbanHuRo: A Two-Layer Human-Robot Collaboration Framework for the Joint Optimization of Heterogeneous Urban Services

This paper proposes UrbanHuRo, a two-layer human-robot collaboration framework that jointly optimizes heterogeneous urban services like crowdsourced delivery and sensing through scalable order dispatch and deep reinforcement learning, achieving significant improvements in sensing coverage, courier income, and order timeliness.

Tonmoy Dey, Lin Jiang, Zheng Dong + 1 more2026-03-05🤖 cs.AI

Interaction-Aware Whole-Body Control for Compliant Object Transport

This paper presents a bio-inspired, interaction-oriented whole-body control framework that combines a trajectory-optimized reference generator with a reinforcement learning policy trained via asymmetric teacher-student distillation to enable assistive humanoids to maintain stable balance and compliant object transport in unstructured environments despite strong, time-varying interaction forces.

Hao Zhang, Yves Tseng, Ding Zhao + 1 more2026-03-05🤖 cs.AI

Cognition to Control - Multi-Agent Learning for Human-Humanoid Collaborative Transport

This paper introduces Cognition-to-Control (C2C), a three-layer hierarchical framework that bridges high-level deliberation and low-level execution for human-humanoid collaborative transport by integrating a VLM-based grounding layer, a decentralized multi-agent reinforcement learning coordination layer, and a whole-body control layer to achieve robust, stable, and adaptive joint manipulation.

Hao Zhang, Ding Zhao, H. Eric Tseng2026-03-05🤖 cs.AI

Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning

This paper demonstrates that large-scale pretrained Vision-Language-Action models exhibit remarkable resistance to catastrophic forgetting during continual learning, often achieving zero forgetting with simple experience replay and enabling rapid skill recovery through fine-tuning, a capability that fundamentally differs from smaller models trained from scratch.

Huihan Liu, Changyeon Kim, Bo Liu + 2 more2026-03-05🤖 cs.AI

SaFeR: Safety-Critical Scenario Generation for Autonomous Driving Test via Feasibility-Constrained Token Resampling

SaFeR is a novel framework for generating safety-critical autonomous driving test scenarios that balances adversarial criticality, physical feasibility, and behavioral realism by employing a Transformer-based realism prior with a differential attention mechanism and a feasibility-constrained token resampling strategy derived from offline reinforcement learning.

Jinlong Cui, Fenghua Liang, Guo Yang + 2 more2026-03-05🤖 cs.AI

Learning Hip Exoskeleton Control Policy via Predictive Neuromusculoskeletal Simulation

This paper presents a physics-based neuromusculoskeletal learning framework that trains a hip-exoskeleton control policy entirely in simulation using reinforcement learning and muscle-synergy priors, successfully transferring the policy to hardware without motion-capture data or additional tuning while achieving significant reductions in muscle activation and joint power across diverse walking conditions.

Ilseung Park, Changseob Song, Inseung Kang2026-03-05🤖 cs.LG

PRAM-R: A Perception-Reasoning-Action-Memory Framework with LLM-Guided Modality Routing for Adaptive Autonomous Driving

This paper introduces PRAM-R, a unified framework that leverages an LLM-guided router and hierarchical memory within an asynchronous dual-loop architecture to dynamically optimize sensor modality usage, significantly reducing computational costs and routing instability while maintaining high trajectory accuracy in autonomous driving.

Yi Zhang, Xian Zhang, Saisi Zhao + 4 more2026-03-05🤖 cs.AI

VANGUARD: Vehicle-Anchored Ground Sample Distance Estimation for UAVs in GPS-Denied Environments

This paper introduces VANGUARD, a lightweight geometric perception tool that enables LLM-based UAV agents operating in GPS-denied environments to accurately estimate Ground Sample Distance and recover metric scale by leveraging detected vehicles as environmental anchors, thereby significantly reducing spatial hallucinations and catastrophic failures compared to state-of-the-art vision-language models.

Yifei Chen, Xupeng Chen, Feng Wang + 2 more2026-03-05🤖 cs.AI

RoboCasa365: A Large-Scale Simulation Framework for Training and Benchmarking Generalist Robots

This paper introduces RoboCasa365, a large-scale simulation framework featuring 365 everyday tasks across 2,500 diverse kitchen environments and extensive human and synthetic demonstration data, designed to provide a reproducible benchmark for evaluating and advancing generalist robot policies through systematic analysis of task diversity, dataset scale, and environment variation.

Soroush Nasiriany, Sepehr Nasiriany, Abhiram Maddukuri + 1 more2026-03-05🤖 cs.AI