On the Strengths and Weaknesses of Data for Open-set Embodied Assistance

This paper investigates the generalization capabilities of a multimodal foundation model fine-tuned on diverse synthetic interactive data for the novel task of Open-Set Corrective Assistance, demonstrating that effective open-set assistive intelligence requires datasets encompassing multimodal grounding, defect inference, and exposure to varied scenarios.

Pradyumna Tambwekar, Andrew Silva, Deepak Gopinath + 3 more2026-03-06🤖 cs.AI

Task-Relevant and Irrelevant Region-Aware Augmentation for Generalizable Vision-Based Imitation Learning in Agricultural Manipulation

To address the generalization challenges of vision-based imitation learning in agricultural manipulation caused by scarce data and visual domain gaps, the authors propose DRAIL, a region-aware augmentation framework that separately augments task-relevant and task-irrelevant regions to train robust policies that rely on essential features rather than spurious background cues.

Shun Hattori, Hikaru Sasaki, Takumi Hachimine + 2 more2026-03-06💻 cs

VPWEM: Non-Markovian Visuomotor Policy with Working and Episodic Memory

This paper introduces VPWEM, a non-Markovian visuomotor policy that combines a sliding window of recent observations with a Transformer-based episodic memory compressor to efficiently retain long-term context for robotic control, achieving significant performance improvements over state-of-the-art baselines on memory-intensive manipulation tasks while maintaining constant computational costs.

Yuheng Lei, Zhixuan Liang, Hongyuan Zhang + 1 more2026-03-06🤖 cs.AI

Beyond the Patch: Exploring Vulnerabilities of Visuomotor Policies via Viewpoint-Consistent 3D Adversarial Object

This paper proposes a viewpoint-consistent 3D adversarial texture optimization method using differentiable rendering, Expectation over Transformation with a Coarse-to-Fine curriculum, and saliency-guided perturbations to effectively expose and exploit vulnerabilities in robot visuomotor policies under dynamic camera viewpoints.

Chanmi Lee, Minsung Yoon, Woojae Kim + 2 more2026-03-06💻 cs

U-OBCA: Uncertainty-Aware Optimization-Based Collision Avoidance via Wasserstein Distributionally Robust Chance Constraints

This paper introduces U-OBCA, an uncertainty-aware optimization-based collision avoidance framework that utilizes Wasserstein distributionally robust chance constraints to handle polygon-shaped robots and obstacles without geometric simplification, thereby significantly reducing conservatism and improving navigation efficiency in narrow, cluttered environments compared to existing methods.

Zehao Wang, Yuxuan Tang, Han Zhang + 2 more2026-03-06🔢 math

Integrated cooperative localization of heterogeneous measurement swarm: A unified data-driven method

This paper proposes a unified data-driven method for cooperative localization in heterogeneous robotic swarms that utilizes pairwise relative localization and a distributed pose-coupling strategy to achieve robust performance under weakly connected directed measurement topologies, overcoming the restrictive geometric requirements of existing approaches.

Kunrui Ze, Wei Wang, Guibin Sun + 3 more2026-03-06💻 cs