Prototype Perturbation for Relaxing Alignment Constraints in Backward-Compatible Learning

To address the trade-off between backward compatibility and discriminative power in retrieval model updates, this paper proposes a method that relaxes alignment constraints by introducing Neighbor-Driven and Optimization-Driven perturbations to old feature prototypes, enabling the new model to align with a pseudo-old feature space while preserving its ability to distinguish closely clustered classes.

Zikun Zhou, Yushuai Sun, Wenjie Pei, Xin Li, Yaowei Wang2026-03-10💻 cs

From 2D Alignment to 3D Plausibility: Unifying Heterogeneous 2D Priors and Penetration-Free Diffusion for Occlusion-Robust Two-Hand Reconstruction

This paper proposes a unified framework for occlusion-robust two-hand reconstruction that combines a fusion-alignment encoder to implicitly integrate heterogeneous 2D structural priors from vision foundation models with a penetration-free diffusion model that guides 3D pose generation toward collision-free, kinematically coherent interactions.

Gaoge Han, Yongkang Cheng, Zhe Chen, Shaoli Huang, Tongliang Liu2026-03-10💻 cs

LEL: Lipschitz Continuity Constrained Ensemble Learning for Efficient EEG-Based Intra-subject Emotion Recognition

This paper introduces Lipschitz continuity-constrained Ensemble Learning (LEL), a novel framework that enhances the stability, accuracy, and robustness of intra-subject EEG-based emotion recognition by enforcing Lipschitz constraints on Transformer components and utilizing a learnable ensemble fusion strategy, achieving superior performance on three public benchmark datasets.

Shengyu Gong, Yueyang Li, Zijian Kang, Bo Chai, Weiming Zeng, Hongjie Yan, Zhiguo Zhang, Wai Ting Siok, Nizhuan Wang2026-03-10💻 cs

GeoNav: Empowering MLLMs with dual-scale geospatial reasoning for language-goal aerial navigation

This paper introduces GeoNav, a multi-modal agent that enhances MLLMs for long-range aerial navigation by employing a three-phase, coarse-to-fine reasoning process supported by dual-scale spatial representations (a global cognitive map and a local scene graph) and a spatial chain-of-thought mechanism, achieving state-of-the-art performance on the CityNav benchmark.

Haotian Xu, Yue Hu, Chen Gao, Zhengqiu Zhu, Yong Zhao, Yong Li, Quanjun Yin2026-03-10💻 cs

Can LLM-Simulated Practice and Feedback Upskill Human Counselors? A Randomized Study with 90+ Novice Counselors

A randomized study with 94 novice counselors demonstrates that combining LLM-simulated practice with structured feedback significantly improves client-centered microskills and empathy, whereas practice alone fails to produce gains and may even lead to a decline in empathetic performance.

Ryan Louie, Raj Sanjay Shah, Ifdita Hasan Orney, Juan Pablo Pacheco, Emma Brunskill, Diyi Yang2026-03-10💻 cs

FoldNet: Learning Generalizable Closed-Loop Policy for Garment Folding via Keypoint-Driven Asset and Demonstration Synthesis

This paper presents FoldNet, a framework that addresses the challenge of robotic garment folding by generating a large-scale synthetic dataset through keypoint-driven asset and demonstration synthesis, and employs a keypoint-based KG-DAgger algorithm to train a robust closed-loop policy that achieves a 75% success rate in real-world tasks.

Yuxing Chen, Bowen Xiao, He Wang2026-03-10💻 cs