Decoupling Stability and Plasticity for Multi-Modal Test-Time Adaptation

This paper proposes Decoupling Adaptation for Stability and Plasticity (DASP), a novel framework that addresses negative transfer and catastrophic forgetting in multi-modal test-time adaptation by leveraging interdimensional redundancy to identify biased modalities and applying an asymmetric strategy that updates plastic components for biased data while preserving stable components for unbiased data.

Yongbo He, Zirun Guo, Tao Jin2026-03-03🤖 cs.AI

Exploring Spatiotemporal Feature Propagation for Video-Level Compressive Spectral Reconstruction: Dataset, Model and Benchmark

This paper addresses the limitations of existing image-based spectral reconstruction methods by introducing the first high-quality dynamic hyperspectral dataset (DynaSpec), a novel Propagation-Guided Spectral Video Reconstruction Transformer (PG-SVRT) model that leverages spatiotemporal feature propagation for superior video-level reconstruction, and a comprehensive benchmark for both simulation and real-world evaluation.

Lijing Cai, Zhan Shi, Chenglong Huang + 6 more2026-03-03💻 cs

Exploring 3D Dataset Pruning

This paper addresses the challenges of 3D dataset pruning caused by long-tail class distributions by formulating the problem as expected risk approximation and proposing a method that combines representation-aware subset selection with per-class retention quotas and prior-invariant teacher supervision to simultaneously improve Overall Accuracy and Mean Accuracy while enabling flexible trade-off control.

Xiaohan Zhao, Xinyi Shang, Jiacheng Liu + 1 more2026-03-03🤖 cs.LG

Act Like a Pathologist: Tissue-Aware Whole Slide Image Reasoning

This paper introduces HistoSelect, a question-guided, coarse-to-fine retrieval framework that mimics pathologists' human-like scanning behavior to efficiently identify relevant tissue regions and informative patches in gigapixel whole slide images, thereby significantly reducing computational costs while improving accuracy and interpretability in pathology visual question answering.

Wentao Huang, Weimin Lyu, Peiliang Lou + 8 more2026-03-03💻 cs

Specializing Foundation Models via Mixture of Low-Rank Experts for Comprehensive Head CT Analysis

This paper introduces the Mixture of Low-Rank Experts (MoLRE) framework, a parameter-efficient fine-tuning method that significantly enhances the performance of diverse foundation models on comprehensive multi-label head CT diagnosis by employing specialized low-rank adapters and unsupervised soft routing without requiring explicit pathology supervision.

Youngjin Yoo, Han Liu, Bogdan Georgescu + 14 more2026-03-03💻 cs

CoLC: Communication-Efficient Collaborative Perception with LiDAR Completion

The paper proposes CoLC, a communication-efficient collaborative perception framework that leverages LiDAR completion techniques—specifically Foreground-Aware Point Sampling, Completion-Enhanced Early Fusion, and Dense-Guided Dual Alignment—to restore scene completeness from sparse transmissions and achieve superior perception-communication trade-offs while remaining robust to model heterogeneity.

Yushan Han, Hui Zhang, Qiming Xia + 2 more2026-03-03💻 cs

STMI: Segmentation-Guided Token Modulation with Cross-Modal Hypergraph Interaction for Multi-Modal Object Re-Identification

This paper proposes STMI, a novel multi-modal object Re-Identification framework that integrates segmentation-guided feature modulation, semantic token reallocation, and cross-modal hypergraph interaction to enhance foreground representation, preserve discriminative cues, and capture high-order semantic relationships while mitigating background noise.

Xingguo Xu, Zhanyu Liu, Weixiang Zhou + 5 more2026-03-03💻 cs