Act Like a Pathologist: Tissue-Aware Whole Slide Image Reasoning

This paper introduces HistoSelect, a question-guided, coarse-to-fine retrieval framework that mimics pathologists' human-like scanning behavior to efficiently identify relevant tissue regions and informative patches in gigapixel whole slide images, thereby significantly reducing computational costs while improving accuracy and interpretability in pathology visual question answering.

Wentao Huang, Weimin Lyu, Peiliang Lou + 8 more2026-03-03💻 cs

Specializing Foundation Models via Mixture of Low-Rank Experts for Comprehensive Head CT Analysis

This paper introduces the Mixture of Low-Rank Experts (MoLRE) framework, a parameter-efficient fine-tuning method that significantly enhances the performance of diverse foundation models on comprehensive multi-label head CT diagnosis by employing specialized low-rank adapters and unsupervised soft routing without requiring explicit pathology supervision.

Youngjin Yoo, Han Liu, Bogdan Georgescu + 14 more2026-03-03💻 cs

CoLC: Communication-Efficient Collaborative Perception with LiDAR Completion

The paper proposes CoLC, a communication-efficient collaborative perception framework that leverages LiDAR completion techniques—specifically Foreground-Aware Point Sampling, Completion-Enhanced Early Fusion, and Dense-Guided Dual Alignment—to restore scene completeness from sparse transmissions and achieve superior perception-communication trade-offs while remaining robust to model heterogeneity.

Yushan Han, Hui Zhang, Qiming Xia + 2 more2026-03-03💻 cs

STMI: Segmentation-Guided Token Modulation with Cross-Modal Hypergraph Interaction for Multi-Modal Object Re-Identification

This paper proposes STMI, a novel multi-modal object Re-Identification framework that integrates segmentation-guided feature modulation, semantic token reallocation, and cross-modal hypergraph interaction to enhance foreground representation, preserve discriminative cues, and capture high-order semantic relationships while mitigating background noise.

Xingguo Xu, Zhanyu Liu, Weixiang Zhou + 5 more2026-03-03💻 cs

A Reconstruction System for Industrial Pipeline Inner Walls Using Panoramic Image Stitching with Endoscopic Imaging

This paper presents an industrial pipeline inner wall reconstruction system that utilizes panoramic image stitching and polar coordinate transformation on endoscopic video to generate comprehensive planar panoramic images, thereby significantly improving the efficiency and accuracy of defect detection compared to traditional frame-by-frame review methods.

Rui Ma, Yifeng Wang, Ziteng Yang + 1 more2026-03-03💻 cs

UniHM: Unified Dexterous Hand Manipulation with Vision Language Model

UniHM introduces a unified framework for dexterous hand manipulation that leverages a shared tokenizer for diverse hand morphologies and a vision-language action model trained on human-object interactions to generate physically feasible, human-like manipulation sequences from open-vocabulary language instructions without requiring extensive real-world teleoperation data.

Zhenhao Zhang, Jiaxin Liu, Ye Shi + 1 more2026-03-03💻 cs

Neural Functional Alignment Space: Brain-Referenced Representation of Artificial Neural Networks

This paper introduces the Neural Functional Alignment Space (NFAS), a brain-referenced framework that characterizes diverse artificial neural networks by modeling their layer-wise dynamics via Dynamic Mode Decomposition and projecting them into a biologically anchored coordinate system to reveal structured modality-specific clustering and cross-modal convergence.

Ruiyu Yan, Hanqi Jiang, Yi Pan + 4 more2026-03-03💻 cs