A Geometry-Based View of Mahalanobis OOD Detection

This paper reveals that the reliability of Mahalanobis-based out-of-distribution detection is highly dependent on the geometric properties of the feature space, specifically within-class spectral structure and local intrinsic dimensionality, and proposes a radially scaled 2\ell_2 normalization method that dynamically adjusts feature radii to optimize detection performance based on these geometric signals.

Denis Janiak, Jakub Binkowski, Tomasz Kajdanowicz2026-03-05🤖 cs.LG

Weakly Supervised Concept Learning with Class-Level Priors for Interpretable Medical Diagnosis

This paper introduces Prior-guided Concept Predictor (PCP), a weakly supervised framework that leverages class-level concept priors and regularization to enable reliable, interpretable medical diagnosis without costly concept annotations, significantly outperforming zero-shot baselines while matching fully supervised models.

Md Nahiduzzaman, Steven Korevaar, Alireza Bab-Hadiashar + 1 more2026-03-05💻 cs

VideoChat-M1: Collaborative Policy Planning for Video Understanding via Multi-Agent Reinforcement Learning

VideoChat-M1 introduces a novel multi-agent system for video understanding that employs a learnable Collaborative Policy Planning paradigm, where multiple agents dynamically generate, execute, and refine tool invocation strategies through interaction and multi-agent reinforcement learning to achieve state-of-the-art performance across diverse video benchmarks.

Boyu Chen, Zikang Wang, Zhengrong Yue + 9 more2026-03-05💻 cs

Tracing 3D Anatomy in 2D Strokes: A Multi-Stage Projection Driven Approach to Cervical Spine Fracture Identification

This paper presents an automated, multi-stage pipeline that identifies cervical spine fractures by fusing orthogonal 2D segmentations to estimate 3D volumes of interest, which are then analyzed using a 2.5D CNN-Transformer ensemble to achieve diagnostic performance comparable to expert radiologists while reducing computational dimensionality.

Fabi Nahian Madhurja, Rusab Sarmun, Muhammad E. H. Chowdhury + 3 more2026-03-05🤖 cs.AI

When Safety Collides: Resolving Multi-Category Harmful Conflicts in Text-to-Image Diffusion via Adaptive Safety Guidance

This paper proposes Conflict-aware Adaptive Safety Guidance (CASG), a training-free framework that dynamically identifies and applies category-specific safety directions to resolve harmful conflicts in text-to-image diffusion models, thereby significantly reducing overall harmful output rates compared to existing methods.

Yongli Xiang, Ziming Hong, Zhaoqing Wang + 3 more2026-03-05💻 cs