SO3UFormer: Learning Intrinsic Spherical Features for Rotation-Robust Panoramic Segmentation

SO3UFormer addresses the failure of standard panoramic segmentation models under 3D rotations by introducing a rotation-robust architecture that learns intrinsic spherical features through gravity-independent representations, quadrature-consistent attention, and gauge-aware positional encoding, achieving superior stability on the proposed Pose35 benchmark compared to existing state-of-the-art methods.

Qinfeng Zhu, Yunxi Jiang, Lei Fan2026-02-27💻 cs

Chain of Flow: A Foundational Generative Framework for ECG-to-4D Cardiac Digital Twins

This paper introduces Chain of Flow (COF), a foundational generative framework that reconstructs patient-specific 4D cardiac anatomy and motion from single-cycle 12-lead ECGs by integrating cine-CMR data, thereby transforming cardiac digital twins from task-specific predictors into fully manipulable virtual hearts for diverse clinical simulations.

Haofan Wu, Nay Aung, Theodoros N. Arvanitis + 3 more2026-02-27💻 cs

WaterVideoQA: ASV-Centric Perception and Rule-Compliant Reasoning via Multi-Modal Agents

This paper introduces WaterVideoQA, a large-scale video question answering benchmark for all-waterway environments, and NaviMind, a multi-agent neuro-symbolic system that enables Autonomous Surface Vessels to transition from passive perception to regulation-compliant, interpretable cognitive reasoning through adaptive semantic routing and self-reflective verification.

Runwei Guan, Shaofeng Liang, Ningwei Ouyang + 9 more2026-02-27💻 cs

Can Agents Distinguish Visually Hard-to-Separate Diseases in a Zero-Shot Setting? A Pilot Study

This pilot study evaluates the zero-shot capabilities of multimodal large language model agents in distinguishing visually confounded diseases like melanoma versus atypical nevus and pulmonary edema versus pneumonia, finding that a proposed multi-agent framework with contrastive adjudication improves accuracy and reduces unsupported claims, though performance remains insufficient for immediate clinical deployment.

Zihao Zhao, Frederik Hauke, Juliana De Castilhos + 2 more2026-02-27💻 cs

An automatic counting algorithm for the quantification and uncertainty analysis of the number of microglial cells trainable in small and heterogeneous datasets

This paper proposes a flexible, non-parametric automatic kernel counter that enables accurate microglial cell counting and uncertainty estimation in small, heterogeneous datasets by bypassing traditional cell detection in favor of a tailored feature extraction and single hyper-parameter training approach.

L. Martino, M. M. Garcia, P. S. Paradas + 1 more2026-02-27⚡ eess

Small Object Detection Model with Spatial Laplacian Pyramid Attention and Multi-Scale Features Enhancement in Aerial Images

This paper proposes an enhanced small object detection model for aerial images that integrates a Spatial Laplacian Pyramid Attention module to highlight local regions, a Multi-Scale Feature Enhancement Module to improve semantic representation, and deformable convolutions to align features within the Feature Pyramid Network, demonstrating superior performance on the VisDrone and DOTA datasets.

Zhangjian Ji, Huijia Yan, Shaotong Qiao + 2 more2026-02-27💻 cs

Locally Adaptive Decay Surfaces for High-Speed Face and Landmark Detection with Event Cameras

This paper introduces Locally Adaptive Decay Surfaces (LADS), a novel event representation that dynamically modulates temporal decay based on local signal dynamics to overcome the limitations of fixed-parameter methods, thereby achieving state-of-the-art face detection and landmark localization accuracy at high frequencies while enabling the use of lighter network architectures.

Paul Kielty, Timothy Hanley, Peter Corcoran2026-02-27💻 cs