Pareto-Guided Optimization for Uncertainty-Aware Medical Image Segmentation

This paper proposes a Pareto-guided optimization framework for medical image segmentation that employs a region-wise curriculum strategy and a fuzzy labeling mechanism to prioritize learning from certain regions, thereby stabilizing gradients and guiding the model toward Pareto-optimal solutions that outperform traditional methods in handling boundary ambiguity.

Jinming Zhang, Youpeng Yang, Xi Yang + 5 more2026-02-25💻 cs

DVLA-RL: Dual-Level Vision-Language Alignment with Reinforcement Learning Gating for Few-Shot Learning

The paper proposes DVLA-RL, a novel few-shot learning framework that leverages reinforcement learning gating to dynamically integrate progressive dual-level vision-language alignments—ranging from fine-grained attributes to holistic descriptions generated by large language models—thereby achieving state-of-the-art performance across diverse benchmarks.

Wenhao Li, Xianjing Meng, Qiangchang Wang + 3 more2026-02-25💻 cs

Ecological mapping with geospatial foundation models

This study systematically evaluates geospatial foundation models (Prithvi-EO-2.0 and TerraMind) for ecological mapping, demonstrating their consistent superiority over traditional baselines across forest trait estimation, land cover mapping, and peatland detection while highlighting the critical importance of dataset alignment and high-resolution inputs for optimal performance.

Craig Mahlasi, Gciniwe S. Baloyi, Zaheed Gaffoor + 6 more2026-02-25💻 cs

DriveMamba: Task-Centric Scalable State Space Model for Efficient End-to-End Autonomous Driving

DriveMamba proposes a task-centric, scalable state space model for efficient end-to-end autonomous driving that replaces the sequential Transformer-based paradigm with a unified Mamba decoder featuring linear-complexity operators and bidirectional trajectory-guided scanning to overcome information loss, cumulative errors, and computational inefficiencies in handling spatiotemporal inputs.

Haisheng Su, Wei Wu, Feixiang Song + 3 more2026-02-25💻 cs

Probability-Invariant Random Walk Learning on Gyral Folding-Based Cortical Similarity Networks for Alzheimer's and Lewy Body Dementia Diagnosis

This paper proposes a probability-invariant random walk framework that classifies individualized gyral folding-based cortical similarity networks without requiring explicit node alignment, thereby overcoming anatomical heterogeneity to achieve robust diagnosis of Alzheimer's disease and Lewy body dementia.

Minheng Chen, Tong Chen, Chao Cao + 4 more2026-02-25🧬 q-bio

TraceVision: Trajectory-Aware Vision-Language Model for Human-Like Spatial Understanding

The paper introduces TraceVision, a novel end-to-end vision-language model that integrates trajectory-aware spatial understanding through a Trajectory-aware Visual Perception module and a specialized training pipeline, achieving state-of-the-art performance in tasks like captioning, localization, and segmentation by simulating human visual attention trajectories.

Fan Yang, Shurong Zheng, Hongyin Zhao + 5 more2026-02-25💻 cs

VISION-ICE: Video-based Interpretation and Spatial Identification of Arrhythmia Origins via Neural Networks in Intracardiac Echocardiography

This paper proposes VISION-ICE, an AI framework utilizing 3D Convolutional Neural Networks to analyze intracardiac echocardiography videos for automated, real-time localization of arrhythmia origins, achieving 66.2% accuracy and demonstrating potential to streamline electrophysiological procedures.

Dorsa EPMoghaddam, Feng Gao, Drew Bernard + 3 more2026-02-25🤖 cs.LG

Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation

OptimusVLA is a dual-memory augmented Vision-Language-Action model that enhances robotic manipulation efficiency and robustness by replacing isotropic noise with a global prior memory for faster inference and incorporating a local consistency memory to ensure temporal coherence, achieving superior performance across simulation and real-world benchmarks compared to state-of-the-art baselines.

Zaijing Li, Bing Hu, Rui Shao + 5 more2026-02-25🤖 cs.AI