Act, Think or Abstain: Complexity-Aware Adaptive Inference for Vision-Language-Action Models

This paper proposes a complexity-aware adaptive inference framework for Vision-Language-Action models that dynamically routes execution to "Act," "Think," or "Abstain" based on task complexity. It leverages a vision-only detector to optimize resource allocation and prevent failures, achieving high accuracy with minimal training data.

Riccardo Andrea Izzo, Gianluca Bardaro, Matteo Matteucci | 2026-03-06 | 💻 cs

Mario: Multimodal Graph Reasoning with Large Language Models

The paper proposes Mario, a unified framework that enhances large language model-based reasoning on multimodal graphs. It employs a graph-conditioned vision-language model for cross-modal feature refinement and a modality-adaptive instruction tuning mechanism that dynamically selects optimal modality configurations, outperforming existing state-of-the-art methods on node classification and link prediction tasks.

Yuanfu Sun, Kang Li, Pengkang Guo + 2 more | 2026-03-06 | 💻 cs

Semantic Class Distribution Learning for Debiasing Semi-Supervised Medical Image Segmentation

The paper proposes the Semantic Class Distribution Learning (SCDL) framework, a plug-and-play module that mitigates supervision and representation biases in semi-supervised medical image segmentation by learning structured class-conditional feature distributions, thereby achieving state-of-the-art performance with significant improvements on minority classes.

Yingxue Su, Yiheng Zhong, Keying Zhu + 5 more | 2026-03-06 | 💻 cs

SPyCer: Semi-Supervised Physics-Guided Contextual Attention for Near-Surface Air Temperature Estimation from Satellite Imagery

The paper introduces SPyCer, a semi-supervised physics-guided deep learning framework that generates accurate, spatially continuous estimates of near-surface air temperature from satellite imagery. It applies physical constraints derived from surface energy balance and advection-diffusion-reaction equations, and outperforms existing methods in both accuracy and physical consistency.

Sofiane Bouaziz, Adel Hafiane, Raphael Canals + 1 more | 2026-03-06 | 🤖 cs.AI