Do Foundation Models Know Geometry? Probing Frozen Features for Continuous Physical Measurement

This paper demonstrates that frozen vision-language model features contain rich, continuous geometric information that outperforms text-based outputs by 3.3x, revealing that the accuracy bottleneck stems from training objectives and autoregressive generation rather than representational limitations, as evidenced by high-precision linear probes and consistent performance across diverse encoder architectures.

Yakov Pyotr Shkolnikov2026-03-09🤖 cs.AI

PONTE: Personalized Orchestration for Natural Language Trustworthy Explanations

PONTE is a human-in-the-loop framework that enhances the reliability and personalization of Explainable AI narratives by modeling user adaptation as a closed-loop process involving preference modeling, grounded generation, and iterative verification to overcome the limitations of one-size-fits-all approaches and Large Language Model hallucinations.

Vittoria Vineis, Matteo Silvestri, Lorenzo Antonelli, Filippo Betello, Gabriele Tolomei2026-03-09🤖 cs.AI

Artificial Intelligence for Detecting Fetal Orofacial Clefts and Advancing Medical Education

This paper presents an artificial intelligence system trained on over 45,000 ultrasound images that achieves diagnostic accuracy comparable to senior radiologists for fetal orofacial clefts, significantly enhances junior radiologists' performance when used as a copilot, and accelerates clinical expertise development for rare conditions.

Yuanji Zhang, Yuhao Huang, Haoran Dou, Xiliang Zhu, Chen Ling, Zhong Yang, Lianying Liang, Jiuping Li, Siying Liang, Rui Li, Yan Cao, Yuhan Zhang, Jiewei Lai, Yongsong Zhou, Hongyu Zheng, Xinru Gao, Cheng Yu, Liling Shi, Mengqin Yuan, Honglong Li, Xiaoqiong Huang, Chaoyu Chen, Jialin Zhang, Wenxiong Pan, Alejandro F. Frangi, Guangzhi He, Xin Yang, Yi Xiong, Linliang Yin, Xuedong Deng, Dong Ni2026-03-09🤖 cs.AI

RAMoEA-QA: Hierarchical Specialization for Robust Respiratory Audio Question Answering

RAMoEA-QA is a hierarchically routed generative model that employs a two-stage conditional specialization mechanism—combining an Audio Mixture-of-Experts for acoustic encoding and a Language Mixture-of-Adapters for query intent—to achieve state-of-the-art robustness and accuracy in respiratory audio question answering across diverse devices, environments, and task shifts.

Gaia A. Bertolino, Yuwei Zhang, Tong Xia, Domenico Talia, Cecilia Mascolo2026-03-09🤖 cs.AI

LiveSense: A Real-Time Wi-Fi Sensing Platform for Range-Doppler on COTS Laptop

LiveSense is a cross-platform system that transforms commercial off-the-shelf Wi-Fi 6E/7 laptops into real-time, centimeter-accurate Range-Doppler sensors capable of extracting synchronized channel state information and performing on-device signal processing to detect distance, velocity, and micro-motions while maintaining simultaneous communication.

Jessica Sanson, Rahul C. Shah, Maximilian Pinaroc, Cagri Tanriover, Valerio Frascolla2026-03-09🤖 cs.AI

Boosting deep Reinforcement Learning using pretraining with Logical Options

This paper proposes Hybrid Hierarchical RL (H^2RL), a two-stage framework that leverages logical option-based pretraining to inject symbolic structure into deep reinforcement learning agents, effectively mitigating reward misalignment and improving long-horizon decision-making while outperforming existing neural, symbolic, and neuro-symbolic baselines.

Zihan Ye, Phil Chau, Raban Emunds, Jannis Blüml, Cedric Derstroff, Quentin Delfosse, Oleg Arenz, Kristian Kersting2026-03-09🤖 cs.AI

Fly360: Omnidirectional Obstacle Avoidance within Drone View

This paper introduces Fly360, a lightweight two-stage perception-decision pipeline that enables drones with panoramic views to achieve stable, omnidirectional obstacle avoidance by converting RGB observations into depth maps and employing a fixed random-yaw training strategy, outperforming traditional forward-view baselines in both simulation and real-world scenarios.

Xiangkai Zhang, Dizhe Zhang, WenZhuo Cao, Zhaoliang Wan, Yingjie Niu, Lu Qi, Xu Yang, Zhiyong Liu2026-03-09🤖 cs.AI

Motion Illusions Generated Using Predictive Neural Networks Also Fool Humans

This paper introduces the Evolutionary Illusion GENerator (EIGen), a generative model based on video predictive neural networks that creates new visual motion illusions, which are confirmed to fool human participants, thereby supporting the hypothesis that such illusions arise from the brain's predictive processing rather than raw visual input and highlighting the value of studying "motivated failures" in AI research.

Lana Sinapayen, Eiji Watanabe2026-03-06💻 cs

Deep Learning Meets Mechanism Design: Key Results and Some Novel Applications

This paper reviews the technical details and key results of using deep learning to design mechanisms that approximately satisfy conflicting properties like incentive compatibility and welfare maximization, demonstrating the approach's effectiveness through case studies in vehicular energy management, mobile network resource allocation, and agricultural procurement auctions.

V. Udaya Sankar, Vishisht Srihari Rao, Mayank Ratan Bhardwaj + 1 more2026-03-06💻 cs

Seeing Through Uncertainty: A Free-Energy Approach for Real-Time Perceptual Adaptation in Robust Visual Navigation

This paper introduces FEP-Nav, a biologically-inspired framework that enables robust real-time visual navigation by minimizing Variational Free Energy through a dual-mechanism architecture of top-down decoding and adaptive normalization, allowing autonomous agents to maintain performance under noisy and shifting sensory conditions without gradient-based updates.

Maytus Piriyajitakonkij, Rishabh Dev Yadav, Mingfei Sun + 2 more2026-03-06💻 cs

Path Planning for Masked Diffusion Model Sampling

This paper introduces Path Planning (P2), a novel inference sampling strategy for Masked Diffusion Models that decomposes generation into planning and denoising stages to enable iterative token refinement, thereby establishing a new expanded evidence lower bound and achieving state-of-the-art performance across diverse domains including protein sequences, RNA, math, storytelling, and code generation.

Fred Zhangzhi Peng, Zachary Bezemek, Sawan Patel + 5 more2026-03-06💻 cs