Towards Cognitive Defect Analysis in Active Infrared Thermography with Vision-Text Cues

This paper introduces a novel language-guided framework that leverages pretrained vision-language models and a specialized adapter to achieve zero-shot, generative detection and localization of subsurface defects in carbon fiber-reinforced polymers using active infrared thermography, thereby eliminating the need for costly, task-specific training datasets while significantly improving signal-to-noise ratios and detection accuracy.

Mohammed Salah, Eman Ouda, Giuseppe Dell'Avvocato, Fabrizio Sarasini, Ester D'Accardi, Jorge Dias, Davor Svetinovic, Stefano Sfarra, Yusra Abdulrahman2026-03-12⚡ eess

Adaptive RAN Slicing Control via Reward-Free Self-Finetuning Agents

This paper proposes a novel self-finetuning framework that enables Generative AI agents to autonomously learn continuous control for dynamic Radio Access Network slicing by distilling long-horizon experiences into model parameters via a bi-perspective reflection mechanism, thereby outperforming traditional Reinforcement Learning and standard LLM-based agents in sample efficiency and multi-objective optimization without relying on handcrafted reward signals.

Yuanhao Li, Haozhe Wang, Geyong Min, Nektarios Georgalas, Wang Miao2026-03-12🤖 cs.AI

CUAAudit: Meta-Evaluation of Vision-Language Models as Auditors of Autonomous Computer-Use Agents

This paper presents CUAAudit, a large-scale meta-evaluation demonstrating that while Vision-Language Models can serve as autonomous auditors for Computer-Use Agents with strong accuracy and calibration, their significant performance degradation in complex environments and notable inter-model disagreement reveal fundamental limitations that necessitate explicit accounting for evaluator reliability and uncertainty in real-world deployments.

Marta Sumyk, Oleksandr Kosovan2026-03-12🤖 cs.AI

Does LLM Alignment Really Need Diversity? An Empirical Study of Adapting RLVR Methods for Moral Reasoning

This paper empirically demonstrates that contrary to the hypothesis that moral reasoning alignment requires diversity-seeking algorithms, standard reward-maximizing RLVR methods are equally or more effective because high-reward moral responses exhibit a concentrated distribution in semantic space similar to logical reasoning tasks.

Zhaowei Zhang, Xiaohan Liu, Xuekai Zhu, Junchao Huang, Ceyao Zhang, Zhiyuan Feng, Yaodong Yang, Xiaoyuan Yi, Xing Xie2026-03-12🤖 cs.AI

Gradient Flow Drifting: Generative Modeling via Wasserstein Gradient Flows of KDE-Approximated Divergences

This paper establishes a mathematical framework called Gradient Flow Drifting that proves the equivalence between the recently proposed Drifting Model and the Wasserstein gradient flow of the forward KL divergence under KDE approximation, while extending the approach to a mixed-divergence strategy on Riemannian manifolds to simultaneously mitigate mode collapse and blurring.

Jiarui Cao, Zixuan Wei, Yuxin Liu2026-03-12🤖 cs.LG

Recover to Predict: Progressive Retrospective Learning for Variable-Length Trajectory Prediction

This paper proposes the Progressive Retrospective Framework (PRF), a plug-and-play method that utilizes a cascade of retrospective units and a rolling-start training strategy to effectively address the challenge of variable-length trajectory prediction in autonomous driving by progressively aligning features from incomplete observations with complete ones.

Hao Zhou, Lu Qi, Jason Li, Jie Zhang, Yi Liu, Xu Yang, Mingyu Fan, Fei Luo2026-03-12🤖 cs.AI

Trajectory-Informed Memory Generation for Self-Improving Agent Systems

This paper introduces a novel framework for self-improving LLM agents that automatically extracts structured learnings from execution trajectories—categorizing them into strategy, recovery, and optimization tips—and injects them via adaptive memory retrieval to significantly boost task completion rates, particularly on complex scenarios.

Gaodan Fang, Vatche Isahagian, K. R. Jayaram, Ritesh Kumar, Vinod Muthusamy, Punleuk Oum, Gegi Thomas2026-03-12🤖 cs.AI

Interleaving Scheduling and Motion Planning with Incremental Learning of Symbolic Space-Time Motion Abstractions

This paper proposes a novel framework that interleaves task scheduling and motion planning through an incremental learning loop, where symbolic feedback from motion feasibility checks guides the scheduler to generate efficient, collision-free plans for multi-object navigation in shared workspaces.

Elisa Tosello, Arthur Bit-Monnot, Davide Lusuardi, Alessandro Valentini, Andrea Micheli2026-03-12🤖 cs.AI

Emulating Clinician Cognition via Self-Evolving Deep Clinical Research

The paper introduces DxEvolve, a self-evolving diagnostic agent that emulates clinician cognition through an interactive deep clinical research workflow, autonomously requisitioning examinations and externalizing experience to achieve superior diagnostic accuracy and governed continual improvement compared to existing AI models.

Ruiyang Ren, Yuhao Wang, Yunsen Liang, Lan Luo, Jing Liu, Haifeng Wang, Cong Feng, Yinan Zhang, Chunyan Miao, Ji-Rong Wen, Wayne Xin Zhao2026-03-12🤖 cs.AI

A Platform-Agnostic Multimodal Digital Human Modelling Framework: Neurophysiological Sensing in Game-Based Interaction

This paper presents a platform-agnostic Digital Human Modelling framework that integrates OpenBCI Galea biosensing with a reproducible SuperTux game environment to provide structured, temporally aligned multimodal data for future AI-driven, ethics-approved research in accessibility and inclusive interaction design.

Daniel J. Buxton, Mufti Mahmud, Jordan J. Bird, Thomas Hughes-Roberts, David J. Brown2026-03-12🤖 cs.AI

Repurposing Backdoors for Good: Ephemeral Intrinsic Proofs for Verifiable Aggregation in Cross-silo Federated Learning

This paper proposes a lightweight, backdoor-based "Intrinsic Proofs" framework that ensures verifiable aggregation in cross-silo federated learning by embedding ephemeral verification signals into model parameters, thereby achieving high detection rates against malicious servers with over 1000x speedup compared to traditional cryptographic methods while preserving client anonymity and final model utility.

Xian Qin, Xue Yang, Xiaohu Tang2026-03-12🤖 cs.AI

EvoSchema: Towards Text-to-SQL Robustness Against Schema Evolution

This paper introduces EvoSchema, a comprehensive benchmark featuring a novel taxonomy of ten schema perturbation types to evaluate and enhance the robustness of text-to-SQL models against real-world database schema evolution, revealing that table-level changes significantly impact performance and demonstrating that training on diverse schema designs improves model resilience.

Tianshu Zhang, Kun Qian, Siddhartha Sahai, Yuan Tian, Shaddy Garg, Huan Sun, Yunyao Li2026-03-12💬 cs.CL