Grammar of the Wave: Towards Explainable Multivariate Time Series Event Detection via Neuro-Symbolic VLM Agents

This paper introduces a neuro-symbolic VLM agent framework called Knowledge-Guided TSED, which utilizes a novel Event Logic Tree representation to bridge natural language event descriptions with multivariate time series data, enabling accurate, zero-shot event detection and explainable reasoning while mitigating hallucinations in high-stakes domains.

Sky Chenwei Wan, Tianjun Hou, Yifei Wang, Xiqing Chang, Aymeric Jan2026-03-13🤖 cs.LG

INFACT: A Diagnostic Benchmark for Induced Faithfulness and Factuality Hallucinations in Video-LLMs

The paper introduces \textsc{INFACT}, a comprehensive diagnostic benchmark with 9,800 QA instances and fine-grained taxonomies that evaluates Video-LLMs on faithfulness and factuality under various induced degradation modes, revealing that high base accuracy does not guarantee robustness against hallucinations and that many models struggle significantly with temporal sensitivity.

Junqi Yang, Yuecong Min, Jie Zhang, Shiguang Shan, Xilin Chen2026-03-13🤖 cs.AI

SPEGC: Continual Test-Time Adaptation via Semantic-Prompt-Enhanced Graph Clustering for Medical Image Segmentation

The paper proposes SPEGC, a Continual Test-Time Adaptation framework for medical image segmentation that mitigates error accumulation and domain shift by integrating a semantic prompt enhancement mechanism with a differentiable graph clustering solver to refine structural representations and guide robust model adaptation.

Xiaogang Du, Jiawei Zhang, Tongfei Liu, Tao Lei, Yingbo Wang2026-03-13🤖 cs.AI

KEPo: Knowledge Evolution Poison on Graph-based Retrieval-Augmented Generation

This paper introduces KEPo, a novel poisoning attack method specifically designed to exploit the graph-based retrieval mechanism of GraphRAG systems by fabricating toxic knowledge evolution paths that manipulate the knowledge graph structure to force Large Language Models into generating harmful responses, thereby achieving state-of-the-art attack success rates where conventional RAG attacks fail.

Qizhi Chen, Chao Qi, Yihong Huang, Muquan Li, Rongzheng Wang, Dongyang Zhang, Ke Qin, Shuang Liang2026-03-13🤖 cs.LG

Gen-Fab: A Variation-Aware Generative Model for Predicting Fabrication Variations in Nanophotonic Devices

This paper introduces Gen-Fab, a variation-aware conditional generative adversarial network that accurately predicts diverse, high-resolution fabrication outcomes for nanophotonic devices by modeling process-induced uncertainties, outperforming existing deterministic and probabilistic baselines in both accuracy and distribution alignment.

Rambod Azimi, Yuri Grinberg, Dan-Xia Xu, Odile Liboiron-Ladouceur2026-03-13🤖 cs.AI

Multi-Agent Collaboration for Automated Design Exploration on High Performance Computing Systems

This paper introduces MADA, a Large Language Model-powered multi-agent framework that automates complex design workflows on High Performance Computing systems to iteratively refine and optimize scientific simulations, specifically demonstrated through the suppression of Richtmyer-Meshkov Instability in Inertial Confinement Fusion.

Harshitha Menon, Charles F. Jekel, Kevin Korner, Brian Gunnarson, Nathan K. Brown, Michael Stees, M. Giselle Fernandez-Godino, Walter Nissen, Meir H. Shachar, Dane M. Sterbentz, William J. Schill, Yue Hao, Robert Rieben, William Quadros, Steve Owen, Scott Mitchell, Ismael D. Boureima, Jonathan L. Belof2026-03-13🤖 cs.AI

FBCIR: Balancing Cross-Modal Focuses in Composed Image Retrieval

This paper introduces FBCIR, a method to diagnose and address focus imbalances in composed image retrieval models by identifying their tendency to over-attend to one modality, and proposes a data augmentation workflow with curated hard negatives to enforce balanced cross-modal reasoning and improve robustness in challenging scenarios.

Chenchen Zhao, Jianhuan Zhuo, Muxi Chen, Zhaohua Zhang, Wenyu Jiang, Tianwen Jiang, Qiuyong Xiao, Jihong Zhang, Qiang Xu2026-03-13🤖 cs.AI

EReCu: Pseudo-label Evolution Fusion and Refinement with Multi-Cue Learning for Unsupervised Camouflage Detection

The paper proposes EReCu, a unified unsupervised framework for camouflaged object detection that integrates a Multi-Cue Native Perception module, Pseudo-Label Evolution Fusion, and Local Pseudo-Label Refinement to overcome noisy label limitations and achieve state-of-the-art performance in detail perception and boundary alignment.

Shuo Jiang, Gaojia Zhang, Min Tan, Yufei Yin, Gang Pan2026-03-13🤖 cs.AI

Expert Threshold Routing for Autoregressive Language Modeling with Dynamic Computation Allocation and Load Balancing

This paper introduces Expert Threshold (ET) routing, a fully causal mechanism that dynamically allocates computation and balances load across experts without auxiliary losses by independently routing tokens based on score thresholds, thereby outperforming traditional Token-choice Mixture-of-Experts in autoregressive language modeling.

Hanchi Sun, Yixin Liu, Yonghui Wu, Lichao Sun2026-03-13🤖 cs.AI

RoboClaw: An Agentic Framework for Scalable Long-Horizon Robotic Tasks

RoboClaw is an agentic framework that unifies data collection, policy learning, and execution under a single VLM-driven controller using Entangled Action Pairs to enable self-resetting loops, thereby significantly improving the scalability and success rate of long-horizon robotic tasks while drastically reducing human intervention.

Ruiying Li, Yunlang Zhou, YuYao Zhu, Kylin Chen, Jingyuan Wang, Sukai Wang, Kongtao Hu, Minhui Yu, Bowen Jiang, Zhan Su, Jiayao Ma, Xin He, Yongjian Shen, Yangyang, Guanghui Ren, Maoqing Yao, Wenhao Wang, Yao Mu2026-03-13🤖 cs.AI

UtilityMax Prompting: A Formal Framework for Multi-Objective Large Language Model Optimization

This paper introduces UtilityMax Prompting, a formal framework that replaces ambiguous natural language instructions with mathematical influence diagrams and utility functions to guide Large Language Models toward explicitly maximizing expected utility, thereby achieving superior multi-objective optimization performance in tasks like movie recommendation compared to traditional prompting methods.

Ofir Marom2026-03-13💬 cs.CL