cs.AI papers | Gist.Science

Bridging Discrete Marks and Continuous Dynamics: Dual-Path Cross-Interaction for Marked Temporal Point Processes

The paper introduces NEXTPP, a dual-channel framework that unifies discrete event marks and continuous-time dynamics through cross-interaction mechanisms to significantly improve the prediction of irregularly spaced event sequences with discrete marks.

Yuxiang Liu, Qiao Liu, Tong Luo, Yanglei Gan, Peng He, Yao Liu | 2026-03-13 | cs.LG

Stage-Adaptive Reliability Modeling for Continuous Valence-Arousal Estimation

The paper proposes SAGE, a stage-adaptive reliability modeling framework that dynamically calibrates audio-visual confidence based on interaction stages to improve continuous valence-arousal estimation in noisy real-world environments.

Yubeen Lee, Sangeun Lee, Junyeop Cha, Eunil Park | 2026-03-13 | cs.AI

Grammar of the Wave: Towards Explainable Multivariate Time Series Event Detection via Neuro-Symbolic VLM Agents

This paper introduces a neuro-symbolic VLM agent framework called Knowledge-Guided TSED, which utilizes a novel Event Logic Tree representation to bridge natural language event descriptions with multivariate time series data, enabling accurate, zero-shot event detection and explainable reasoning while mitigating hallucinations in high-stakes domains.

Sky Chenwei Wan, Tianjun Hou, Yifei Wang, Xiqing Chang, Aymeric Jan | 2026-03-13 | cs.LG

INFACT: A Diagnostic Benchmark for Induced Faithfulness and Factuality Hallucinations in Video-LLMs

The paper introduces INFACT, a comprehensive diagnostic benchmark with 9,800 QA instances and fine-grained taxonomies that evaluates Video-LLMs on faithfulness and factuality under various induced degradation modes, revealing that high base accuracy does not guarantee robustness against hallucinations and that many models struggle significantly with temporal sensitivity.

Junqi Yang, Yuecong Min, Jie Zhang, Shiguang Shan, Xilin Chen | 2026-03-13 | cs.AI

SPEGC: Continual Test-Time Adaptation via Semantic-Prompt-Enhanced Graph Clustering for Medical Image Segmentation

The paper proposes SPEGC, a Continual Test-Time Adaptation framework for medical image segmentation that mitigates error accumulation and domain shift by integrating a semantic prompt enhancement mechanism with a differentiable graph clustering solver to refine structural representations and guide robust model adaptation.

Xiaogang Du, Jiawei Zhang, Tongfei Liu, Tao Lei, Yingbo Wang | 2026-03-13 | cs.AI

OrthoEraser: Coupled-Neuron Orthogonal Projection for Concept Erasure

OrthoEraser is a novel concept erasure method for text-to-image models that utilizes sparse autoencoders and coupled-neuron detection to perform analytical orthogonal projection, effectively removing harmful content while preserving benign attributes by decoupling sensitive and non-sensitive feature subspaces.

Chuancheng Shi, Wenhua Wu, Fei Shen, Xiaogang Zhu, Kun Hu, Zhiyong Wang | 2026-03-13 | cs.AI

KEPo: Knowledge Evolution Poison on Graph-based Retrieval-Augmented Generation

This paper introduces KEPo, a poisoning attack that targets the graph-based retrieval mechanism of GraphRAG systems: it fabricates toxic knowledge evolution paths that manipulate the knowledge graph structure and force Large Language Models to generate harmful responses, achieving state-of-the-art attack success rates where conventional RAG attacks fail.

Qizhi Chen, Chao Qi, Yihong Huang, Muquan Li, Rongzheng Wang, Dongyang Zhang, Ke Qin, Shuang Liang | 2026-03-13 | cs.LG

Gen-Fab: A Variation-Aware Generative Model for Predicting Fabrication Variations in Nanophotonic Devices

This paper introduces Gen-Fab, a variation-aware conditional generative adversarial network that accurately predicts diverse, high-resolution fabrication outcomes for nanophotonic devices by modeling process-induced uncertainties, outperforming existing deterministic and probabilistic baselines in both accuracy and distribution alignment.

Rambod Azimi, Yuri Grinberg, Dan-Xia Xu, Odile Liboiron-Ladouceur | 2026-03-13 | cs.AI

Multi-Agent Collaboration for Automated Design Exploration on High Performance Computing Systems

This paper introduces MADA, a Large Language Model-powered multi-agent framework that automates complex design workflows on High Performance Computing systems to iteratively refine and optimize scientific simulations, specifically demonstrated through the suppression of Richtmyer-Meshkov Instability in Inertial Confinement Fusion.

Harshitha Menon, Charles F. Jekel, Kevin Korner, Brian Gunnarson, Nathan K. Brown, Michael Stees, M. Giselle Fernandez-Godino, Walter Nissen, Meir H. Shachar, Dane M. Sterbentz, William J. Schill, Yue Hao, Robert Rieben, William Quadros, Steve Owen, Scott Mitchell, Ismael D. Boureima, Jonathan L. Belof | 2026-03-13 | cs.AI

FBCIR: Balancing Cross-Modal Focuses in Composed Image Retrieval

This paper introduces FBCIR, a method to diagnose and address focus imbalances in composed image retrieval models by identifying their tendency to over-attend to one modality, and proposes a data augmentation workflow with curated hard negatives to enforce balanced cross-modal reasoning and improve robustness in challenging scenarios.

Chenchen Zhao, Jianhuan Zhuo, Muxi Chen, Zhaohua Zhang, Wenyu Jiang, Tianwen Jiang, Qiuyong Xiao, Jihong Zhang, Qiang Xu | 2026-03-13 | cs.AI

EReCu: Pseudo-label Evolution Fusion and Refinement with Multi-Cue Learning for Unsupervised Camouflage Detection

The paper proposes EReCu, a unified unsupervised framework for camouflaged object detection that integrates a Multi-Cue Native Perception module, Pseudo-Label Evolution Fusion, and Local Pseudo-Label Refinement to overcome noisy label limitations and achieve state-of-the-art performance in detail perception and boundary alignment.

Shuo Jiang, Gaojia Zhang, Min Tan, Yufei Yin, Gang Pan | 2026-03-13 | cs.AI

Expert Threshold Routing for Autoregressive Language Modeling with Dynamic Computation Allocation and Load Balancing

This paper introduces Expert Threshold (ET) routing, a fully causal mechanism that dynamically allocates computation and balances load across experts without auxiliary losses by independently routing tokens based on score thresholds, thereby outperforming traditional Token-choice Mixture-of-Experts in autoregressive language modeling.

Hanchi Sun, Yixin Liu, Yonghui Wu, Lichao Sun | 2026-03-13 | cs.AI

ReHARK: Refined Hybrid Adaptive RBF Kernels for Robust One-Shot Vision-Language Adaptation

ReHARK is a training-free framework that achieves state-of-the-art one-shot vision-language adaptation by addressing the stability-plasticity dilemma through a synergistic pipeline of hybrid prior construction, support set augmentation, adaptive distribution rectification, and multi-scale RBF kernels within a Reproducing Kernel Hilbert Space.

Md Jahidul Islam | 2026-03-13 | cs.AI

One Supervisor, Many Modalities: Adaptive Tool Orchestration for Autonomous Queries

This paper introduces an agentic AI framework in which a central Supervisor dynamically orchestrates specialized tools across text, image, audio, video, and document modalities, reducing response time, conversational rework, and cost while matching the accuracy of hierarchical baselines.

Mayank Saini, Arit Kumar Bishwas | 2026-03-13 | cs.CL

MANSION: Multi-floor lANguage-to-3D Scene generatIOn for loNg-horizon tasks

This paper introduces MANSION, a language-driven framework that generates realistic, multi-floor 3D building environments and the MansionWorld dataset to address the limitations of current single-floor benchmarks in evaluating long-horizon robotic tasks requiring complex spatial reasoning.

Lirong Che, Shuo Wen, Shan Huang, Chuang Wang, Yuzhe Yang, Gregory Dudek, Xueqian Wang, Jian Su | 2026-03-13 | cs.AI

RoboClaw: An Agentic Framework for Scalable Long-Horizon Robotic Tasks

RoboClaw is an agentic framework that unifies data collection, policy learning, and execution under a single VLM-driven controller using Entangled Action Pairs to enable self-resetting loops, thereby significantly improving the scalability and success rate of long-horizon robotic tasks while drastically reducing human intervention.

Ruiying Li, Yunlang Zhou, YuYao Zhu, Kylin Chen, Jingyuan Wang, Sukai Wang, Kongtao Hu, Minhui Yu, Bowen Jiang, Zhan Su, Jiayao Ma, Xin He, Yongjian Shen, Yangyang, Guanghui Ren, Maoqing Yao, Wenhao Wang, Yao Mu | 2026-03-13 | cs.AI

AI Knows What's Wrong But Cannot Fix It: Helicoid Dynamics in Frontier LLMs Under High-Stakes Decisions

This paper identifies and documents "helicoid dynamics," a failure regime in frontier LLMs where systems under high-stakes uncertainty recognize their own recurring errors yet continue to loop into them, prioritizing conversational comfort over reliability despite explicit protocols.

Alejandro R Jadad | 2026-03-13 | cs.AI

How Intelligence Emerges: A Minimal Theory of Dynamic Adaptive Coordination

This paper proposes a dynamical theory of adaptive coordination in multi-agent systems, demonstrating that intelligent behavior emerges from the structural coupling of agents, incentives, and a persistent environment through feedback loops, rather than from centralized optimization or rational expectations.

Stefano Grassi | 2026-03-13 | econ

UtilityMax Prompting: A Formal Framework for Multi-Objective Large Language Model Optimization

This paper introduces UtilityMax Prompting, a formal framework that replaces ambiguous natural language instructions with mathematical influence diagrams and utility functions to guide Large Language Models toward explicitly maximizing expected utility, thereby achieving superior multi-objective optimization performance in tasks like movie recommendation compared to traditional prompting methods.

Ofir Marom | 2026-03-13 | cs.CL

Toward Complex-Valued Neural Networks for Waveform Generation

This paper introduces ComVo, a complex-valued neural vocoder that utilizes native complex arithmetic, phase quantization, and an optimized block-matrix computation scheme to achieve higher synthesis quality and 25% faster training compared to existing real-valued iSTFT-based approaches.

Hyung-Seok Oh, Deok-Hyeon Cho, Seung-Bin Kim, Seong-Whan Lee | 2026-03-13 | cs.AI
