cs.AI papers | Gist.Science

Agentic Neurosymbolic Collaboration for Mathematical Discovery: A Case Study in Combinatorial Design

This paper presents a neurosymbolic collaboration between an LLM-powered agent, symbolic computation tools, and human researchers that successfully discovered and formally verified a new tight lower bound on the imbalance of Latin squares for the case $n \equiv 1 \pmod{3}$, demonstrating the potential of AI-human partnerships in pure mathematical discovery.

Hai Xia, Carla P. Gomes, Bart Selman, Stefan Szeider | 2026-03-10 | 🔢 math

EndoSERV: A Vision-based Endoluminal Robot Navigation System

EndoSERV is a novel vision-based navigation system for endoluminal robots that overcomes challenges like tissue deformation and label scarcity by combining segment-to-structure odometry with real-to-virtual transfer learning to achieve accurate localization without requiring real-world pose labels.

Junyang Wu, Fangfang Xie, Minghui Zhang, Hanxiao Zhang, Jiayuan Sun, Yun Gu, Guang-Zhong Yang | 2026-03-10 | 💻 cs

SPD-RAG: Sub-Agent Per Document Retrieval-Augmented Generation

SPD-RAG is a hierarchical multi-agent framework that improves scalability and answer quality for complex cross-document queries by assigning dedicated agents to process individual documents and synthesizing their outputs through a token-bounded coordinator, achieving superior performance on the LOONG benchmark with significantly reduced API costs compared to standard RAG and full-context baselines.

Yagiz Can Akay, Muhammed Yusuf Kartal, Esra Alparslan, Faruk Ortakoyluoglu, Arda Akpinar | 2026-03-10 | 💬 cs.CL

Detecting Fake Reviewer Groups in Dynamic Networks: An Adaptive Graph Learning Method

The paper proposes DS-DGA-GCN, an adaptive graph learning model that integrates diversity- and similarity-aware dynamic graph attention with a Network Feature Scoring system to effectively detect organized fake reviewer groups in dynamic networks, achieving state-of-the-art performance on real-world datasets.

Jing Zhang, Ke Huang, Yao Zhang, Bin Guo, Zhiwen Yu | 2026-03-10 | 💻 cs

Electrocardiogram Classification with Transformers Using Koopman and Wavelet Features

This paper demonstrates that while wavelet features excel in binary ECG classification, a transformer-based model utilizing Koopman operator features derived from an optimized Extended Dynamic Mode Decomposition (EDMD) with a radial basis function dictionary achieves superior performance in multi-class ECG classification, outperforming both wavelet-only and hybrid approaches.

Sucheta Ghosh, Zahra Monfared | 2026-03-10 | 🤖 cs.LG

Towards plausibility in time series counterfactual explanations

This paper introduces a gradient-based optimization method for generating time series counterfactual explanations that ensures plausibility by integrating soft-DTW alignment with k-nearest neighbors, resulting in valid and temporally realistic outputs that outperform existing approaches in distributional alignment.

Marcin Kostrzewa, Krzysztof Galus, Maciej Zięba | 2026-03-10 | 🤖 cs.LG

Computational modeling of early language learning from acoustic speech and audiovisual input without linguistic priors

This chapter reviews recent computational models demonstrating that self-supervised and visually grounded learning principles can effectively explain early language acquisition from acoustic and audiovisual speech without relying on strong linguistic priors.

Okko Räsänen | 2026-03-10 | 💬 cs.CL

M$^3$-ACE: Rectifying Visual Perception in Multimodal Math Reasoning via Multi-Agentic Context Engineering

The paper proposes M3-ACE, a multi-agentic context engineering framework that rectifies inaccurate visual perception in multimodal math reasoning by decoupling perception from reasoning and employing collaborative agents with specialized tools to dynamically refine visual evidence, thereby achieving state-of-the-art performance on benchmarks like MathVision.

Peijin Xie, Zhen Xu, Bingquan Liu, Baoxun Wang | 2026-03-10 | 💻 cs

A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation

This paper proposes the Hierarchical Error-Corrective Graph Framework (HECG) for autonomous agents, which integrates Multi-Dimensional Transferable Strategy (MDTS) for precise candidate selection, Error Matrix Classification (EMC) for structured failure attribution, and Causal-Context Graph Retrieval (CCGR) for enhanced contextual reasoning to improve execution reliability in complex, multi-step tasks.

Cong Cao, Jingyao Zhang, Kun Tong | 2026-03-10 | 💻 cs

Revealing Behavioral Plasticity in Large Language Models: A Token-Conditional Perspective

This paper introduces Token-Conditioned Reinforcement Learning (ToCoRL), a framework that leverages the intrinsic behavioral plasticity of Large Language Models to internalize and stabilize inference-time adaptations, enabling precise control over behavioral modes like switching from reasoning to direct answering without degrading overall capabilities.

Liyuan Mao, Le Yu, Jing Zhou, Chujie Zheng, Bowen Yu, Chang Gao, Shixuan Liu, An Yang, Weinan Zhang, JunYang Lin | 2026-03-10 | 🤖 cs.LG

A Recipe for Stable Offline Multi-agent Reinforcement Learning

This paper identifies value-scale amplification as the primary cause of instability in non-linear value decomposition for offline multi-agent reinforcement learning and proposes a scale-invariant value normalization technique to stabilize training, ultimately providing a practical recipe to unlock the full potential of offline MARL.

Dongsu Lee, Daehee Lee, Amy Zhang | 2026-03-10 | 🤖 cs.LG

Aligning to Illusions: Choice Blindness in Human and AI Feedback

This paper challenges the stability of human and AI preferences in Reinforcement Learning from Human Feedback (RLHF) by demonstrating that both are susceptible to "choice blindness," where preferences are easily manipulated by context and shallow cues, leading to undetected reward signal corruption and downstream policy degradation.

Wenbin Wu | 2026-03-10 | 💬 cs.CL

Geometrically Constrained Outlier Synthesis

This paper introduces Geometrically Constrained Outlier Synthesis (GCOS), a training-time framework that generates virtual outliers in the feature space by respecting in-distribution manifold structures and using conformal shells to improve out-of-distribution detection robustness and provide formal error guarantees.

Daniil Karzanov, Marcin Detyniecki | 2026-03-10 | 🤖 cs.LG

Human-Aware Robot Behaviour in Self-Driving Labs

This paper proposes an AI-driven perception method with hierarchical human intention prediction to enable mobile robot chemists in self-driving laboratories to proactively distinguish between human preparatory actions and transient interactions, thereby overcoming the inefficiencies of passive obstruction detection and streamlining human-robot coordination in shared-access scenarios.

Satheeshkumar Veeramani, Anna Kisil, Abigail Bentley, Hatem Fakhruldeen, Gabriella Pizzuto, Andrew I. Cooper | 2026-03-10 | 💻 cs

SYNAPSE: Framework for Neuron Analysis and Perturbation in Sequence Encoding

The paper introduces SYNAPSE, a systematic, training-free framework that analyzes and stress-tests Transformer models by extracting layer representations and applying forward-hook interventions to reveal domain-independent internal organization, functional stability through redundant neuron subsets, and specific vulnerabilities to small manipulations.

Jesús Sánchez Ochoa, Enrique Tomás Martínez Beltrán, Alberto Huertas Celdrán | 2026-03-10 | 🤖 cs.LG

IronEngine: Towards General AI Assistant

This paper introduces IronEngine, a general AI assistant platform featuring a unified orchestration core and a three-phase pipeline that integrates diverse backends, adaptive memory, and extensive tooling to achieve high task completion rates while separating planning quality from execution capability.

Xi Mo | 2026-03-10 | 🤖 cs.LG

One Model Is Enough: Native Retrieval Embeddings from LLM Agent Hidden States

This paper proposes a method to equip LLM agents with native retrieval capabilities by projecting their hidden states directly into the embedding space via a lightweight head, thereby eliminating the need for a separate embedding model while retaining 97% of baseline retrieval quality.

Bo Jiang | 2026-03-10 | 💬 cs.CL

Efficient Policy Learning with Hybrid Evaluation-Based Genetic Programming for Uncertain Agile Earth Observation Satellite Scheduling

This paper proposes a Hybrid Evaluation-based Genetic Programming (HE-GP) framework that dynamically switches between exact and approximate evaluation modes within an Online Scheduling Algorithm to efficiently solve the Uncertain Agile Earth Observation Satellite Scheduling Problem, achieving significant computational cost reductions while maintaining superior scheduling performance compared to existing methods.

Junhua Xue, Yuning Chen | 2026-03-10 | 💻 cs

A prospective clinical feasibility study of a conversational diagnostic AI in an ambulatory primary care clinic

This prospective feasibility study demonstrates that a conversational AI system (AMIE) can safely and effectively conduct clinical history-taking and generate diagnostic suggestions in a real-world urgent care setting, achieving high patient satisfaction and diagnostic accuracy comparable to primary care providers while requiring no real-time human intervention.

Peter Brodeur, Jacob M. Koshy, Anil Palepu, Khaled Saab, Ava Homiar, Roma Ruparel, Charles Wu, Ryutaro Tanno, Joseph Xu, Amy Wang, David Stutz, Hannah M. Ferrera, David Barrett, Lindsey Crowley, Jihyeon Lee, Spencer E. Rittner, Ellery Wulczyn, Selena K. Zhang, Elahe Vedadi, Christine G. Kohn, Kavita Kulkarni, Vinay Kadiyala, Sara Mahdavi, Wendy Du, Jessica Williams, David Feinbloom, Renee Wong, Tao Tu, Petar Sirkovic, Alessio Orlandi, Christopher Semturs, Yun Liu, Juraj Gottweis, Dale R. Webster, Joëlle Barral, Katherine Chou, Pushmeet Kohli, Avinatan Hassidim, Yossi Matias, James Manyika, Rob Fields, Jonathan X. Li, Marc L. Cohen, Vivek Natarajan, Mike Schaekermann, Alan Karthikesalingam, Adam Rodman | 2026-03-10 | 🤖 cs.LG

LycheeCluster: Efficient Long-Context Inference with Structure-Aware Chunking and Hierarchical KV Indexing

LycheeCluster is a novel KV cache management method that employs structure-aware chunking and hierarchical indexing to transform cache retrieval into a logarithmic-time process, achieving up to a 3.6x inference speedup with minimal performance degradation for long-context LLMs.

Dongfang Li, Zixuan Liu, Gang Lin, Baotian Hu, Min Zhang | 2026-03-10 | 🤖 cs.LG

