cs.AI papers | Gist.Science

Sensitivity-Guided Framework for Pruned and Quantized Reservoir Computing Accelerators

This paper presents a sensitivity-guided framework for compressing Reservoir Computing accelerators that systematically balances quantization and pruning to significantly improve hardware efficiency and reduce power consumption on FPGAs while maintaining high model accuracy across various time-series tasks.

Atousa Jafari, Mahdi Taheri, Hassan Ghasemzadeh Mohammadi, Christian Herglotz, Marco Platzner2026-03-11🤖 cs.AI

Architectural Design and Performance Analysis of FPGA based AI Accelerators: A Comprehensive Review

This paper reviews FPGA-based AI accelerators for deep learning, highlighting their advantages over ASICs and GPUs, detailing key hardware optimization techniques such as loop pipelining and quantization, and analyzing state-of-the-art designs to identify challenges for future innovations.

Soumita Chatterjee, Sudip Ghosh, Tamal Ghosh, Hafizur Rahaman2026-03-11🤖 cs.AI

Zipage: Maintain High Request Concurrency for LLM Reasoning through Compressed PagedAttention

The paper introduces Zipage, an LLM inference engine utilizing Compressed PagedAttention to combine token-wise KV cache eviction with PagedAttention, achieving over 2.1 $\times$ speedup in high-concurrency reasoning tasks while maintaining approximately 95% of the performance of full KV inference.

Mengqi Liao, Lu Wang, Chaoyun Zhang, Bo Qiao, Si Qin, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Huaiyu Wan2026-03-11🤖 cs.AI

Diagnosing FP4 inference: a layer-wise and block-wise sensitivity analysis of NVFP4 and MXFP4

This paper presents a systematic layer-wise and block-wise sensitivity analysis of NVFP4 and MXFP4 quantization across three Qwen2.5 model scales, revealing that MLP up- and down-projection layers are the most sensitive components while sensitivity patterns vary by format and model depth rather than being confined to final blocks.

Musa Cim, Burak Topcu, Mahmut Taylan Kandemir2026-03-11🤖 cs.AI

Permutation-Equivariant 2D State Space Models: Theory and Canonical Architecture for Multivariate Time Series

This paper introduces the Variable-Invariant Two-Dimensional State Space Model (VI 2D SSM) and its unified VI 2D Mamba architecture, which theoretically establish and implement a permutation-equivariant framework for multivariate time series that eliminates artificial variable ordering to achieve state-of-the-art performance with improved structural scalability.

Seungwoo Jeong, Heung-Il Suk2026-03-11🤖 cs.AI

Hindsight Credit Assignment for Long-Horizon LLM Agents

The paper introduces HCAPO, a novel framework that enhances long-horizon LLM agents by leveraging hindsight reasoning to refine step-level Q-values and employing a multi-scale advantage mechanism to address sparse reward challenges, thereby significantly outperforming state-of-the-art methods like GRPO on benchmarks such as WebShop and ALFWorld.

Hui-Ze Tan, Xiao-Wen Yang, Hao Chen, Jie-Jing Shao, Yi Wen, Yuteng Shen, Weihong Luo, Xiku Du, Lan-Zhe Guo, Yu-Feng Li2026-03-11🤖 cs.AI

Turn: A Language for Agentic Computation

This paper introduces **Turn**, a compiled, actor-based programming language that enhances agentic software by integrating LLM inference as a typed primitive with schema validation, confidence-based control flow, isolated actor contexts, capability-based identity, and compile-time schema absorption to enforce critical safety and state invariants at the language level.

Muyukani Kizito2026-03-11🤖 cs.AI

Generalized Reduction to the Isotropy for Flexible Equivariant Neural Fields

This paper introduces a principled method to reduce $G$ -invariant functions on product spaces $X \times M$ to $H$ -invariant functions on $X$ alone, where $H$ is the isotropy subgroup of $M$ , thereby enabling flexible Equivariant Neural Fields to handle arbitrary group actions and heterogeneous product spaces without structural constraints.

Alejandro García-Castellanos, Gijs Bellaard, Remco Duits, Daniel Pelt, Erik J Bekkers2026-03-11🤖 cs.AI

EDMFormer: Genre-Specific Self-Supervised Learning for Music Structure Segmentation

The paper introduces EDMFormer, a transformer model trained on a newly released dataset of 98 professionally annotated EDM tracks (EDM-98) to address the limitations of existing music segmentation methods by leveraging genre-specific energy, rhythm, and timbre features for improved structure detection in Electronic Dance Music.

Sahal Sajeer, Krish Patel, Oscar Chung, Joel Song Bae2026-03-11🤖 cs.AI

Clear, Compelling Arguments: Rethinking the Foundations of Frontier AI Safety Cases

This paper critiques the current limitations of alignment-focused safety cases for frontier AI by drawing on established methodologies from safety-critical industries to propose a more robust, defensible framework, illustrated through a case study on deceptive alignment and CBRN capabilities.

Shaun Feakins, Ibrahim Habli, Phillip Morgan2026-03-11🤖 cs.AI

Multi-level meta-reinforcement learning with skill-based curriculum

This paper proposes a multi-level meta-reinforcement learning framework that systematically compresses Markov decision processes into hierarchical structures with skill-based curriculum learning to decouple sub-tasks, reduce stochasticity, and enable efficient transfer of skills across different problems and levels.

Sichen Yang (Johns Hopkins University), Mauro Maggioni (Johns Hopkins University)2026-03-11🤖 cs.AI

Large Language Model-Assisted Superconducting Qubit Experiments

This paper introduces a large language model (LLM) framework that automates the control and measurement of superconducting qubits by dynamically generating and invoking tools based on a knowledge base, thereby enabling rapid deployment of standard protocols and the flexible implementation of novel experimental procedures.

Shiheng Li, Jacob M. Miller, Phoebe J. Lee, Gustav Andersson, Christopher R. Conner, Yash J. Joshi, Bayan Karimi, Amber M. King, Howard L. Malc, Harsh Mishra, Hong Qiao, Minseok Ryu, Xuntao Wu, Siyuan Xing, Haoxiong Yan, Jian Shi, Andrew N. Cleland2026-03-11⚛️ quant-ph

Test-Driven AI Agent Definition (TDAD): Compiling Tool-Using Agents from Behavioral Specifications

This paper introduces Test-Driven AI Agent Definition (TDAD), a methodology that compiles tool-using LLM agents from behavioral specifications by iteratively refining prompts against executable tests, thereby ensuring measurable behavioral compliance and robustness against silent regressions through mechanisms like hidden test splits and semantic mutation testing.

Tzafrir Rehan2026-03-11🤖 cs.AI

Scale-Plan: Scalable Language-Enabled Task Planning for Heterogeneous Multi-Robot Teams

Scale-Plan is a scalable framework that leverages large language models to filter irrelevant perceptual information and construct compact, task-relevant representations from natural language instructions, thereby enabling efficient and reliable long-horizon planning for heterogeneous multi-robot teams while outperforming existing baselines on the new MAT2-THOR benchmark.

Piyush Gupta, Sangjae Bae, Jiachen Li, David Isele2026-03-11🤖 cs.AI

Beyond Relevance: On the Relationship Between Retrieval and RAG Information Coverage

This paper empirically demonstrates that coverage-based retrieval metrics serve as reliable early indicators of information coverage in RAG-generated responses, particularly when retrieval objectives align with generation goals, across diverse text and multimodal benchmarks.

Saron Samuel, Alexander Martin, Eugene Yang, Andrew Yates, Dawn Lawrie, Ian Soborof, Laura Dietz, Benjamin Van Durme2026-03-11🤖 cs.AI

Fish Audio S2 Technical Report

This paper introduces Fish Audio S2, an open-source text-to-speech system that leverages a multi-stage training pipeline to enable multi-speaker, multi-turn generation with natural-language instruction following, while providing production-ready weights and an efficient SGLang-based inference engine.

Shijia Liao, Yuxuan Wang, Songting Liu, Yifan Cheng, Ruoyi Zhang, Tianyu Li, Shidong Li, Yisheng Zheng, Xingwei Liu, Qingzheng Wang, Zhizhuo Zhou, Jiahua Liu, Xin Chen, Dawei Han2026-03-11🤖 cs.AI

Are Expressive Encoders Necessary for Discrete Graph Generation?

This paper introduces GenGNN, a modular message-passing framework that demonstrates expressive neural backbones like transformers are not strictly necessary for discrete graph generation, as diffusion models using GenGNN achieve competitive validity and superior inference speed on various datasets.

Jay Revolinsky, Harry Shomer, Jiliang Tang2026-03-11🤖 cs.AI

MASEval: Extending Multi-Agent Evaluation from Models to Systems

MASEval introduces a framework-agnostic library that shifts multi-agent evaluation from a model-centric to a system-centric approach, demonstrating through extensive experiments that implementation decisions regarding topology and orchestration impact performance as significantly as model selection.

Cornelius Emde, Alexander Rubinstein, Anmol Goel, Ahmed Heakl, Sangdoo Yun, Seong Joon Oh, Martin Gubri2026-03-11🤖 cs.AI

A Lightweight Multi-Cancer Tumor Localization Framework for Deployable Digital Pathology

The paper presents MuCTaL, a lightweight multi-cancer tumor localization framework trained on four cancer types that achieves high performance on training data and demonstrates generalization to unseen tumor types, enabling scalable deployment for digital pathology applications.

Brian Isett, Rebekah Dadey, Aofei Li, Ryan C. Augustin, Kate Smith, Aatur D. Singhi, Qiangqiang Gu, Riyue Bao2026-03-11🤖 cs.AI

LDP: An Identity-Aware Protocol for Multi-Agent LLM Systems

This paper introduces the LLM Delegate Protocol (LDP), an AI-native communication framework that enhances multi-agent system efficiency and governance by exposing model identity and reasoning profiles as first-class primitives, demonstrating significant reductions in latency and token usage alongside improved security and recovery capabilities in experimental evaluations.

Sunil Prakash2026-03-11🤖 cs.AI

← Previous Next →