Information Capacity: Evaluating the Efficiency of Large Language Models via Text Compression
This paper introduces "information capacity," a metric that evaluates the efficiency of large language models by measuring text compression performance relative to computational cost, while accounting for tokenizer efficiency. The metric enables accurate performance prediction across diverse models and reveals linguistic biases among them.
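The core idea of measuring a model's compression performance can be sketched in a few lines. The snippet below is a minimal illustration, not the paper's actual formula: it uses the standard arithmetic-coding bound, where encoding a token with predicted probability p costs -log2(p) bits, and then divides the resulting compression ratio by a compute budget. The function names (`compressed_bits`, `information_capacity`) and the normalization by `flops` are hypothetical stand-ins for whatever the paper defines.

```python
import math


def compressed_bits(token_logprobs):
    """Bits needed to encode a sequence under a model's predictions.

    `token_logprobs` are natural-log probabilities the model assigned
    to each observed token; the arithmetic-coding bound gives a cost
    of -log2(p) bits per token.
    """
    return sum(-lp / math.log(2) for lp in token_logprobs)


def information_capacity(text, token_logprobs, flops):
    """Hypothetical efficiency score: compression ratio per unit compute.

    The true metric in the paper also accounts for tokenizer
    efficiency; this sketch only captures the compression-vs-compute
    trade-off.
    """
    original_bits = 8 * len(text.encode("utf-8"))
    compression_ratio = original_bits / compressed_bits(token_logprobs)
    return compression_ratio / flops


# Toy example: a model that assigns p = 0.5 to each of 4 tokens
# compresses "abcd" (32 bits raw) into 4 bits, a ratio of 8.
logprobs = [math.log(0.5)] * 4
score = information_capacity("abcd", logprobs, flops=2.0)  # -> 4.0
```

A stronger compressor (higher token probabilities) or a cheaper model (fewer FLOPs) both raise the score, which matches the intuition of efficiency as "prediction quality per unit of compute."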