STAIRS-Former: Spatio-Temporal Attention with Interleaved Recursive Structure Transformer for Offline Multi-task Multi-agent Reinforcement Learning
STAIRS-Former is a transformer architecture for offline multi-task multi-agent reinforcement learning. It combines spatio-temporal attention, an interleaved recursive structure, and token dropout to handle varying agent populations and long-horizon dependencies, achieving state-of-the-art performance across diverse benchmarks.
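The abstract names three mechanisms without detailing them, so the following is only a rough numpy sketch of how they could fit together, not the paper's actual architecture: per-timestep attention over agents (spatial), per-agent attention over timesteps (temporal), whole-token dropout, and repeated application of the same block as the "recursive" structure. Every function name, shape convention, and the exact attention factorization here is an assumption.

```python
import numpy as np

rng = np.random.default_rng(0)

def attention(x):
    # Single-head scaled dot-product self-attention over rows of x: (n, d).
    d = x.shape[-1]
    scores = x @ x.T / np.sqrt(d)
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    w = np.exp(scores)
    w /= w.sum(axis=-1, keepdims=True)
    return w @ x

def spatio_temporal_block(tokens, drop_prob=0.0):
    # tokens: (T, N, d) -- T timesteps, N agents, d-dim features (assumed layout).
    T, N, _ = tokens.shape
    # Spatial attention: agents attend to each other within one timestep,
    # so the block works for any agent count N.
    spatial = np.stack([attention(tokens[t]) for t in range(T)])
    # Temporal attention: each agent attends over its own history.
    temporal = np.stack([attention(spatial[:, n]) for n in range(N)], axis=1)
    # Token dropout: zero out whole agent-timestep tokens (a training-time
    # regularizer; here purely illustrative).
    if drop_prob > 0:
        mask = rng.random((T, N, 1)) >= drop_prob
        temporal = temporal * mask
    return tokens + temporal  # residual connection

# "Interleaved recursive structure" read here as reapplying the same block.
x = rng.standard_normal((4, 3, 8))  # 4 timesteps, 3 agents, 8-dim tokens
out = x
for _ in range(2):
    out = spatio_temporal_block(out)
print(out.shape)  # (4, 3, 8)
```

Because the spatial step attends only within a timestep and the temporal step only within one agent's trajectory, the same parameters apply to any (T, N), which is one plausible way to "handle varying agent populations" as the abstract claims.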