cs.MA papers | Gist.Science

XR-DT: Extended Reality-Enhanced Digital Twin for Safe Motion Planning via Human-Aware Model Predictive Path Integral Control

This paper introduces XR-DT, an Extended Reality-enhanced Digital Twin framework that integrates a novel Human-Aware Model Predictive Path Integral (HA-MPPI) controller with an attention-based trajectory prediction model to enable safe, efficient, and interpretable motion planning for mobile robots operating alongside humans.

Tianyi Wang, Jiseop Byeon, Ahmad Yehia, Yiming Xu, Jihyung Park, Tianyi Zeng, Sikai Chen, Ziran Wang, Junfeng Jiao, Christian ClaudelMon, 09 Ma🤖 cs.AI

KramaBench: A Benchmark for AI Systems on Data-to-Insight Pipelines over Data Lakes

The paper introduces KramaBench, a comprehensive benchmark featuring 104 real-world data-to-insight challenges across diverse domains, which reveals that current AI systems struggle to orchestrate end-to-end data pipelines over data lakes, achieving a maximum of only 55% accuracy despite strong performance in isolated tasks.

Eugenie Lai, Gerardo Vitagliano, Ziyu Zhang, Om Chabra, Sivaprasad Sudhir, Anna Zeng, Anton A. Zabreyko, Chenning Li, Ferdi Kossmann, Jialin Ding, Jun Chen, Markos Markakis, Matthew Russo, Weiyang Wang, Ziniu Wu, Michael J. Cafarella, Lei Cao, Samuel Madden, Tim KraskaMon, 09 Ma🤖 cs.AI

Aligning Compound AI Systems via System-level DPO

This paper introduces SysDPO, a framework that aligns complex, multi-component Compound AI Systems with human preferences by modeling them as Directed Acyclic Graphs and extending Direct Preference Optimization to overcome the challenges of non-differentiable interactions and the difficulty of translating system-level preferences to component levels.

Xiangwen Wang, Yibo Jacky Zhang, Zhoujie Ding, Katherine Tsai, Haolun Wu, Sanmi KoyejoMon, 09 Ma🤖 cs.AI

A Multi-Agent System Enables Versatile Information Extraction from the Chemical Literature

This paper presents a multimodal large language model-based multi-agent system that significantly outperforms existing state-of-the-art methods in automatically extracting structured chemical information from diverse and complex literature graphics, thereby advancing AI-driven chemical research.

Yufan Chen, Ching Ting Leung, Bowen Yu, Jianwei Sun, Yong Huang, Linyan Li, Hao Chen, Hanyu GaoMon, 09 Ma🤖 cs.AI

MASFactory: A Graph-centric Framework for Orchestrating LLM-Based Multi-Agent Systems with Vibe Graphing

This paper introduces MASFactory, a graph-centric framework that utilizes a human-in-the-loop "Vibe Graphing" approach to automatically compile natural language intents into executable multi-agent system workflows, thereby addressing challenges in manual implementation, component reuse, and context integration while demonstrating effectiveness across seven public benchmarks.

Yang Liu, Jinxuan Cai, Yishen Li, Qi Meng, Zedi Liu, Xin Li, Chen Qian, Chuan Shi, Cheng YangMon, 09 Ma🤖 cs.AI

Evaluating LLM Alignment With Human Trust Models

This paper presents a white-box analysis of the EleutherAI/gpt-j-6B model, demonstrating through contrastive prompting that its internal representation of trust aligns most closely with the Castelfranchi socio-cognitive model, thereby validating the feasibility of using LLM activation spaces to analyze socio-cognitive constructs and inform human-AI collaboration.

Anushka Debnath, Stephen Cranefield, Bastin Tony Roy Savarimuthu, Emiliano LoriniMon, 09 Ma🤖 cs.AI

RACAS: Controlling Diverse Robots With a Single Agentic System

The paper introduces RACAS, a robot-agnostic agentic system that uses natural language communication between LLM/VLM-based modules to control diverse robotic platforms without requiring code modifications or retraining, successfully demonstrating its effectiveness across wheeled, multi-jointed, and underwater robots.

Dylan R. Ashley, Jan Przepióra, Yimeng Chen, Ali Abualsaud, Nurzhan Yesmagambet, Shinkyu Park, Eric Feron, Jürgen SchmidhuberMon, 09 Ma🤖 cs.AI

Talk Freely, Execute Strictly: Schema-Gated Agentic AI for Flexible and Reproducible Scientific Workflows

This paper proposes a schema-gated agentic AI architecture that resolves the trade-off between conversational flexibility and execution determinism in scientific workflows by enforcing machine-checkable specifications as mandatory execution boundaries, a solution validated through multi-model LLM scoring of 20 existing systems.

Joel Strickland, Arjun Vijeta, Chris Moores, Oliwia Bodek, Bogdan Nenchev, Thomas Whitehead, Charles Phillips, Karl Tassenberg, Gareth Conduit, Ben PellegriniMon, 09 Ma🤖 cs.AI

Conversational Demand Response: Bidirectional Aggregator-Prosumer Coordination through Agentic AI

This paper introduces Conversational Demand Response (CDR), a bidirectional coordination framework leveraging agentic AI to enable natural language interactions between aggregators and prosumers, thereby combining automated scalability with enhanced user transparency and agency to sustain residential demand response participation.

Reda El Makroum, Sebastian Zwickl-Bernhard, Lukas Kranzl, Hans AuerMon, 09 Ma🤖 cs.AI

Computing Evolutionarily Stable Strategies in Multiplayer Games

This paper introduces an algorithm capable of computing all evolutionarily stable strategies within nondegenerate normal-form games involving three or more players.

Sam Ganzfried2026-03-10📈 econ

Agent based decision making for Integrated Air Defense system

This paper proposes a fully autonomous, state-of-the-art Integrated Air Defense system utilizing two BDI-based agents that employ meta-level plan reasoning to automatically detect threats, assess risks, and allocate weapons without manual intervention, thereby advancing network-centric warfare capabilities.

Sumanta Kumar Das, Sumant Mukherjee2026-03-10💻 cs

RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform

The paper introduces RepoLaunch, an agent that automates the build and test pipeline for code repositories across any language and platform, thereby enabling fully automated creation of software engineering datasets for benchmarking and training coding agents.

Kenan Li, Rongzhi Li, Linghao Zhang + 17 more2026-03-06🤖 cs.LG

Competitive Multi-Operator Reinforcement Learning for Joint Pricing and Fleet Rebalancing in AMoD Systems

This paper introduces a multi-operator reinforcement learning framework that integrates discrete choice theory to model competitive dynamics in Autonomous Mobility-on-Demand systems, demonstrating that competition fundamentally alters learned pricing and fleet rebalancing strategies compared to monopolistic settings while maintaining robust convergence.

Emil Kragh Toft, Carolin Schmidt, Daniele Gammelli + 1 more2026-03-06🤖 cs.LG

MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty Consensus

MedCoRAG is an interpretable, end-to-end framework for hepatic disease diagnosis that combines hybrid evidence retrieval from knowledge graphs and clinical guidelines with a multi-agent collaborative reasoning process to emulate multidisciplinary consultation and outperform existing methods in accuracy and transparency.

Zheng Li, Jiayi Xu, Zhikai Hu + 4 more2026-03-06🤖 cs.AI

Jagarin: A Three-Layer Architecture for Hibernating Personal Duty Agents on Mobile

Jagarin is a three-layer mobile architecture comprising the DAWN urgency engine, ARIA email proxy, and ACE institutional protocol that enables personal AI agents to hibernate until contextually necessary, thereby resolving the conflict between battery efficiency and timely duty fulfillment without compromising privacy or requiring persistent background execution.

Ravi Kiran Kadaboina2026-03-06🤖 cs.AI

SCoUT: Scalable Communication via Utility-Guided Temporal Grouping in Multi-Agent Reinforcement Learning

SCoUT is a scalable multi-agent reinforcement learning framework that improves coordination by dynamically grouping agents into latent clusters for efficient value estimation and employing counterfactual advantages to precisely guide decentralized communication decisions on when and whom to message.

Manav Vora, Gokul Puthumanaillam, Hiroyasu Tsukamoto + 1 more2026-03-06🤖 cs.AI

LLM-Guided Decentralized Exploration with Self-Organizing Robot Teams

This paper proposes a decentralized exploration framework for large robot swarms that combines a self-organizing algorithm for dynamic team formation with a novel large language model-based strategy for autonomous target selection, validated through extensive simulations.

Hiroaki Kawashima, Shun Ikejima, Takeshi Takai + 2 more2026-03-06💻 cs

Memory as Ontology: A Constitutional Memory Architecture for Persistent Digital Citizens

This paper challenges the prevailing view of AI memory as a mere functional tool by proposing the "Memory-as-Ontology" paradigm and its associated "Animesis" system, which prioritizes identity continuity and governance over retrieval performance to enable persistent digital citizens that survive model transitions.

Zhenghui Li2026-03-06💻 cs

iAgentBench: Benchmarking Sensemaking Capabilities of Information-Seeking Agents on High-Traffic Topics

This paper introduces iAgentBench, a dynamic open-domain question answering benchmark designed to evaluate the cross-source sensemaking capabilities of information-seeking agents on high-traffic topics by requiring the integration of evidence from multiple sources rather than simple retrieval.

Preetam Prabhu Srikar Dammu, Arnav Palkhiwala, Tanya Roosta + 1 more2026-03-06💻 cs

Strategic Interactions in Multi-Level Stackelberg Games with Non-Follower Agents and Heterogeneous Leaders

This paper proposes a novel three-level Stackelberg game framework that incorporates heterogeneous leaders and non-follower agents to accurately model congestion-coupled strategic interactions, demonstrating that ignoring non-participating agents leads to distorted equilibrium predictions in systems like EV charging markets.

Niloofar Aminikalibar, Farzaneh Farhadi, Maria Chli2026-03-06💻 cs

← Previous Next →