Greedy-based Value Representation for Optimal Coordination in Multi-agent Reinforcement Learning

This paper proposes the Greedy-based Value Representation (GVR) method, which addresses the relative overgeneralization and optimal consistency issues in multi-agent reinforcement learning by transforming the optimal node into a unique self-transition through inferior target shaping and superior experience replay, thereby outperforming state-of-the-art baselines on various benchmarks.

Lipeng Wan, Zeyang Liu, Xingyu Chen + 2 more2026-03-05💻 cs

Learning Approximate Nash Equilibria in Cooperative Multi-Agent Reinforcement Learning via Mean-Field Subsampling

This paper proposes an alternating learning framework for cooperative multi-agent reinforcement learning under communication constraints, where a global agent observes only a subset of local agents, and proves that this approach converges to an approximate Nash equilibrium with improved sample complexity compared to methods operating on the full joint state space.

Emile Anand, Ishani Karmarkar2026-03-05🤖 cs.AI

HAMLET: A Hierarchical and Adaptive Multi-Agent Framework for Live Embodied Theatrics

This paper presents HAMLET, a hierarchical adaptive multi-agent framework that leverages large language models to autonomously generate and perform immersive, interactive theatrical experiences by combining narrative planning, persona-driven improvisation, and real-time embodied interactions with physical props, all evaluated by a specialized critic model.

Shufan Jiang, Sizhou Chen, Chi Zhang + 2 more2026-03-05🤖 cs.AI