Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems in Minecraft

This paper proposes a novel dual-thread framework that enables concurrent planning and acting with interruptible execution for LLM-based multi-agent systems, overcoming the limitations of serialized paradigms to enhance real-time responsiveness and adaptability in dynamic environments like Minecraft.

Yaoru Li, Shunyu Liu, Tongya Zheng, Li Sun, Mingli Song

Published Tue, 10 Ma
📖 4 min read☕ Coffee break read

Imagine you are playing a video game like Minecraft with a team of friends. In a normal game, you all talk, decide what to do, and then act. But in the world of AI, most "smart" agents (robots powered by Large Language Models) work like a very slow, rigid team: one person thinks, then everyone stops and waits for them to finish talking before anyone can move.

If the game world changes while they are thinking (like a monster appearing or a bridge collapsing), the agent is stuck. It's like trying to drive a car while the driver is reading a map, and the car only moves when the driver puts the map down.

This paper introduces a new way to make AI teams work: Parallelized Planning-Acting. Here is the breakdown in simple terms:

1. The Problem: The "Stop-and-Go" Traffic Jam

Current AI systems work in Serial Mode.

  • Think: The AI stops everything to plan its next move.
  • Act: It executes that move.
  • Repeat: It stops again to plan the next move.

In a chaotic game like Minecraft, this is terrible. If a dragon attacks while the AI is planning, the AI can't react until it finishes its thought process. It's like a chess player who has to freeze their hand for 10 seconds every time they think, while their opponent keeps moving pieces.

2. The Solution: The "Thinking While Running" Team

The authors propose a Dual-Thread Architecture. Imagine a human runner who can also talk on the phone while running.

  • Thread 1 (The Runner/Acting): This part of the AI is constantly moving, fighting, and gathering resources. It doesn't wait for permission; it just keeps going.
  • Thread 2 (The Navigator/Planning): This part is constantly looking at the map, reading the chat, and thinking about the next best move.

The Magic Trick: These two threads run at the same time. The "Runner" keeps moving while the "Navigator" figures out the next step.

3. The "Interrupt" Button

This is the most important feature. In the old system, once the AI started digging for gold, it had to finish digging even if a monster attacked.

In this new system, the "Navigator" can hit an Interrupt Button.

  • Scenario: The AI is mining a diamond. Suddenly, the Navigator sees a zombie coming.
  • Action: The Navigator instantly yells, "Stop mining! Fight the zombie!"
  • Result: The "Runner" drops the pickaxe immediately and switches to a sword. No waiting, no finishing the current task first.

4. The "Central Brain" (Shared Memory)

In many AI teams, if Agent A sees a monster, Agent B doesn't know about it until Agent A finishes its whole task and tells them. This is like playing a game of "Telephone" where the message takes too long to get across.

This system uses a Centralized Memory System.

  • Think of it as a Live Google Doc that everyone is editing in real-time.
  • As soon as Agent A sees a monster, it updates the Doc.
  • Agent B reads the Doc instantly and knows to run away or help.
  • This ensures the whole team is always on the same page, using the freshest information.

5. The "Recipe Book" (Recursive Skill Library)

Minecraft is complex. To make a Diamond Sword, you need diamonds. To get diamonds, you need a pickaxe. To get a pickaxe, you need iron. To get iron, you need coal... and so on.

Old AI agents often got stuck because they didn't know the steps to get the tools they needed.

  • This paper gives the AI a Massive, Smart Recipe Book.
  • If you ask the AI to "Get a Diamond Sword," it doesn't just say "I don't know how." It automatically breaks it down: Need sword -> Need diamonds -> Need pickaxe -> Need iron -> Need coal.
  • It solves the whole chain of tasks automatically, like a chef who knows exactly how to get from "raw ingredients" to "gourmet meal" without being told every single step.

The Result?

The researchers tested this in Minecraft with teams of AI agents fighting bosses (like the Ender Dragon) and gathering resources.

  • Faster: They finished tasks much quicker because they didn't stop to think.
  • Smarter: They reacted instantly to danger because of the "Interrupt" button.
  • Better Teamwork: They shared information instantly via the "Live Google Doc."

In a nutshell: This paper teaches AI agents how to be like a professional sports team: they are constantly moving and reacting to the game in real-time, while a coach (the planner) is shouting instructions and adjusting the strategy on the fly, all without the players ever stopping to catch their breath.