GRAND: Guidance, Rebalancing, and Assignment for Networked Dispatch in Multi-Agent Path Finding

The paper proposes GRAND, a hybrid hierarchical algorithm that combines reinforcement learning-based graph guidance with minimum-cost flow rebalancing and local assignment to achieve up to 10% higher throughput than state-of-the-art schedulers for large-scale, lifelong multi-agent pickup-and-delivery tasks while maintaining real-time execution.

Johannes Gaber, Meshal Alharbi, Daniele Gammelli, Gioele Zardini

Published 2026-03-06
📖 4 min read☕ Coffee break read

Imagine a massive, bustling warehouse filled with hundreds of tiny robots. Their job is simple: pick up packages from one spot and drop them off at another. But here's the catch: there are thousands of packages, the robots are constantly moving, and if they aren't coordinated perfectly, they'll get stuck in traffic jams, bump into each other, and the whole operation will slow to a crawl.

This paper introduces a new "traffic controller" for these robot fleets called GRAND.

Think of GRAND not as a single brain trying to micromanage every robot's every step, but as a smart, three-layered management system that combines intuition with math.

Here is how it works, broken down into three simple steps:

1. The Intuition Layer: The "Weather Forecaster" (Guidance)

In a traditional system, a manager might just say, "Robot A, go to the nearest package." This is fast, but it often leads to everyone rushing to the same spot, causing a gridlock.

GRAND uses a Graph Neural Network (a type of AI) trained like a video game character. Instead of telling specific robots where to go, this AI acts like a weather forecaster. It looks at the whole warehouse and predicts: "Hey, the North aisle is getting crowded, but the South aisle is empty. We need more robots to head South."

It doesn't give specific orders; it just sets a desired distribution. It's like a coach shouting, "More players to the left side!" rather than telling every single player exactly which foot to move.

2. The Math Layer: The "Traffic Router" (Rebalancing)

Once the AI says, "We need more robots in the South," GRAND uses a classic math tool called Minimum-Cost Flow.

Imagine you have a pile of empty delivery trucks in the North and a pile of packages in the South. You need to move the trucks to the South efficiently. This step calculates the absolute most efficient way to move the "free" robots from where they are to where they are needed, without worrying about specific packages yet. It's the logistics manager ensuring the workforce is in the right neighborhoods before the work starts.

3. The Local Layer: The "Matchmaker" (Assignment)

Now that the robots are in the right general areas, the system does the final, quick math to pair them up with specific packages.

Because the robots are already in the right "neighborhoods" (thanks to the first two steps), this part is very fast. It's like a local matchmaker at a party. Since everyone is already in the right room, it's easy to pair Person A with Task B without them having to cross the whole building. This step solves small, local puzzles instantly.

Why is this a big deal?

  • Speed vs. Smarts: Usually, you have to choose between being fast (simple rules) or being smart (complex math that takes too long). GRAND does both. The AI provides the "smart" big-picture view, and the math handles the "fast" execution.
  • No Traffic Jams: The paper tested this in a simulation with up to 500 robots. The result? GRAND moved 10% more packages than the previous best system. That's a huge difference in a real warehouse.
  • Real-Time: It does all this thinking in less than one second per step. It's fast enough to run on a standard computer while the robots are actually moving.

The Analogy Summary

Imagine a busy airport:

  • Old Way: A single controller tries to tell every single plane exactly where to land and taxi, one by one. It gets overwhelmed, and planes circle waiting for instructions.
  • GRAND Way:
    1. Guidance: An AI predicts, "Terminal 3 is getting backed up; send more ground crews to Terminal 1."
    2. Rebalancing: The ground crew manager moves the empty trucks to Terminal 1 efficiently.
    3. Assignment: The local staff at Terminal 1 quickly grab the nearest truck and assign it to the waiting plane.

The Bottom Line: GRAND is a hybrid system that uses AI to see the "big picture" and math to handle the "details." It keeps the robots moving smoothly, prevents traffic jams, and gets more work done in less time. It's a blueprint for how to manage massive fleets of robots in the real world.