Less is More: Robust Zero-Communication 3D Pursuit-Evasion via Representational Parsimony

This paper demonstrates that explicitly reducing observation dimensionality and implementing locality-aware credit assignment in a communication-free multi-agent system enhances robustness and performance in asymmetric 3D pursuit-evasion tasks within cluttered environments.

Jialin Ying, Zhihao Li, Zicheng Dong, Guohua Wu, Yihuan Liao

Published Tue, 10 Ma
📖 5 min read🧠 Deep dive

Imagine a high-stakes game of "Tag" played in a massive, three-dimensional maze made of giant floating blocks (like a voxel game). You have four drones (the pursuers) trying to catch one fast, agile drone (the evader).

The problem? The maze is full of obstacles, the drones can't turn instantly, and—most importantly—they cannot talk to each other. In the real world, radio signals get delayed, blocked, or corrupted by noise. If the drones rely on hearing each other to coordinate, a split-second delay could cause them to crash into each other or miss the target entirely.

This paper asks a counter-intuitive question: What if the drones are too smart? What if giving them too much information about their teammates actually makes them worse at catching the target when communication is bad?

Here is the breakdown of their solution, "Less is More," using simple analogies.

1. The Problem: The "Over-Connected" Team

In many AI systems, robots are taught to constantly share their thoughts: "I see the target!" "I'm turning left!" "Watch out!"

The authors argue that in a noisy, delayed environment, this is like trying to run a relay race while everyone is shouting instructions over a walkie-talkie with a bad signal.

  • The Issue: If Drone A hears a message from Drone B that is 0.5 seconds old, Drone A might turn left based on that old info, only to realize Drone B has already turned right. This "stale information" causes confusion and crashes.
  • The Old Way: Give the AI more data (83 dimensions of information) about where every teammate is.
  • The Result: The AI gets overwhelmed by outdated, noisy data and starts making mistakes.

2. The Solution: "Representational Parsimony" (The Blindfolded Team)

The authors decided to try the opposite: Give the drones less information.

They created a "Parsimonious" (simple) version where the drones are effectively blindfolded regarding their teammates. They only see:

  • Where they are.
  • Where the target is (if visible).
  • The general shape of the maze (a map).
  • They do NOT see where their teammates are.

The Analogy: Imagine a group of hikers trying to find a lost friend in a dense forest.

  • The "Rich" Team: Everyone is constantly shouting, "I'm here!" "I'm over there!" But the wind (noise) distorts the voices. They get confused, run in circles, and trip over each other.
  • The "Parsimonious" Team: They agree to a simple rule: "Everyone move toward the center of the forest, but keep your eyes on the ground and the path." They don't talk. They just react to the terrain. Surprisingly, they move more smoothly and catch the target faster because they aren't distracted by confusing, delayed voices.

3. The Secret Sauce: "Contribution-Gated Credit Assignment" (The Fair Coach)

If the drones can't talk, how do they know they are working together? How do they know who gets the "credit" for the catch?

The authors invented a system called CGCA. Think of this as a fair coach who watches the game from a high tower.

  • The Rule: The coach only gives points if you are actually helping right now.
  • How it works: If a drone is far away (more than 60 meters) or just hovering around doing nothing, the coach says, "You get zero points for this catch." But if a drone is close, moving fast toward the target, and actively squeezing the target into a corner, the coach gives them a big reward.
  • The Result: This forces the drones to naturally coordinate without talking. They realize, "If I stay far away, I get no points. I need to get close and help." They self-organize into a perfect formation (like a net) just by chasing the reward.

4. The Results: Why "Less" Won

The team tested this in a brutal simulation (Stage 5) with 4 drones vs. 1 fast evader in a cluttered 3D city.

  • The "Rich" Team (Full Info): Caught the target 72% of the time but crashed into things 25% of the time. They were confused by the noise.
  • The "Parsimonious" Team (Simple Info + CGCA): Caught the target 75% of the time and crashed only 22% of the time.
  • The Stress Test: When they added delays, noise, or made the evader faster, the "Rich" team fell apart. The "Parsimonious" team just slowed down gracefully but kept working.

The Big Takeaway

The paper proves a simple but powerful design principle: In a chaotic, noisy world, simplicity is robust.

By stripping away the complex, fragile connections between robots (the "team-coupled" data) and relying on simple local rules and a fair reward system, the robots became better at working together. They didn't need to talk to coordinate; they just needed to know the rules and the map.

In short: Sometimes, to catch a fast target in a messy world, you don't need a super-connected team. You need a team that knows when to stop listening and start looking at the ground.