Holos: A Web-Scale LLM-Based Multi-Agent System for the Agentic Web

This paper introduces Holos, a web-scale, five-layer large language model-based multi-agent system designed to overcome scaling and coordination challenges in the Agentic Web through an efficient generation engine, market-driven orchestration, and an endogenous value cycle to enable long-term ecological persistence and self-organization.

Xiaohang Nie, Zihan Guo, Zicai Cui, Jiachi Yang, Zeyi Chen, Leheyi De, Yu Zhang, Junwei Liao, Bo Huang, Yingxuan Yang, Zhi Han, Zimian Peng, Linyao Chen, Wenzheng Tom Tang, Zongkai Liu, Tao Zhou, Botao Amber Hu, Shuyang Tang, Jianghao Lin, Weiwen Liu, Muning Wen, Yuanjian Zhou, Weinan Zhang

Published 2026-04-06
📖 5 min read🧠 Deep dive

Imagine the internet today as a massive library where every book is a robot (an AI agent) that can only read one specific page and then goes to sleep. If you ask a robot to "write a story," it might do a great job, but if you ask it to "plan a vacation, book flights, and then write a story about the trip," it gets confused, forgets things, or gives up.

Holos is a new idea that changes the library into a living, breathing city.

Instead of one giant, super-smart robot trying to do everything, Holos is a city of millions of specialized robots living together, talking to each other, hiring each other, and evolving over time. It's not just a tool; it's an ecosystem.

Here is how it works, broken down into simple concepts:

1. The Big Idea: From "One Super-Brain" to "A Whole Society"

Most people think Artificial General Intelligence (AGI)—the "holy grail" of AI that can do anything—will come from building one massive, perfect brain. The authors of this paper say: "No, that's wrong."

Think of it like a symphony orchestra. You don't need one musician who can play every instrument perfectly to make beautiful music. You need a violinist, a drummer, a conductor, and a trumpet player, all working together.

  • The Old Way: One giant robot trying to be a doctor, a lawyer, and a chef all at once.
  • The Holos Way: A city where one robot is a master chef, another is a brilliant lawyer, and a third is a traffic controller. They collaborate to solve big problems.

2. The Five Layers: How the City is Built

Holos is built like a skyscraper with five distinct floors, each doing a specific job to keep the city running:

  • The Front Door (Interface Layer): This is where you, the human, walk in. You just say, "I want to plan a trip to Mars." You don't need to know how the city works; you just talk to the doorman.
  • The Factory (Substrate Layer): This is the "sleeping" part. Imagine a warehouse with millions of robot blueprints. Most robots are asleep (saving energy). When a task comes in, the system wakes up the exact right robot, gives it tools, and sends it to work. It's like a "Just-in-Time" factory that only builds what it needs, right when it needs it.
  • The Conductor (Coordination Layer): This is the most important part. When you ask for a complex task, a "Conductor" robot breaks it down. It doesn't just give orders; it acts like a marketplace. It says, "Who is the best robot to book flights? Who is the best to write the itinerary?" It matches the job to the best worker, even if that worker is a stranger.
  • The Bank (Value Layer): This is the secret sauce. In most AI systems, robots work for free and then disappear. In Holos, robots get paid. If a robot does a good job, it earns "credits." If it does a bad job, it loses reputation. This creates a self-cleaning system: good robots get more work and become richer; bad robots get fired naturally. It's like a Yelp review system for AI, but with real money involved.
  • The Result (Outcome Layer): This is where the final product is delivered to you, along with a report card showing who did what and how well.

3. The Magic Sauce: "Blind Planning" and "Market Dispatch"

How does the system handle millions of robots without getting chaotic?

  • Blind Planning: The Conductor doesn't say, "Bob, do this." Instead, it says, "We need someone who can write code." It creates a "job description" without knowing who will take it. This prevents the system from getting stuck if a specific robot goes offline.
  • The Market: Once the job is posted, any robot that thinks they can do it can "bid" for it. The system uses a smart algorithm to pick the best bidder based on their past performance (credit score) and the price they ask. It's like an Uber for AI tasks.

4. Why This Matters: The "Self-Healing" City

The paper tested this system with over 1 million robots. Here is what happened:

  • Resilience: If a robot crashes or gets hacked, the system doesn't panic. It just finds a replacement robot from the millions available and keeps going. It's like a forest; if one tree falls, the forest survives.
  • Evolution: Because robots want to earn more credits, they naturally get better at their jobs. The system "evolves" over time, becoming smarter and more efficient without humans needing to constantly reprogram it.
  • Efficiency: The system is so efficient that it can hold 1 million robots in the memory of a standard laptop (using less than 500 bytes of memory per robot!).

5. The Future: A Digital Ecosystem

The authors call this the "Agentic Web."
Imagine a future where:

  • Your personal AI assistant doesn't just answer questions; it negotiates with your bank's AI to lower your interest rate.
  • A travel AI hires a weather AI, a traffic AI, and a hotel AI to build a perfect vacation plan in seconds.
  • These AI agents have their own "careers," building reputations and economies.

In a nutshell:
Holos is the operating system for a future where AI isn't just a tool you use, but a society of workers that lives, works, trades, and improves itself on the internet. It turns the internet from a place where we search for information into a place where we hire intelligence.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →