Mozi: Governed Autonomy for Drug Discovery LLM Agents

This paper introduces Mozi, a dual-layer architecture that combines a governed supervisor-worker control plane with a stateful workflow plane to enable reliable, auditable, and error-resistant autonomous drug discovery by balancing flexible reasoning with deterministic execution.

He Cao, Siyu Liu, Fan Zhang, Zijing Liu, Hao Li, Bin Feng, Shengyuan Bai, Leqing Chen, Kai Xie, Yu Li

Published 2026-03-05
📖 5 min read🧠 Deep dive

Mozi: The "Governed Autopilot" for Drug Discovery

Imagine trying to build a skyscraper. You have a brilliant architect (the AI) who can dream up amazing designs, but if you let them run wild without a foreman, they might try to build a bridge out of jelly or forget to check if the foundation is solid. In the world of drug discovery, a mistake isn't just a collapsed building; it's a wasted decade and billions of dollars.

This paper introduces Mozi, a new system designed to be the perfect "Chief of Staff" for AI scientists. It doesn't just let the AI chat and guess; it puts the AI on a strict, safe, and highly organized leash.

Here is how Mozi works, explained through simple analogies:

1. The Problem: The "Wild West" AI

Current AI agents are like enthusiastic interns who read a lot of books but have never worked in a lab.

  • The Issue: If you ask a standard AI to "find a cure for Alzheimer's," it might hallucinate (make things up), pick the wrong tools, or get stuck in a loop. In drug discovery, one small mistake early on (like picking the wrong protein) ruins everything that comes after. It's like trying to bake a cake with the wrong flour; no matter how good the frosting is, the cake is ruined.
  • The Bottleneck: We need AI that is creative but also follows strict safety rules and scientific procedures.

2. The Solution: Mozi's Two-Layer Architecture

Mozi solves this by splitting the work into two distinct teams, like a General and a Factory.

Layer A: The General (The Control Plane)

Think of this as the Project Manager or the Traffic Cop.

  • Role: It doesn't do the heavy lifting. Instead, it listens to your request (e.g., "Find a drug for Sepsis") and breaks it down into a strict checklist.
  • The Rules: It acts as a gatekeeper. It decides who is allowed to do what.
    • Analogy: If the "Research Intern" asks to use the expensive, dangerous nuclear reactor (a complex simulation), the General says, "No, you can only use the library books."
    • Self-Correction: If the intern makes a mistake, the General stops the process, says, "Wait, that didn't work," and re-plans the route before moving forward. It prevents the AI from drifting off course.

Layer B: The Factory (The Workflow Plane)

Think of this as the Assembly Line or the Master Chefs.

  • Role: This is where the actual work happens. Mozi has pre-built, step-by-step "recipes" for drug discovery (like Target Identification, Finding Hits, and Optimizing Leads).
  • The Safety Net: These recipes are rigid. They ensure that if Step 1 (finding a protein) isn't perfect, Step 2 (testing drugs) never starts.
  • The Human Checkpoint: Crucially, at the most dangerous or uncertain moments, the system pauses. It calls a human expert (a real scientist) to say, "Hey, we found this protein. Is this the right one?" The human hits "Go," and the machine continues. This turns the AI from a "black box" into a co-scientist.

3. How It Works in Real Life: The "Drug Discovery Pipeline"

Mozi treats drug discovery like a relay race with four distinct legs. The baton (the data) must be passed perfectly between runners.

  1. Target Identification (Finding the Enemy): The AI looks at a disease (like Parkinson's) and finds the specific protein causing the trouble.
    • Mozi's trick: It checks multiple databases and asks a human, "Are we sure this is the right target?" before moving on.
  2. Hit Identification (Finding the Bullet): It searches millions of chemical compounds to find ones that might stick to that protein.
    • Mozi's trick: It uses two strategies at once: one that creates new molecules from scratch and another that screens existing libraries. It filters out the junk immediately.
  3. Hit-to-Lead (Polishing the Bullet): It takes the best candidates and tweaks them to make them stronger and safer.
    • Mozi's trick: It runs strict "safety tests" (like checking if the drug would poison the liver). If a candidate fails, it's tossed out automatically.
  4. Lead Optimization (The Final Polish): It fine-tunes the best candidate to ensure it works in the human body, can be manufactured, and isn't toxic.

4. Why Mozi is a Game-Changer

The paper tested Mozi on real-world scenarios (Crohn's disease, Parkinson's, and Sepsis) and compared it to other AI systems.

  • Reliability: While other AIs often get confused or hallucinate, Mozi's "General" keeps it on track. If a computer simulation crashes, Mozi catches the error, logs it, and keeps going without the whole system failing.
  • Speed & Scale: In the Parkinson's test, Mozi screened 377,000 compounds in just 35 minutes. That's a job that would take a human team months.
  • Quality: The drugs Mozi designed were not just random guesses. They were chemically sound, safe, and competitive with drugs currently in clinical trials.

The Bottom Line

Mozi is the bridge between "Creative AI" and "Rigid Science."

Before Mozi, using AI for drug discovery was like giving a toddler a scalpel and saying, "Go perform surgery." It might work by luck, but it's dangerous.
With Mozi, it's like giving the toddler a scalpel but putting them in a surgical theater with a strict head surgeon watching every move, ready to step in if things go wrong.

It transforms the AI from a chatty, unreliable conversationalist into a reliable, governed co-scientist that can help us discover life-saving medicines faster and safer than ever before.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →