Evolving Medical Imaging Agents via Experience-driven Self-skill Discovery

The paper introduces MACRO, a self-evolving medical imaging agent that autonomously discovers and synthesizes reusable composite tools from verified execution trajectories to overcome the brittleness of static tool chains and enhance multi-step clinical decision-making across diverse domains.

Lin Fan, Pengyu Dai, Zhipeng Deng, Haolin Wang, Xun Gong, Yefeng Zheng, Yafei Ou

Published Mon, 09 Ma
📖 4 min read☕ Coffee break read

Imagine you are a junior doctor trying to learn how to diagnose complex eye diseases from X-ray images.

The Old Way (Current AI):
Right now, most medical AI agents are like a junior doctor who has been given a rigid, unchangeable checklist.

  • "Step 1: Look at the image."
  • "Step 2: Measure the size."
  • "Step 3: Compare to the average."

If the hospital changes its X-ray machine, or if a patient has a rare condition that doesn't fit the checklist, the AI gets stuck. It can't think outside the box. It's like a chef who only knows how to make a sandwich because that's the only recipe they were given, even if they are handed a pizza. If the ingredients change, the chef panics.

The New Way (MACRO):
The paper introduces MACRO, a medical AI that learns more like a human expert than a robot. Instead of sticking to a static checklist, MACRO learns by doing, failing, and then teaching itself new tricks.

Here is how MACRO works, using a simple analogy:

1. The "Mental Notebook" (Experience-Grounded Memory)

Imagine MACRO has a magical notebook. Every time it successfully diagnoses a patient, it doesn't just throw the case away. It writes down: "Hey, for this type of blurry eye image, I found that doing A, then B, then C worked perfectly."

If a new patient comes in with a similar blurry image, MACRO opens its notebook, finds that past success, and says, "I remember how to handle this!" This helps it adapt to new situations immediately, rather than starting from scratch.

2. The "Shortcut Discovery" (Self-Skill Discovery)

This is the coolest part. Imagine MACRO is solving a puzzle. It notices that every time it solves a specific type of problem, it has to do the same three steps over and over again:

  1. Clean the image.
  2. Highlight the edges.
  3. Measure the shape.

In the old way, the AI would have to remember to do all three steps every single time. But MACRO is smart. It realizes, "Wait, I do these three steps together so often, they should be one single step!"

So, it creates a new tool called "Clean-Highlight-Measure" and adds it to its toolbox. Now, instead of taking three steps, it just clicks one button. It's like a carpenter who, after sawing, sanding, and painting a specific type of chair 100 times, invents a single "Chair-Maker" machine to do it all at once.

3. The "Coach" (Reinforcement Learning)

MACRO has a virtual coach (the training loop). When MACRO tries a new shortcut and it works, the coach gives it a high-five (a reward). When it tries a shortcut and fails, the coach says, "Try something else." Over time, MACRO builds a massive library of these "super-tools" (composite tools) that are proven to work.

Why Does This Matter?

  • Adaptability: Hospitals change. New machines are bought. Old diseases get new names. The old AI breaks when things change. MACRO just learns the new pattern, creates a new shortcut, and keeps going.
  • Efficiency: By turning long, complicated sequences into single "super-tools," MACRO gets faster and more accurate.
  • Real-World Ready: It doesn't need a team of engineers to rewrite its code every time a new medical protocol is introduced. It learns on the job, just like a human doctor does.

In a nutshell:
Current medical AI is like a robot following a script.
MACRO is like a curious apprentice who watches the master, figures out the best ways to do things, writes those ways down as new rules, and gets better every single day without needing a human to rewrite its manual.