The Geometry of Transfer: Unlocking Medical Vision Manifolds for Training-Free Model Ranking

This paper proposes a novel training-free Topology-Driven Transferability Estimation framework that leverages global and local topological metrics to accurately rank medical foundation models for segmentation tasks, significantly outperforming existing classification-based methods on the OpenMind benchmark.

Jiaqi Tang, Shaoyang Zhang, Xiaoqi Wang, Jiaying Zhou, Yang Liu, Qingchao Chen

Published 2026-03-02
📖 4 min read☕ Coffee break read

Imagine you are a chef trying to cook a perfect meal for a very specific guest. You have a massive library of 114,000 pre-cooked ingredients (these are the "Medical Foundation Models" trained by AI on huge amounts of unlabeled data). You need to pick the one ingredient that will work best for your specific dish (e.g., detecting a tiny tumor in a kidney vs. mapping a large brain region).

The Problem:
Usually, to find the best ingredient, you would have to cook a test batch with every single one of those 114,000 ingredients, taste them, and see which one wins. In the world of medical AI, "cooking" means fine-tuning the model, which takes days of computing power and costs a fortune. It's like trying to find the perfect spice by baking a whole cake for every single spice jar in the world.

The Old Way (The Flawed Map):
Previous methods tried to guess the winner by looking at the "statistics" of the ingredients. They asked, "Do the colors and shapes of the ingredients look similar to the final dish?"

  • The Flaw: This is like judging a map of a city by looking only at the average color of the paint. It misses the roads. In medical imaging, the most important part isn't the big, blurry background; it's the sharp, jagged edges where a tumor meets healthy tissue. Old methods got lost in the big picture and failed to see the critical boundaries.

The New Solution: "The Topology Detective"
This paper introduces a new way to pick the best model without cooking a single test batch. Instead of looking at statistics, they look at the shape and structure (the "Topology") of the data.

Think of it like this:

  1. The Global View (GRTD): The "Tree of Life"
    Imagine you have a bunch of people (data points) in a room. The old way just counted how many people are wearing red shirts. The new way builds a Minimum Spanning Tree—a single, unbroken line connecting everyone in the room based on how close they stand to each other.

    • If the AI model is good, the line connecting "Tumor" people will naturally stay separate from the line connecting "Healthy" people.
    • If the line gets tangled and mixes them up, the model is bad. This checks if the model understands the overall shape of the problem.
  2. The Local View (LBTC): The "Fence Inspector"
    Sometimes, the big picture looks fine, but the fence between two neighbors is broken. In medical scans, this is the boundary between a disease and healthy tissue.

    • The new method zooms in on these critical edges. It checks: "If I stand right on the border, can the AI clearly tell me which side is which, or is it confused?"
    • It ensures the AI doesn't just guess the general area but respects the sharp lines where lives depend on precision.
  3. The Smart Mixer (Task-Adaptive Fusion): The "Tailored Suit"
    Not all tasks are the same.

    • If you are looking for a large organ (like a whole liver), you care more about the "Global View" (the big tree).
    • If you are looking for a tiny, fragmented lesion (like a small stroke), you care more about the "Local View" (the fence).
    • The paper's system is a smart tailor. It automatically decides how much weight to give the "Tree" vs. the "Fence" based on how complex the specific medical task is. It creates a custom score for every single job.

The Result:
The authors tested this on a massive benchmark called "OpenMind."

  • Old Methods: Got confused, often ranking the worst models as the best (negative correlation).
  • New Method: Correctly predicted the winner 31% better than the best existing methods.
  • Speed: It did this in minutes (by just looking at the data structure) instead of days (by retraining the models).

In a Nutshell:
Instead of trying every key in a giant keyring to open a door (fine-tuning), this new method looks at the shape of the key's teeth (topology) to instantly know which one fits. It checks both the overall shape of the key and the tiny notches on the edge, ensuring it picks the perfect tool for the job without ever having to try it in the lock. This saves massive amounts of time and money, making it possible to deploy the best medical AI models quickly and efficiently.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →