Task-Agnostic Continual Learning for Chest Radiograph Classification

This paper introduces CARL-XRay, a task-agnostic continual learning framework for chest radiograph classification that utilizes a fixed backbone with incremental adapters and a latent task selector to achieve robust performance and accurate routing without retraining on historical data or storing raw images.

Muthu Subash Kavitha, Anas Zafar, Amgad Muneer, Jia Wu

Published 2026-02-18
📖 4 min read☕ Coffee break read

Imagine a hospital's AI system as a brilliant, overworked radiologist named "Dr. X-Ray."

The Problem: The "All-or-Nothing" Dilemma

In the past, if Dr. X-Ray needed to learn a new way of reading X-rays (say, from a different hospital with slightly different cameras or labeling habits), the old way of doing things was to wipe his memory clean and start over.

  • The Old Way: You'd show him 10,000 new X-rays, and he'd relearn everything from scratch. He might get really good at the new style, but he'd forget how to read the old ones perfectly. Or, you'd have to show him every single X-ray from the last 10 years every time he learned something new, which is impossible because of privacy laws and storage limits.

The Solution: The "Specialized Interns" System (CARL-XRay)

The authors of this paper propose a smarter way called CARL-XRay. Instead of retraining the whole doctor, they keep the main doctor's brain (the "Backbone") frozen and unchanged. This brain is already a master at seeing bones, lungs, and shadows.

When a new type of X-ray dataset arrives, they don't retrain the brain. Instead, they hire a tiny, specialized intern (called an "Adapter") just for that specific job.

  • The Backbone: The senior doctor who never forgets the basics.
  • The Adapters: A team of specialized interns. Intern A knows how to read "Hospital A's" X-rays. Intern B knows "Hospital B's." They are small, cheap to train, and don't mess with the senior doctor's brain.

The Big Challenge: "Who Am I Talking To?"

Here is the tricky part: In a real hospital, when a new X-ray comes in, the computer doesn't know which hospital it came from. It's like a patient walking in without a name tag.

  • If the computer guesses wrong and sends the X-ray to "Intern A" (who only knows Hospital A), but the X-ray is actually from Hospital B, the diagnosis will be wrong.
  • The system needs a Traffic Cop (called a "Latent Task Selector") to look at the X-ray and say, "Ah, this looks like it belongs to Intern B's group. Send it there!"

How They Keep the Traffic Cop Honest

The biggest fear in AI is Catastrophic Forgetting. As the Traffic Cop learns to recognize Hospital B, it might start forgetting what Hospital A looks like.

To fix this, the researchers use a clever trick called Feature-Level Experience Replay:

  • Instead of storing thousands of actual X-ray images (which is illegal or too expensive), the system saves tiny, compressed "snapshots" of what the interns saw when they learned their jobs.
  • Every time the Traffic Cop learns a new intern, it reviews these old snapshots to make sure it hasn't forgotten the old interns. It's like the Traffic Cop keeping a small, private diary of "what the interns looked like" rather than a photo album of every patient.

The Results: Why This Matters

The team tested this on two massive real-world datasets (MIMIC-CXR and CheXpert). Here is what they found:

  1. It Doesn't Forget: When they taught the system a second task, it didn't forget the first one. The "forgetting" was almost zero.
  2. It's Smarter at Guessing: When they didn't tell the system which hospital the X-ray was from (the "Task Unknown" scenario), their system guessed the right intern 75% of the time.
    • Comparison: A standard method that tries to learn everything at once (Joint Training) only guessed right 62.5% of the time.
  3. It's Efficient: They only had to train a tiny fraction of the parameters (0.08%). It's like upgrading a car's navigation system by just changing the map app, rather than rebuilding the whole engine.

The Takeaway

This paper introduces a way for medical AI to grow up naturally. Instead of being a rigid system that needs a total overhaul every time new data arrives, CARL-XRay is like a flexible organization:

  • It keeps its core knowledge safe.
  • It hires small, specialized teams for new jobs.
  • It uses a smart traffic cop to route patients correctly, even without ID tags.
  • It uses a "memory diary" to ensure no one gets forgotten.

This makes it a realistic, practical solution for hospitals that need to update their AI tools over years without breaking their current systems or violating patient privacy.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →