Hierarchical Dual-Strategy Unlearning for Biomedical and Healthcare Intelligence Using Imperfect and Privacy-Sensitive Medical Data

This paper proposes a hierarchical dual-strategy framework that achieves precise selective unlearning of privacy-sensitive medical knowledge in large language models while preserving fundamental competencies, demonstrated by high forgetting and preservation rates on clinical datasets with minimal parameter modification.

Yi Zhang, Chao Zhang, Zijian Li, Tianxiang Xu, Kunyu Zhang, Zhan Gao, Meinuo Li, Xiaohan Zhang, Qichao Qi, Bing Chen

Published Thu, 12 Ma

Imagine you have a brilliant, super-smart medical assistant named "Dr. AI." Dr. AI has read every medical textbook, studied millions of patient records, and learned how to diagnose everything from a common cold to complex brain surgeries.

But there's a problem. Dr. AI has a very bad memory. It remembers everything too well, including:

  1. Private Secrets: Specific details about a patient's surgery that shouldn't be public.
  2. Outdated Info: Old medical advice that has since been proven wrong.
  3. Restricted Knowledge: Highly specific surgical steps that a general doctor shouldn't know, only a specialist should.

If you ask Dr. AI to "forget" a specific patient's surgery, it usually panics. It either forgets everything (including how to treat a broken leg) or it forgets nothing (leaving the secret exposed).

This paper introduces a new, clever way to teach Dr. AI how to selectively forget without losing its smarts. The authors call it the "Hierarchical Dual-Strategy Unlearning" framework.

Here is how it works, using simple analogies:

1. The "Four-Layer Cake" (The Hierarchy)

First, the researchers realized that medical knowledge isn't just a giant pile of facts. It's like a four-layer cake:

  • Layer 1 (The Sponge): Basic biology (e.g., "Cells make up the body"). We must never forget this.
  • Layer 2 (The Frosting): General clinical skills (e.g., "How to check a fever"). Keep this safe.
  • Layer 3 (The Filling): Specialty skills (e.g., "How to treat heart disease"). Keep this mostly safe.
  • Layer 4 (The Cherry on Top): Specific, sensitive details (e.g., "The exact steps to remove Patient X's tumor"). This is what we need to remove.

The system knows exactly which layer it is touching. It won't accidentally eat the sponge (basic knowledge) while trying to remove the cherry (sensitive data).
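The four-layer idea can be sketched as a small policy table. This is a hypothetical illustration of the hierarchy, not the paper's actual labels or code; the layer names, policies, and `max_update` budgets are made-up stand-ins for "how aggressively may unlearning touch this layer."

```python
# Illustrative sketch of the four-layer knowledge hierarchy (names and
# policies are invented for this example, not taken from the paper).
FORGET, PRESERVE = "forget", "preserve"

HIERARCHY = {
    1: {"name": "foundational biology",    "policy": PRESERVE, "max_update": 0.0},
    2: {"name": "general clinical skills", "policy": PRESERVE, "max_update": 0.0},
    3: {"name": "specialty knowledge",     "policy": PRESERVE, "max_update": 0.1},
    4: {"name": "sensitive specifics",     "policy": FORGET,   "max_update": 1.0},
}

def action_for(layer: int) -> str:
    """Return the unlearning policy for a given knowledge layer."""
    return HIERARCHY[layer]["policy"]
```

The point of the table is that the unlearning procedure always knows which layer a piece of knowledge sits in, so only Layer 4 is ever a candidate for aggressive forgetting.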

2. The "Dual-Strategy" (Two Tools for the Job)

To remove the "cherry" without ruining the cake, they use two tools at the same time:

Tool A: The "Geometric Shield" (Gradient Updates)
Imagine Dr. AI's brain is a giant map of roads.

  • The Problem: When you try to erase a road (the surgery steps), you might accidentally block the highway to the hospital (general diagnosis).
  • The Solution: The researchers use a "geometric shield." They tell the AI: "You can erase this specific side street, but you must walk in a straight line that doesn't touch the highway."
  • How it works: It mathematically forces the AI to change its brain in a direction that only affects the specific thing it wants to forget, leaving the rest of the map untouched.
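A minimal sketch of this "straight line that doesn't touch the highway" idea is gradient projection: remove from the forgetting gradient any component that points along the direction that preserved knowledge cares about. This is a common "gradient surgery" trick and may differ from the paper's exact formulation; the function name and learning rate are illustrative.

```python
def shielded_update(g_forget, g_retain, lr=0.1):
    """Project the forgetting gradient onto the subspace orthogonal to the
    retain gradient, so the update step cannot move along the direction
    that would damage preserved knowledge (gradient-projection sketch;
    the paper's actual geometric constraint may be more elaborate)."""
    dot = sum(f * r for f, r in zip(g_forget, g_retain))
    denom = sum(r * r for r in g_retain)
    if denom > 0:
        # Subtract the component of g_forget that lies along g_retain.
        g_forget = [f - (dot / denom) * r for f, r in zip(g_forget, g_retain)]
    return [lr * f for f in g_forget]
```

After projection, the update is exactly orthogonal to the retain direction: erasing the side street provably does not move the model along the highway.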

Tool B: The "Highlighter Pen" (Token Interventions)
Imagine the AI is reading a book.

  • The Problem: Some words are just general words (like "patient" or "pain"), while others are specific secrets (like "tumor resection step 4").
  • The Solution: The system uses a "highlighter pen" to mark exactly which words belong to the secret surgery.
  • How it works: It tells the AI: "When you see the word 'tumor resection,' make it feel very uncomfortable so you stop remembering it. But when you see the word 'patient,' keep feeling comfortable so you remember that."
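One simple way to implement the highlighter idea is a token-level loss mask: flip the sign of the training loss on the highlighted (sensitive) tokens so the model is pushed *away* from remembering them, while ordinary tokens keep the normal loss. This is a hedged sketch of the mechanism, with an invented function name and weighting; the paper's token-intervention details may differ.

```python
def masked_unlearning_loss(token_losses, sensitive_mask, alpha=1.0):
    """Combine per-token losses: ascend (negated loss) on tokens marked
    sensitive, descend normally on everything else. `alpha` weights how
    hard the model is pushed away from sensitive tokens (illustrative)."""
    total = 0.0
    for loss, sensitive in zip(token_losses, sensitive_mask):
        total += -alpha * loss if sensitive else loss
    return total / len(token_losses)
```

With this objective, minimizing the loss makes the model worse at predicting "tumor resection step 4" while staying good at predicting "patient."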

3. The "Privacy Bubble" (Differential Privacy)

Even after teaching the AI to forget, there's a risk it might still "leak" a tiny bit of the secret.

  • The Solution: They add a "Privacy Bubble" (mathematical noise) around the learning process.
  • The Analogy: Imagine you are erasing a chalkboard. To make sure no one can see the faint ghost of the writing you erased, you sprinkle a little bit of confetti over the board. The confetti (noise) makes it impossible for anyone to guess what was written there before, but it doesn't stop the board from being used for new writing.
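The standard way to build such a bubble is the Gaussian mechanism from differentially private SGD: clip each gradient to a fixed norm, then add Gaussian noise scaled to that clip. The sketch below shows that recipe; the specific hyperparameters (`clip_norm`, `noise_multiplier`) are illustrative defaults, not the paper's settings.

```python
import math
import random

def dp_noisy_gradient(grad, clip_norm=1.0, noise_multiplier=1.0, rng=None):
    """Clip the gradient to at most clip_norm, then add Gaussian noise
    proportional to the clip (the DP-SGD Gaussian mechanism; values here
    are illustrative)."""
    rng = rng or random.Random(0)
    norm = math.sqrt(sum(g * g for g in grad))
    scale = min(1.0, clip_norm / norm) if norm > 0 else 1.0
    clipped = [g * scale for g in grad]            # bound any one example's influence
    sigma = noise_multiplier * clip_norm           # confetti: noise masks what was erased
    return [g + rng.gauss(0.0, sigma) for g in clipped]
```

Clipping bounds how much any single patient's record can move the model, and the noise hides the faint "ghost" of that movement from anyone inspecting the weights.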

4. The Results: "The Magic Eraser"

The researchers tested this on real medical data (including tricky, messy data with missing labels).

  • The Test: They asked the AI to forget specific surgical details and mental health secrets.
  • The Result:
    • Forgetting: The AI successfully "forgot" the sensitive info (82.7% forgetting rate).
    • Remembering: It kept its general medical skills largely intact (88.5% preservation rate).
    • Efficiency: Only about 0.1% of the model's parameters were modified. Usually, fixing an AI means retraining the whole thing; here, they tweaked a tiny fraction, saving huge amounts of time and compute.

Why Does This Matter?

In the real world, hospitals and researchers have to follow strict rules (like GDPR) that say, "If a patient asks to be forgotten, you must delete their data."

Before this paper, deleting that data meant either:

  1. Deleting the whole AI model (too expensive).
  2. Leaving the data in the model (illegal).

This new method is like having a Magic Eraser that can remove a specific stain from a white shirt without shrinking the shirt or changing its color. It allows hospitals to stay compliant with privacy laws, keep their AI smart, and protect patient secrets, all while dealing with messy, imperfect real-world data.