Identifying and Characterising Response in Clinical Trials: Development and Validation of a Machine Learning Approach in Colorectal Cancer

This paper presents and validates a novel machine learning framework that integrates partly conditional modelling, the Virtual Twins method, and survLIME to identify and characterize both static and dynamic treatment responses in metastatic colorectal cancer trials, demonstrating improved performance over existing methods and identifying key predictive factors such as genetic mutations, metastasis sites, and ethnicity.

Adam Marcus, Paul Agapow

Published 2026-03-03
📖 5 min read🧠 Deep dive

Imagine you are a chef trying to perfect a new recipe for a complex dish. In the past, you might have cooked one giant pot of soup, tasted it, and declared, "This is good for everyone!" But in reality, some people love the soup, some find it too salty, and others get a stomach ache.

Precision Medicine is the idea that instead of one giant pot, we should cook individualized meals for every single person based on their unique taste buds (genetics, lifestyle, etc.). The problem is, figuring out who likes what is incredibly hard, especially when people's tastes change over time.

This paper by Adam Marcus and Paul Agapow is like a new, high-tech kitchen tool designed to solve this exact problem. Here is a simple breakdown of what they did, using some everyday analogies.

1. The Problem: The "Snapshot" vs. The "Movie"

Most medical studies are like taking a photograph at the very beginning of a trip. They look at a patient's health once, give them a drug, and see what happens.

  • The Flaw: Life isn't a photo; it's a movie. Patients' bodies change, tumors evolve, and how they react to medicine can shift day by day. Old methods ignore this "movie" and only look at the first frame, missing crucial clues.

2. The Solution: A "Virtual Twin" Time Machine

The authors built a machine learning system that works in three steps:

  • Step 1: The Virtual Twin (The "What If" Simulator)
    Imagine you have a twin. You give the real you a new medicine, but you can't give the twin the same medicine (for ethical reasons). However, this AI creates a "Virtual Twin" of you. It simulates: "If you had taken the other drug instead, what would have happened?"
    By comparing the Real You (who took Drug A) with the Virtual You (who took Drug B), the system calculates the exact difference the drug made.

  • Step 2: The Time-Traveling Detective (Partly Conditional Modelling)
    This is the paper's secret sauce. Instead of just looking at the start of the movie, the system watches the whole film. It treats every time a patient gets a blood test or a check-up as a new "scene."

    • Analogy: Think of a detective solving a crime. Old methods only looked at the crime scene at 9:00 AM. This new method looks at the scene at 9:00 AM, 10:00 AM, and 11:00 AM, noticing that the suspect's behavior changed as the day went on. This helps catch "dynamic responders"—people who start out doing poorly but turn the corner later, or vice versa.
  • Step 3: The Translator (survLIME)
    Once the AI knows who is responding, it needs to explain why. AI models are often "black boxes" (we know the answer, but not how they got there).
    The authors used a tool called survLIME (a translator for survival data). It looks at the AI's decision and says, "Okay, the AI decided this person is a responder because of their specific gene mutation and where their cancer spread." It turns the complex math into a readable list of reasons.

3. The Test Drive: The Simulation

Before using this on real patients, they tested it in a "video game" world (simulation).

  • They created 1,000 fake patients.
  • They gave some a "magic drug" that worked only on people with specific traits.
  • The Result: The old methods (the "snapshot" approach) were okay at finding the winners (73% accuracy). But the new "movie" method (using their time-traveling technique) was much better (77% accuracy).
  • The Dynamic Twist: When they made the patients' traits change during the game (like a tumor mutating), the old methods got confused and failed (dropping to 60% accuracy). The new method adapted and stayed strong (68% accuracy).

4. The Real World Test: Colorectal Cancer

They then applied this to real data from four major cancer trials involving a drug called Panitumumab.

  • What they found: The system correctly identified that specific genetic mutations (like KRAS and BRAF) and where the cancer had spread (like to the brain or bones) were the biggest factors in whether the drug worked.
  • The Surprise: It also flagged ethnicity as a factor. This aligns with real-world observations that different racial groups sometimes respond differently to treatments, likely due to biological differences or social factors affecting care.
  • Why it matters: The system didn't just say "It works." It said, "It works best for this specific type of person, at this specific stage of their disease."

5. The Catch (Limitations)

No tool is perfect. The authors admit:

  • It needs a big crowd: To work well, you need a lot of data (like 1,000+ patients). Small studies might not give clear answers.
  • It's computationally heavy: It requires a lot of computer power to run these simulations.
  • It's a suggestion, not a law: Because it looks at data after the fact (retrospectively), it can't prove cause-and-effect on its own. It's a brilliant map that tells us where to look, but we still need to go out and build the road (run new clinical trials) to confirm it.

The Bottom Line

This paper is about upgrading our medical "GPS." Instead of giving everyone the same directions based on where they started, this new system watches the journey in real-time, accounts for traffic jams and detours (changing health conditions), and tells us exactly which route works best for each driver. It's a step toward a future where cancer treatment isn't a "one-size-fits-all" guess, but a tailored plan that evolves with the patient.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →