Robust evaluation of treatment effects in longitudinal studies with truncation by death or other intercurrent events

This paper proposes the Pairwise Last Observation Time (PLOT) estimand, a novel, assumption-free causal inference method that robustly evaluates treatment effects in longitudinal studies by comparing matched individuals at their last common observation time before intercurrent events, thereby avoiding the unverifiable assumptions and sensitivity issues inherent in existing frameworks.

Georgi Baklicharov, Kelly Van Lancker, Stijn Vansteelandt

Published Thu, 12 Ma
📖 5 min read🧠 Deep dive

Imagine you are running a race to see which of two running shoes (Shoe A and Shoe B) helps runners go the fastest. You have a huge group of people, and you randomly give half of them Shoe A and the other half Shoe B.

But here's the problem: Life happens.

Some runners get a flat tire (they need a "rescue" shoe). Some runners get a sudden cramp and have to stop running entirely (they "drop out"). And tragically, some runners get sick and have to leave the race forever (they "die").

In a standard race analysis, if you just look at who finished the race, you might get the wrong answer.

  • If Shoe A makes people run so fast they get exhausted and quit, but Shoe B makes them run slowly and finish, a standard analysis might say "Shoe B is better!" even if Shoe A was actually the faster shoe.
  • If the race stops because someone died, you can't measure their speed after that point. If you just ignore the dead runners, you are only looking at the "survivors," which might be a very different group of people than the ones who died.

This paper introduces a new, clever way to judge the shoes called PLOT (Pairwise Last Observation Time).

The Old Ways vs. The New Way

1. The "Last Look" Method (LOCF):
Imagine you look at the runners at the very end of the race. If a runner quit early, you just take their last known speed and pretend they kept running at that speed until the finish line.

  • The Flaw: This is like guessing a runner's speed at mile 26 based on how fast they were at mile 1. It's a guess, and it can be very wrong.

2. The "Survivors Only" Method:
You only look at the people who finished the race.

  • The Flaw: This is biased. Maybe Shoe A is so tough that only the strongest, most elite runners can finish in it, while the weaker ones quit. If you only look at the finishers, you might think Shoe A is amazing, but it's actually just filtering out the weak runners.

3. The "What If" Method (Hypothetical):
You try to imagine a magical world where no one ever got a flat tire or died, and calculate what would have happened.

  • The Flaw: This requires making up a lot of rules about a world that doesn't exist. If your rules are slightly wrong, your whole answer is wrong.

The New "PLOT" Method: The "Handshake" Analogy

The authors propose a method called PLOT. Instead of looking at the end of the race or guessing the future, they use a "pairing" strategy.

Imagine you take two runners: one wearing Shoe A and one wearing Shoe B. You pair them up based on how fit they were at the start (their "baseline").

Now, you watch them run together.

  • Runner A gets a flat tire at mile 10.
  • Runner B gets a cramp at mile 12.

The race for this specific pair stops at mile 10. Why? Because that is the last moment both of them were still running freely.

You compare their speeds exactly at mile 10. You don't guess what they would have done at mile 20. You don't ignore Runner A because they stopped. You just say, "Okay, at the moment the first person in this pair stopped, here is how they compared."

You do this for thousands of pairs. Some pairs stop at mile 2, some at mile 15. You average all these "last moments" together.

Why is this better?

  • Fairness: You are comparing the shoes at the exact same point in time for both runners. You aren't giving Shoe A an unfair advantage because its runners lasted longer.
  • No Magic: You aren't guessing what would happen if they didn't get sick. You are using real data from the moment they were actually running.
  • Robustness: Even if the "flat tires" happen for weird reasons (like unmeasured health issues), this method is very good at ignoring those tricks and telling you the truth about the shoes.

The "Synchronous" Concept

Think of it like a dance.

  • In the old methods, one dancer might keep dancing after the music stops, while the other stops early. You try to guess how the early stopper would have danced.
  • In the PLOT method, you stop the music the moment either dancer stops. You look at how they were dancing together right up until that moment. It's a "synchronous" snapshot.

The Real-World Test (The DEVOTE Trial)

The authors tested this idea on a real medical study about diabetes drugs. In this study, some patients died or had to stop taking the drug.

  • Old methods gave confusing or very uncertain results.
  • The PLOT method gave a clear, confident answer: One drug (Insulin Degludec) caused significantly fewer dangerous low-blood-sugar events than the other, even when accounting for people dying or dropping out.

The Bottom Line

When a study is ruined by people dropping out, getting sick, or dying, don't just guess what would have happened, and don't just ignore the people who left.

Instead, pair people up, watch them run until the first one stops, and compare them right there. It's a simpler, fairer, and more honest way to see which treatment really works.