Nonparametric bounds for vaccine effects in randomized trials

This paper relaxes the strong assumption of no unmeasured confounding between infection risk and vaccine belief in blinded randomized trials to derive nonparametric causal bounds for vaccine efficacy using linear programming and monotonicity-based methods, demonstrating their application through synthetic and semi-synthetic data.

Rachel Axelrod, Uri Obolski, Daniel Nevo

Published Thu, 12 Ma
📖 5 min read🧠 Deep dive

Imagine you are trying to figure out how well a new umbrella works at keeping you dry. You give half a group of people the new umbrella (the vaccine) and the other half a fancy, useless stick (the placebo). You want to know: Does the umbrella actually stop the rain (the virus), or did the people just stop getting wet because they thought they had an umbrella?

This is the core problem of vaccine trials when "blinding" fails.

The Problem: The "Broken Blind"

In a perfect trial, no one knows if they got the real vaccine or the placebo. This is called being "blinded." But sometimes, the real vaccine causes a sore arm or a fever, while the placebo doesn't. Suddenly, people guess, "Oh, I must have the real one!"

Once they guess, their behavior changes. If they think they are protected, they might stop wearing masks, go to crowded parties, or hug more people.

  • The Immunological Effect: The vaccine's actual biological power to fight the virus.
  • The Behavioral Effect: The change in behavior because the person thinks they are protected.

If you just look at the final infection rates, you get a messy mix of both. It's like trying to measure how good a car's engine is, but the driver is also driving faster because they think the car is a Ferrari. You can't tell if the speed comes from the engine or the driver's confidence.

The Old Solution vs. The New Solution

Previously, statisticians tried to separate these two effects by making a very strict guess: "There is no hidden personality trait that makes someone both guess they have the vaccine AND take more risks."

The authors of this paper say: "That guess is too strong. It's unrealistic."

  • Example: An optimistic person might think, "I feel great, I must have the vaccine!" (Guessing correctly). But that same optimism might make them more social and less careful (Behavior).
  • Because this "hidden personality" (let's call it U) exists, we can't pinpoint the exact number for how well the vaccine works. The exact number is hidden.

The New Idea: Drawing a "Fence" (Bounds)

Instead of trying to find the single, exact number (which is impossible without that strong guess), the authors propose drawing a fence around the answer. They calculate a range (a lower limit and an upper limit) where the true answer must lie.

Think of it like this: You can't tell exactly how much money is in a piggy bank without breaking it. But if you know the bank weighs between 2 and 3 pounds, and you know the coins inside, you can say, "The money is definitely between $50 and $80." You don't know the exact cent, but you have a useful, safe range.

The paper offers two ways to build this fence:

1. The "Math Puzzle" Method (Linear Programming)

Imagine you have a giant jigsaw puzzle where some pieces are missing. You know the shape of the box and a few pieces you do have. You try to fit the missing pieces in every possible way that doesn't break the rules of the puzzle.

  • You find the arrangement that gives the lowest possible vaccine effectiveness.
  • You find the arrangement that gives the highest possible vaccine effectiveness.
  • The truth is somewhere in between. This method is very rigorous but can sometimes give a very wide fence (e.g., "The vaccine is between 0% and 100% effective"), which isn't very helpful.

2. The "Common Sense" Method (Monotonicity)

This method adds a little bit of "common sense" to the math puzzle. It assumes that if a hidden factor (like optimism) makes someone more likely to think they have the vaccine, it probably also makes them more likely to get infected (by taking more risks). It assumes these things move in the same direction.

  • By adding this reasonable assumption, the fence gets much tighter. Instead of "0% to 100%," you might get "36% to 47%." This is much more useful for doctors and policymakers.

The Real-World Test: The "Sore Arm" Trial

The authors tested their new math on a real COVID-19 vaccine trial (ENSEMBLE2).

  • The Situation: People who got the vaccine got sore arms (Side Effects). People who got the placebo didn't. This broke the blind.
  • The Result: The standard calculation said the vaccine was about 39% effective.
  • The New Calculation:
    • Without extra assumptions, the "fence" was wide (not very helpful).
    • With the "common sense" assumption (that feeling protected makes people riskier), the fence tightened to 36.5% to 47.0%.
    • This confirmed that the vaccine was indeed working, but the "real-world" effectiveness (including behavior changes) was slightly different than the pure biological effect.

Why This Matters

In a world where people often guess their treatment status (because of side effects, news, or rumors), we can't always get a perfect, single number for how well a vaccine works.

This paper gives us a toolbox to say: "We can't know the exact number, but we can be 100% sure the answer is at least X and at most Y." This helps policymakers make safer decisions without needing to pretend that human behavior is perfectly predictable or that hidden personality traits don't exist.

In short: When the blind is broken, don't panic and guess a single number. Instead, build a sturdy fence around the truth using math and common sense, so we know exactly where the answer lies.