Heritable confounding in Mendelian randomization studies

This paper demonstrates that Mendelian randomization studies using genetic variants with small effect sizes are susceptible to heritable confounding that biases causal estimates in a consistent direction across methods, but shows that this bias can be mitigated through pre-estimation filtering or multivariable MR when potential confounders are identified.

Original authors: Sanderson, E., Rosoff, D., Vitt, N., Palmer, T., Tilling, K., Davey Smith, G., Hemani, G.

Published 2026-04-29
📖 5 min read🧠 Deep dive

Original authors: Sanderson, E., Rosoff, D., Vitt, N., Palmer, T., Tilling, K., Davey Smith, G., Hemani, G.

Original paper licensed under CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/). ⚕️ This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

The Big Picture: Trying to Find the "Real" Cause

Imagine you are a detective trying to solve a mystery: Does high inflammation (measured by a protein called C-Reactive Protein, or CRP) actually cause Type 2 Diabetes?

In the real world, it's hard to tell cause from effect because everything is mixed up. Maybe people who eat a lot of junk food have both high inflammation and diabetes. If you just look at the data, it looks like inflammation causes diabetes, but really, the junk food is the culprit. This is called confounding.

To solve this, scientists use a method called Mendelian Randomization (MR). Think of MR as using genetic lottery tickets as a detective's tool.

  • You are born with certain genetic "tickets" (variants) that naturally make your CRP levels higher or lower.
  • Because these tickets are assigned at conception (like a lottery), they aren't influenced by your diet, lifestyle, or junk food.
  • If people with "high CRP tickets" get diabetes more often, you can be more confident that CRP causes the diabetes, not the junk food.

The New Problem: The "Hidden Cousin"

This paper argues that while the genetic lottery ticket method is great, it has a new, sneaky flaw that is getting worse as science advances.

The Flaw: Heritable Confounding
Imagine your genetic ticket doesn't just control your CRP levels directly. Instead, imagine the ticket controls your weight (adiposity).

  1. Being heavier makes your CRP go up.
  2. Being heavier also makes you more likely to get diabetes.
  3. So, your genetic ticket is actually pulling two strings at once: it makes you heavy, which raises CRP and causes diabetes.

In this scenario, the genetic ticket is not a pure test of CRP. It's a test of "being heavy." If you use this ticket to prove CRP causes diabetes, you are actually just proving that "being heavy causes diabetes," but you think you've proved it's the CRP. The paper calls this heritable confounding.

Why is this getting worse? (The "Deep Dive" Problem)

The paper explains that as scientists get bigger and bigger data sets (like looking at millions of people instead of thousands), they start finding weaker genetic tickets.

  • The Strong Tickets: The obvious, powerful genetic tickets that directly control CRP are easy to find. These are usually "clean" and don't cause much trouble.
  • The Weak Tickets: As we dig deeper, we find tiny, weak genetic signals. These are often "distal"—they are far away in the biological chain. They might control a tiny protein that controls another protein, which then controls your weight, which then controls CRP.

The Analogy:
Think of a Rube Goldberg machine.

  • Strong tickets are the ball hitting the first domino. It's a direct line to the result.
  • Weak tickets (found in huge studies) are the ball hitting a feather, which hits a fan, which blows a sail, which moves a boat, which hits the domino.
  • The more steps in the chain, the more likely there is a "side path" (a confounder) that messes up the result.

The paper shows that the more genetic tickets we find, the more likely we are to pick the "weak, messy" ones that are actually linked to hidden confounders (like weight). This makes the results look more biased, not less.

The "Fake Confidence" Trap

The paper warns that when scientists use standard computer methods to analyze these messy tickets, the results often look very consistent and confident.

  • It's like having a group of witnesses who all tell the same story, but they are all actually repeating the same lie because they are all influenced by the same hidden factor (the "heavy" gene).
  • The paper shows that standard methods often fail to spot this, leading researchers to believe they have found a cause-and-effect relationship when they haven't.

The Solution: How to Fix the Detective Work

The authors suggest two simple ways to fix this, using the CRP and Diabetes example:

  1. The "Multivariable" Approach (Checking the Side-Effects):
    Instead of just asking, "Does this ticket affect CRP?" ask, "Does this ticket also affect weight?"

    • If a ticket affects weight, and weight affects diabetes, we can use a statistical tool to "subtract" the weight effect. This isolates the pure CRP effect.
    • Result in the paper: When they did this for CRP, the link to diabetes disappeared. It turned out CRP wasn't the cause; weight was the real culprit.
  2. The "Steiger Filter" (The Quality Control Check):
    This is a filter that throws away any genetic ticket that explains more variation in the confounder (weight) than it does in the exposure (CRP).

    • If a ticket is better at predicting weight than it is at predicting CRP, it's a "bad ticket" for this specific test. Throw it out.
    • Result in the paper: Removing these bad tickets also made the link between CRP and diabetes vanish.

The Takeaway

The paper concludes that Mendelian Randomization is not immune to bias. In fact, as we find more and more tiny genetic signals, we might be accidentally picking up more "confounded" signals.

  • The Lesson: Just because a study uses genetics doesn't mean it's perfect. Researchers need to check if their genetic "tickets" are actually linked to other things (like weight, diet, or other diseases) that could be the real cause.
  • The Specific Finding: In the case of C-Reactive Protein and Type 2 Diabetes, the apparent link was likely an illusion caused by adiposity (body fat). Once the researchers adjusted for this "heritable confounding," the evidence that CRP causes diabetes went away.

In short: Don't trust the genetic lottery just because it's random. You have to make sure the ticket you picked isn't actually a ticket for something else entirely.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →