Reconsidering Brain Age: Why Age-Prediction Models Fail as Measures of Brain Aging

This paper argues that current brain age models are fundamentally flawed as biomarkers for accelerated aging because they are trained to prioritize shared chronological patterns while ignoring individual trajectories, leading to misleading conclusions that stable anatomical differences are signs of neurodegeneration.

Original authors: Grodem, E. O. S., Smith, S. M., Vidal-Pineiro, D., Elliott, M. L., for the Alzheimer's Disease Neuroimaging Initiative,, Walhovd, K. B., Fjell, A. M.

Published 2026-05-08
📖 5 min read🧠 Deep dive

Original authors: Grodem, E. O. S., Smith, S. M., Vidal-Pineiro, D., Elliott, M. L., for the Alzheimer's Disease Neuroimaging Initiative,, Walhovd, K. B., Fjell, A. M.

Original paper licensed under CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/). ⚕️ This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

The Big Idea: The "Brain Age" Clock is Broken

Imagine you have a machine that looks at a photo of your brain and guesses your age. If it guesses you are 60 but you are actually 50, people say, "Wow, your brain is aging fast!" If it guesses 40, they say, "Great, your brain is young!"

This paper argues that this machine is fundamentally broken. It isn't actually measuring how fast your brain is aging; it is mostly measuring how big your brain was when you were born.

The authors say that using this "Brain Age Gap" to tell if someone is aging too fast is like trying to measure how fast a car is speeding by looking at how big the car is. It just doesn't work.

The Core Problem: The "Average" Trap

To understand why the machine is broken, imagine a classroom of students taking a test every year.

  1. The Goal: The teacher wants to build a computer program that can look at a student's test score and guess their grade level (e.g., 5th grade, 6th grade).
  2. How the Computer Learns: To be good at guessing the grade, the computer looks for patterns that are the same for everyone. It learns that "5th graders usually get 80% right" and "6th graders usually get 90% right."
  3. The Mistake: The computer is trained to ignore the students who are different.
    • If a student is naturally very smart and always scores high (a stable trait), the computer thinks, "Ah, this student must be in a higher grade!"
    • If a student is naturally less skilled and always scores low, the computer thinks, "This student must be in a lower grade!"

The Catch: The computer is so focused on guessing the average grade that it ignores how much a student is actually improving or slowing down over time. It confuses being naturally big with growing fast.

The Two Experiments: Proving the Machine is Lying

The authors tested this idea with two real-world scenarios to show how the "Brain Age" machine fails.

1. The "False Alarm" Test (Birth Weight)

  • The Setup: They looked at people's birth weight. We know from science that babies born heavier tend to have bigger brains their whole lives. However, we also know that birth weight does not change how fast a brain shrinks or ages later in life. A heavy baby doesn't age faster or slower than a light baby; they just start with a bigger brain.
  • The Result: The "Brain Age" machine looked at the heavy babies and said, "Your brain is older than it should be!" It thought the big brain meant "accelerated aging."
  • The Reality: The machine was just confused. It saw a big brain (a stable trait) and mistakenly called it "fast aging." This is a False Positive.

2. The "Missed Signal" Test (Tau Proteins)

  • The Setup: They looked at Tau proteins, which are a sign of Alzheimer's disease. When these proteins build up, they cause the brain to actually shrink and change rapidly over time. This is real, active "aging" or damage.
  • The Result: The "Brain Age" machine barely noticed this. It was not very good at detecting that these people were losing brain tissue.
  • The Reality: Because the machine was trained to ignore "differences between people" (to get a perfect age guess), it actually ignored the very people who were changing the most. It missed the real danger. This is a False Negative.

The "Reliability Paradox"

The paper points out a funny irony: The better the machine gets at guessing your age, the worse it gets at telling you if you are aging strangely.

  • If the machine is perfect at guessing age, it has learned to ignore all the "noise" (the differences between people).
  • But the "noise" is exactly where the real story of aging is hiding.
  • So, a "perfect" brain age model is actually blind to the individual differences it is supposed to find.

The Conclusion: Stop Using the "Brain Age" Score

The authors conclude that we cannot use the "Brain Age Gap" (the difference between predicted age and real age) as a measure of brain health or aging speed.

  • What it actually measures: It mostly measures stable, lifelong differences in brain size and shape (like how tall you are).
  • What it fails to measure: It fails to measure the actual rate at which your brain is changing or deteriorating.

The Solution: Instead of asking, "How old does this brain look?", we should ask, "How much has this brain changed compared to where it started?" The paper suggests we need new models that look for change, not just a static guess of age. Until then, the "Brain Age" number is misleading and should not be used to diagnose accelerated aging or brain health issues.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →