Handling onset age inconsistencies in longitudinal healthcare survey data

This paper proposes and evaluates two methods—a reliability score-based stratification and a Bayesian adjustment model—to resolve inconsistencies in self-reported onset ages within longitudinal healthcare surveys, demonstrating that both approaches significantly enhance data quality, predictive performance, and disease clustering coherence using data from the Canadian Partnership for Tomorrow's Health.

Li, W., Yuan, M., Park, Y., Dao Duc, K.

Published 2026-02-23
📖 5 min read🧠 Deep dive
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you are trying to build a massive, detailed map of everyone's health history using a survey. You ask people, "When did you first get sick with Condition X?" You ask them again five or ten years later.

Ideally, they should say the same thing both times. But in reality, human memory is like a leaky bucket. Sometimes people forget, sometimes they guess, and sometimes they just get the date wrong. One person might say, "I got diabetes at 45," and five years later, they say, "Oh, I think it was actually 52."

This creates a mess for scientists. If you throw away all the answers that don't match, you lose half your data. If you keep them, your map is full of holes and errors.

This paper is about two clever ways to fix this "leaky bucket" problem without throwing away the water.

The Problem: The "Fuzzy Memory" Effect

The researchers looked at a huge Canadian health study (CanPath) with nearly 100,000 people. They found that 57% of people gave different answers about when their diseases started when asked again later. It's like asking someone, "What year did you graduate high school?" and getting a different answer every time you ask.

Solution 1: The "Trust Score" (Stratification)

The Analogy: The Jury Selection

Imagine you are a judge trying to solve a mystery. You have 100 witnesses. Some witnesses are known to be very sharp and consistent; others are known to be forgetful or prone to exaggeration.

Instead of listening to everyone equally, you decide to split the witnesses into two groups:

  1. The "Sharp" Group: People who gave consistent answers across all their health questions.
  2. The "Fuzzy" Group: People whose answers jumped around a lot.

How they did it:
The authors created a "Reliability Score" for every single person. They looked at how much a person's answers changed over time. If your answers were all over the place, you got a low score. If you were consistent, you got a high score.

The Result:
When the researchers looked only at the "Sharp" group, the patterns in the data became crystal clear.

  • Better Connections: They could see that diseases like high blood pressure and heart attacks were much more tightly linked in the "Sharp" group than in the "Fuzzy" group.
  • Clearer Clusters: It was like looking at a blurry photo and suddenly putting on glasses. Diseases that belong together (like different types of gut issues) started grouping together naturally, whereas in the "Fuzzy" group, they were scattered randomly.
  • Better Predictions: When they tried to predict future health issues, the models trained on the "Sharp" group were much more accurate.

When to use this: Use this if you have a huge crowd of people and you can afford to ignore the "fuzzy" ones to get a cleaner picture.

Solution 2: The "Time-Traveling Detective" (Bayesian Adjustment)

The Analogy: The Detective's Best Guess

Now, imagine you can't throw anyone away. Maybe you only have a small group of people, or you need every single data point. What do you do with the person who said "45" the first time and "52" the second time?

Instead of picking one or the other, the researchers acted like a detective using a special math formula (Bayesian statistics). They treated the "true" age of onset as a secret that is hidden behind two noisy clues.

  • Clue A: The answer given at the start (Age 45).
  • Clue B: The answer given later (Age 52).

The detective knows that memory gets worse as people get older and as more time passes. So, the math formula weighs the clues. It asks: "Given that the person is older now, and given that memory fades, what is the most likely true age?"

It doesn't just pick 45 or 52. It calculates a "corrected" age (maybe 48.5) that accounts for the fact that human memory is imperfect. It essentially smooths out the bumps in the road.

The Result:

  • Stronger Links: Just like the "Trust Score" method, this method made the connections between related diseases stronger and more logical.
  • Supercharged Predictions: When they used these "corrected" ages to predict things like diabetes, the predictions got significantly better. The more variables they fixed at once, the better the results became.

When to use this: Use this if you have a small group of people and can't afford to lose any data, or if you need to keep the uncertainty in your calculations (like a detective noting how sure they are about their guess).

The Big Picture: Which Tool to Use?

The authors give a simple guide for doctors and researchers:

  1. Use the "Trust Score" (Solution 1) if: You have a massive dataset (like a stadium full of people). You can afford to set aside the "fuzzy" answers to get a super-clear view of the "sharp" ones. It's fast and easy to explain.
  2. Use the "Time-Traveling Detective" (Solution 2) if: You have a smaller group, or you need to keep every single person in the study. It's more complex math, but it saves your data and gives you a "corrected" version of the truth.

Why This Matters

Health surveys are the backbone of modern medicine. If the data is messy, our understanding of disease is messy. By fixing these inconsistencies, this paper helps scientists draw a clearer, more accurate map of human health. It's the difference between navigating with a blurry, torn map versus a high-definition GPS.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →