Outcome Risk Modeling for Disability-Free Longevity: Comparison of Random Forest and Random Survival Forest Methods

In a study using ASPREE trial data, Random Survival Forests (RSF) demonstrated comparable discrimination and calibration to standard Random Forests (RF) for predicting disability-free longevity, suggesting that RSF does not always offer superior predictive accuracy over RF for time-to-event outcomes.

Vanghelof, J. C., Tzimas, G., Du, L., Tchoua, R., Shah, R. C.

Published 2026-02-17
📖 3 min read☕ Coffee break read
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you are trying to predict who will stay healthy and active for a long time, and who might face challenges like memory loss, physical disability, or passing away sooner. This is a bit like trying to forecast the weather for a specific group of people over the next decade.

To do this, scientists used a massive group of older adults from a big medical study called ASPREE. They wanted to see which "weather forecasting tool" was better at predicting these life events. They compared two very smart computer programs:

  1. Random Forest (RF): Think of this as a team of experts looking at a snapshot of a person's life right now. They look at 115 different clues (like age, blood pressure, lifestyle, and genetics) to guess what might happen. However, this team doesn't really care when things happen; they just care if they happen.
  2. Random Survival Forest (RSF): This is the same team of experts, but they have a stopwatch. They look at the same 115 clues, but they also pay close attention to the timing. They ask, "Will this happen next month? Next year? Or in ten years?"

The Big Question:
The researchers had a hunch that the team with the stopwatch (RSF) would be better at predicting the future because life is all about timing. They thought knowing when something happens would make the prediction more accurate.

The Experiment:
They split the group of 2,291 people into two teams: a "training class" to teach the computers, and a "test class" to see how well they learned. They fed both computer programs the same massive list of 115 health clues. The goal was to see who could best predict the first time a person experienced dementia, disability, or death.

The Results:
Here is the surprise: The stopwatch didn't help.

When the researchers checked the results, both teams performed almost exactly the same.

  • Accuracy: Both were about 75% good at spotting who would have an event, and about 57% good at spotting who wouldn't.
  • Timing: Even though one team had a stopwatch, they didn't get a better score at predicting the specific timing of events.
  • Reliability: Both teams were equally good at matching their predictions to reality.

The Takeaway:
It's like having two chefs. One chef just looks at the ingredients to guess if a cake will taste good. The other chef looks at the ingredients and checks the oven timer. You'd expect the second chef to be better, right? But in this specific kitchen, they both made cakes that tasted exactly the same.

The study concludes that for this specific group of people and this specific health question, adding the "time" element didn't make the prediction any sharper. Sometimes, the simpler tool (the snapshot) works just as well as the complex tool (the stopwatch).

What's Next?
The scientists say we shouldn't throw away the stopwatch just yet. Maybe in other groups of people or with different health problems, the timing does matter. They need to test these tools in other "kitchens" to figure out exactly when the extra complexity is worth the effort.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →