Nationwide Prediction of Missed and Cancelled Appointments Using Real-World EHR Data

This retrospective study utilizing a large national EHR dataset demonstrates that machine learning models, particularly XGBoost, can accurately predict unused outpatient appointments with an AUC of 0.95, suggesting that integrating such predictive tools into scheduling workflows could significantly improve healthcare efficiency.

Original authors: Miran, S. A., Cheng, Y., Faselis, C., Brandt, C., Vasaitis, S., Nesbitt, L., Zanin, L., Tekle, S., Ahmed, A., Nelson, S. J., Zeng-Treitler, Q.

Published 2026-04-13
📖 4 min read☕ Coffee break read

Original authors: Miran, S. A., Cheng, Y., Faselis, C., Brandt, C., Vasaitis, S., Nesbitt, L., Zanin, L., Tekle, S., Ahmed, A., Nelson, S. J., Zeng-Treitler, Q.

Original paper licensed under CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/). ⚕️ This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine a massive, bustling train station where millions of people try to catch a train every day. Sometimes, they show up on time. Other times, they forget, get stuck in traffic, or decide to stay home. When a train leaves with empty seats because no one showed up, it's a waste of fuel, time, and money. In the world of healthcare, these "empty seats" are missed or cancelled doctor appointments.

This study is like hiring a team of super-smart detectives to figure out exactly who is likely to miss their train and why, so the station can run more smoothly.

Here is the breakdown of their investigation in plain English:

The Mission: Predicting the "No-Shows"

The researchers wanted to build a crystal ball (a computer model) that could look at a patient's file and say, "Hey, there's a high chance this person won't make it to their appointment."

They didn't just guess; they used a giant digital library of medical records from across the entire United States. It's like looking at the history of 5.7 million people to find patterns. They focused on adults who had routine check-ups between 2010 and 2025.

The Detective Work: How They Did It

To solve the mystery, the team compared three different types of "detectives" (algorithms):

  1. The Old-School Detective (Logistic Regression): A traditional method that looks for straight-line connections.
  2. The Team of Experts (Random Forest): A group of decision-makers that vote on the answer.
  3. The Super-Genius (XGBoost): A highly advanced machine learning tool that can spot incredibly complex, hidden patterns that the others miss.

They fed these detectives clues like:

  • Who is the patient? (Age, insurance type, where they live).
  • When is the appointment? (Is it a rainy Monday morning? Is it peak flu season?).
  • What's their history? (Have they missed appointments before? How long has it been since their last visit?).

The Results: The Super-Genius Wins

After running the numbers, the results were clear:

  • The Old-School Detective was okay, getting about 89% of the predictions right.
  • The Team of Experts did better, hitting 92%.
  • The Super-Genius (XGBoost) was the champion, achieving a 95% accuracy rate.

This means the computer could predict with almost perfect certainty whether an appointment would be wasted or attended.

The "Why": Uncovering Hidden Patterns

The researchers didn't just want a black box that gave answers; they wanted to know how it worked. They used a special tool called "Explainable AI" to peek under the hood.

They found that the reasons people miss appointments aren't simple. It's not just "older people miss more." It's a messy, non-linear mix of factors. For example, a specific time of day might only be a problem for people with a certain type of insurance living in a city, but not for someone in the country. The computer figured out these weird, specific combinations better than any human could.

The Big Picture: Why This Matters

The Problem: When patients miss appointments, doctors sit idle, and other patients who need care can't get in. It's like a restaurant with empty tables while people are waiting in line.

The Solution: If clinics can use this "Super-Genius" computer to predict who is likely to miss an appointment, they can take action before the day arrives. They might send a reminder, offer a different time slot, or fill that empty seat with someone else on a waitlist.

The Catch (Limitations)

Even with a 95% success rate, the study had a few hiccups:

  • The "Black Box" of History: Since they looked at old records, they couldn't ask the patients, "Why did you miss?" They had to guess based on what was written down.
  • The Mix-Up: The database didn't always clearly say if a patient cancelled (which is polite) or no-showed (which is rude). The model treated them both as "unused," which is a bit like grouping a guest who politely declined an invitation with one who ghosted you.

The Takeaway

This study proves that we can use the data we already have in our computers to stop wasting time and money in healthcare. By treating appointment scheduling like a weather forecast—predicting the "storm" of a missed appointment—we can keep the healthcare system running on time, ensuring that when a doctor is ready to see you, you are actually there.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →