Machine learning-based advanced coronary artery disease pretest probability model: Comparison with conventional pretest probability models

This study developed and validated a machine learning-based pretest probability model (K-CAD) using ridge-penalized logistic regression on extensive Korean datasets, demonstrating that it significantly outperforms conventional models (UDF and CAD2) in predicting coronary artery disease risk within the Korean population.

Hong, Y., Lee, J., Park, H.-B., Kim, W., Yoon, Y. E., Jeong, H., Kim, G., So, B., Lee, J., Dalakoti, M., Sung, J. M., Kook, W., Chang, H.-J.

Published 2026-03-27
📖 4 min read☕ Coffee break read
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you are a doctor trying to figure out if a patient has a clogged pipe in their heart (coronary artery disease). To decide whether to send them for a risky, expensive, and invasive test (like a camera inside the heart), you first need to guess the odds. This guess is called the Pretest Probability.

For decades, doctors have used two main "rulebooks" (models) to make this guess: the Updated Diamond–Forrester (UDF) and the CAD Consortium (CAD2) models. Think of these rulebooks as weather forecasts created by meteorologists in London. They are great for predicting rain in London, but if you take them to Seoul, they might say it's going to pour when it's actually sunny, or vice versa.

Here is the story of how this paper fixed that problem for people in Korea.

1. The Problem: The "London Weather Forecast" in Seoul

The old rulebooks were built using data mostly from Western populations (Europe and the US). However, people in Korea (and East Asia generally) have different body types, different diets, and different patterns of heart disease.

When Korean doctors used the old "London" rulebooks, the results were messy:

  • False Alarms: The models kept screaming "DANGER!" for people who were actually fine.
  • The Consequence: Because the models said "High Risk," too many healthy people were sent for unnecessary, scary, and expensive heart scans. It was like calling a fire truck for a burnt piece of toast.

2. The Solution: Building a "Seoul-Specific" Weather App

The researchers in this paper decided to build a brand-new model specifically for Koreans, which they named K-CAD.

Instead of just looking at the basics (Age, Gender, and "Do you have chest pain?"), they built a smarter system that also looked at routine blood test results that almost everyone gets at the doctor's office anyway.

  • The Ingredients: They took data from nearly 5,000 Korean patients. They fed the computer information about cholesterol, blood sugar, kidney function, and blood pressure, along with the usual symptoms.
  • The Engine: They used a special type of math (Ridge-Penalized Logistic Regression). Think of this as a "smart filter" that prevents the computer from getting confused by too much information or memorizing the wrong answers. It finds the true signal in the noise.

3. The Test Drive: How did it perform?

The team put their new K-CAD model to the test against the old rulebooks using two different groups of people:

  1. High-Risk Patients: People already in the hospital with chest pain.
  2. The General Public: Over 117,000 people from a national health checkup database.

The Results:

  • The Old Models (UDF & CAD2): They were okay, but they kept misclassifying people. They were like a security guard who stops everyone at the airport, even the people just visiting their grandma.
  • The New Model (K-CAD): It was much sharper.
    • It correctly identified more people who actually had heart blockages.
    • Crucially, it stopped screaming "High Risk" for people who were actually low-risk. It successfully reclassified nearly 80% of the people the old models wrongly labeled as "High Risk" into "Low Risk."

4. The Analogy: The Tailored Suit vs. The Off-the-Rack Jacket

Imagine the old models are like buying a suit off the rack from a store in New York. It might fit a tall, broad-shouldered American, but for a Korean man, the sleeves might be too long, and the shoulders too tight. You can wear it, but it's uncomfortable and doesn't look right.

The K-CAD model is like a custom-tailored suit made specifically for the Korean body type. It uses measurements (blood tests and symptoms) that fit the local population perfectly. It fits better, looks better, and tells you exactly how you need to dress (or in this case, how much medical attention you need).

5. Why Does This Matter?

This isn't just about math; it's about saving people from unnecessary stress and money.

  • Less Unnecessary Testing: By accurately identifying low-risk patients, fewer people will be sent for invasive heart procedures they don't need.
  • Better Care: Doctors can focus their resources on the people who actually need them.
  • Transparency: Unlike some "black box" AI models that no one understands, this model is open. The authors even built a free online calculator so any doctor can use it right now.

The Bottom Line

The researchers took a tool that was designed for the West, realized it didn't fit the East, and built a new, smarter tool using local data and routine blood tests. It's a "Seoul-specific" weather forecast that finally tells the truth about the rain in Korea, helping doctors make better decisions without overreacting.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →