An in silico framework for evaluating PRS-guided prognostic enrichment in clinical trial design

This study presents an in silico framework demonstrating that integrating polygenic risk scores into clinical trial designs to enrich for high-risk participants significantly improves statistical power, reduces required sample sizes, and accelerates event accrual across various disease contexts, though optimal enrichment thresholds must be balanced against population availability.

Cai, R., Gillard, J., Yang, S., Gasparyan, S. B., Lu, Y., Tian, L., Vedin, O., Ashley, E. A., Rivas, M. A., O'Sullivan, J. W.

Published 2026-03-24
📖 4 min read☕ Coffee break read
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you are trying to test a new medicine to prevent heart attacks. The old way of doing this is like throwing a giant net into the ocean and hoping to catch a few fish. You recruit thousands of people, wait for years, and hope that enough of them actually get sick (have a heart attack) while on the study so you can see if the medicine works.

The problem? Most people in your net are healthy. They aren't going to get sick anytime soon. So, you end up waiting a long time, spending a fortune, and often still not catching enough "fish" (events) to prove your medicine works. This is why clinical trials are so expensive and slow.

The New Idea: The "Smart Net"

This paper proposes a smarter way to design these trials using a "digital simulator" and a tool called a Polygenic Risk Score (PRS).

Think of your DNA like a weather forecast for your health. A PRS is like a "storm risk score." Some people have a genetic makeup that makes them very likely to get sick soon (a "stormy" forecast), while others are very unlikely to get sick (a "sunny" forecast).

The researchers suggest: Why not only recruit the people with the "stormy" forecasts?

If you only test the medicine on people who are genetically likely to get sick soon, you will see results much faster. You need fewer people, you spend less money, and you get your answer sooner. This is called Prognostic Enrichment.

How They Tested It (The "Time Machine" Simulation)

The researchers didn't want to wait years to see if this idea works in real life. Instead, they built a computer simulation (an in silico framework) using data from the UK Biobank, which contains health and DNA data from half a million people.

They used a clever trick: Natural Experiments.
They looked for people who naturally carry "protective" genetic mutations. Imagine these mutations are like a built-in shield.

  • The "Treatment" Group: People with the shield (the mutation).
  • The "Control" Group: People without the shield.

Since these people already have the "shield" naturally, the researchers could pretend the shield was a new drug they invented. They ran their simulation to see: If we only recruited the "stormy" people (high risk) for our trial, would we find the difference between the shield and no shield faster and cheaper?

The Results: It Depends on the Disease

They tested this idea on three different diseases: Heart Disease, Glaucoma (eye disease), and Inflammatory Bowel Disease (IBD).

  1. Heart Disease (CAD): This was a huge success. By only recruiting the top 25% of people with the highest genetic risk, they could cut the number of people needed for the trial by 60%. It was like finding the fish in a small pond instead of the whole ocean.
  2. IBD: This was even more dramatic. They could cut the required sample size by 78%.
  3. Glaucoma: This one was tricky. While picking the highest-risk people helped, if they picked too high a risk (the top 25%), there weren't enough people left to study to get a clear answer. It's like trying to find a needle in a haystack, but you only look in a tiny, tiny piece of the haystack—you might miss the needle entirely because the sample is too small.

The Big Takeaway

This paper gives scientists a blueprint or a calculator for the future.

Before they spend millions of dollars starting a real clinical trial, they can now run this simulation to ask: "If we use genetic risk scores to pick our participants, how much money and time will we save?"

The Analogy of the "Goldilocks" Zone
The study shows that there isn't one perfect rule for everyone.

  • For some diseases, you want to be very picky (only the top 25% risk).
  • For others, being too picky leaves you with too few people.
  • The goal is to find the "Goldilocks" zone: High enough risk to see results fast, but low enough risk to have enough people to study.

In Summary
This research is like giving clinical trial designers a GPS. Instead of driving blind and hoping to find a destination (a successful trial), they can now use genetic data to plot the most efficient route, saving time, money, and getting life-saving treatments to patients much faster.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →