Cross-ancestry performance of Parkinson's disease polygenic risk scores in admixed Latin American populations

This study demonstrates that in admixed Latin American populations, polygenic risk scores for Parkinson's disease derived from large European GWAS currently outperform those from smaller ancestry-matched datasets, though methods incorporating functional annotations like SBayesRC offer the best predictive performance, highlighting the urgent need for larger, diverse genetic studies to ensure equitable clinical translation.

Flores-Ocampo, V., Reyes-Perez, P., Ogonowski, N. S., Sevilla-Parra, G., Diaz-Torres, S., Leal, T. P., Waldo, E., Ruiz-Contreras, A. E., Alcauter, S., Arguello-Pascualli, P., Mata, I. F., Renteria, M. E., Medina-Rivera, A., Dennis, J. K.

Published 2026-03-03
📖 5 min read🧠 Deep dive
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

The Big Picture: A Recipe for Prediction

Imagine you are trying to bake the perfect cake (predicting who might get Parkinson's disease). To do this, you need a recipe (a Polygenic Risk Score, or PRS) that tells you which ingredients (genetic markers) matter most.

For a long time, scientists have only had access to recipes written by bakers from Europe. These recipes work great if you are baking for a European audience. But when you try to use those same European recipes to bake for Latin American populations, the cake often turns out flat or tastes weird. Why? Because Latin American populations are a unique "three-way mix" of Native American, European, and African ancestry. The genetic "ingredients" and how they interact are different.

This paper asks: How can we bake the best cake for Latin American people when our best recipes come from Europe, and our local recipes are still being written?


The Experiment: Testing Different Bakers

The researchers gathered a huge group of people from Latin America (about 3,300 individuals, split between those with Parkinson's and those without). They wanted to see which "baking method" worked best to predict the disease.

They tested four different "bakers" (statistical tools) using three different "cookbooks" (data sources):

  1. The Cookbooks (Data Sources):

    • The Giant European Cookbook: A massive collection of data from over 80,000 European people. It's huge and detailed, but it doesn't know much about Latin American genetics.
    • The Small Local Cookbook: A tiny collection of data from about 1,500 Latin American people. It's culturally perfect, but it's so small it's missing many important details.
    • The Mixed-World Cookbook: A medium-sized collection mixing data from Europe, Latin America, Africa, and Asia.
  2. The Bakers (Methods):

    • The Traditionalist (PRSice-2): Uses a simple "clump and threshold" approach. It's like picking the top 10 ingredients from a list. It's fast but often misses the nuance.
    • The Smart Chef (SBayesRC): A newer method that uses "functional annotations." Think of this as a chef who not only looks at the ingredients but knows which ingredients are actually biologically important (like knowing that flour matters more than a random speck of dust).
    • The Bridge Builder (PRS-CSx & BridgePRS): These methods try to "bridge" the gap. They take the big European data and the small local data and try to blend them together to create a custom recipe for the mixed population.

The Results: Who Won the Bake-Off?

Here is what they found, translated into plain English:

1. Size Matters (The "Big Cookbook" Wins)
Surprisingly, the Smart Chef (SBayesRC) using the Giant European Cookbook performed the best overall.

  • The Analogy: Even though the European recipe wasn't written for Latin Americans, it was so detailed and complete that it was still better than using a tiny, incomplete local recipe. The sheer volume of data from Europe outweighed the fact that the ancestry wasn't a perfect match.
  • The Catch: The local recipe (Latin American data) was just too small to be useful on its own yet.

2. The "Best" Depends on What You Measure

  • If you wanted to know how much risk a person had (the "odds"), the European-based score was the winner.
  • If you wanted to know how well the score could distinguish between sick and healthy people (the "AUC"), the score using the Mixed-World Cookbook was slightly better.
  • The Analogy: The European recipe told you how bad the cake might be, while the Mixed recipe was slightly better at telling you which cake was the "bad" one and which was the "good" one.

3. The "Mix" Matters
The researchers noticed that the more "European" ancestry a person had, the better the prediction worked.

  • The Analogy: If you are baking a cake that is 80% European flour and 20% local flour, the European recipe works great. If you are baking a cake that is 80% local flour, the European recipe starts to fail. The prediction gets weaker as the person's genetics get further away from the European data source.

The Takeaway: What Does This Mean for the Future?

The Good News:
We can already use the massive European data to help Latin American people, especially if we use smart tools (like SBayesRC) that know how to filter out the noise. It's better than nothing, and it's significantly better than using our tiny local data alone.

The Bad News:
We are still relying too much on European data. The "Local Cookbook" is too small. Until we gather more data from Latin American, African, and other underrepresented populations, we can't build the perfect, custom recipe for them.

The Solution:
The paper argues that we need to stop just "translating" European recipes. We need to hire more local bakers and write more local cookbooks. Programs like GP2 (Global Parkinson's Genetics Program) are doing exactly this—gathering more diverse data so that one day, we can predict Parkinson's risk accurately for everyone, regardless of their ancestry.

Summary in One Sentence

While we can currently use massive European genetic data to predict Parkinson's risk in Latin American populations (especially with smart tools), the most accurate predictions will only come when we finally build large, diverse genetic databases that truly represent the world's mixed populations.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →