Computing coalescence rates for complex demographies and sampling configurations

This paper introduces *demestats*, a differentiable software library that computes first-coalescence and cross-coalescence rates for complex demographic models to overcome the limitations of pairwise summaries in resolving recent population history, demonstrating improved accuracy in simulations and new insights into human expansion using 1000 Genomes Project data.

Liang, J., Terhorst, J.

Published 2026-04-10
📖 5 min read🧠 Deep dive
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you are a detective trying to solve a family mystery, but instead of looking at old photo albums, you are looking at the DNA of people living today. Your goal is to reconstruct the history of their ancestors: when did their populations grow? When did they split into different groups? When did they mix with neighbors?

For a long time, detectives had a major blind spot. They could only look at pairs of people (like comparing just you and your cousin). This is like trying to figure out the history of a massive city by only looking at two random people on the street. You can tell a lot about the distant past (when the city was a tiny village), but you can't see what happened yesterday or last week. Recent events are too rare to show up in just two people's DNA.

This paper introduces a new detective tool called demestats that solves this problem by looking at groups of people instead of just pairs.

Here is a simple breakdown of how it works, using some everyday analogies:

1. The Problem: The "Two-Person" Blind Spot

Think of a family tree as a river flowing backward in time.

  • The Old Way (Pairs): If you only trace two branches of the river, they might not merge (coalesce) until they reach a very old, deep part of the history. It's like trying to find out if it rained yesterday by looking at two trees that only share roots from 100 years ago. You miss the recent rain.
  • The Result: Scientists could guess the size of ancient populations well, but they were terrible at guessing how fast populations grew recently or how recently two groups separated.

2. The Solution: The "Crowd" Approach

The authors built a software library called demestats. Instead of looking at two people, it looks at a crowd (say, 50 people) at once.

  • The Analogy: Imagine a crowded room where everyone is trying to find their long-lost twin.
    • If you have 2 people, they might not find each other for a long time.
    • If you have 50 people, the odds are high that someone in that crowd will find their twin very quickly.
  • The Magic: By watching when the first person in a large group finds their match, demestats gets a very clear signal about what happened recently. The "first merge" in a large group happens much faster and tells us exactly how big the population was just a few generations ago.

3. How It Handles Complexity: The "Traffic Control" System

Real human history isn't simple. People move between cities (migration), populations split, and they mix again. Calculating the odds for a crowd in this messy scenario is a mathematical nightmare.

  • The Exact Method: For small groups, the software does the math perfectly, tracking every single possible path a family line could take. It's like a traffic controller watching every single car on a small road network.
  • The "Mean-Field" Shortcut: For huge groups (like 50 or 100 people), tracking every single car is impossible. So, the software uses a clever shortcut. Instead of tracking individual cars, it tracks the average flow of traffic.
    • Analogy: Instead of counting every drop of water in a rushing river, you just measure the speed and volume of the river as a whole. This is fast and usually accurate enough to get the job done.

4. What They Discovered

Using this new tool, the authors tested it on real human data (from the 1000 Genomes Project) and found some cool things:

  • Recent Growth: They confirmed that human populations have exploded in size very recently (in the last few thousand years). The "crowd" method gave a much sharper picture of this explosion than the old "pair" method.
  • Migration: They could detect when different groups (like Europeans and Asians) were still mixing with each other much more recently than previously thought.
  • The "Blind Spot" Fix: They showed that if you want to know about recent history, you don't need to know every tiny detail about ancient history. By focusing on large groups, the tool ignores the "noise" of the deep past and focuses on the recent signal.

5. The Catch (The "Blurry Camera")

The paper also admits a limitation. The tool relies on "tree sequences," which are reconstructed family trees built from DNA.

  • The Analogy: It's like trying to reconstruct a movie from a blurry, low-resolution recording.
  • The Issue: The software is great, but the "camera" (the method used to build the family trees from DNA) sometimes gets the timing slightly wrong, making recent events look like they happened a bit more recently than they actually did. The authors warn that while the tool is powerful, the input data needs to be as clear as possible.

Summary

demestats is a new mathematical microscope.

  • Old Microscope: Looked at 2 people. Good for seeing the distant past, but blurry for the recent past.
  • New Microscope: Looks at 50+ people. It brings the recent past into sharp focus, allowing us to see exactly how fast human populations grew and mixed in the last few thousand years.

It's a game-changer for understanding our recent human story, turning a blurry snapshot into a high-definition video of our ancestors' recent lives.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →