Inference of population demographic history captures differing evolutionary signals based on the number of individuals in the dataset

This study demonstrates that demographic inference results are critically dependent on sample size, as varying the number of individuals can shift inferred evolutionary signals from ancient contractions to recent expansions by altering which historical epoch dominates the coalescent branch lengths.

Mah, J. C., Lohmueller, K. E.

Published 2026-04-08
📖 3 min read☕ Coffee break read
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you are trying to reconstruct the history of a family by looking at a photo album. But instead of photos, you have a collection of tiny genetic clues (DNA) left behind by ancestors. Scientists use these clues to figure out if the family once shrank to just a few people (a bottleneck) and then exploded in size (an expansion).

This paper is about a tricky problem: The size of the photo album changes the story you tell.

The Problem: Too Few or Too Many Photos?

The researchers found that the number of people (individuals) you include in your study acts like a camera lens with a changing focus.

  • Small Sample Size (A blurry, zoomed-out lens): If you only look at a few people, the genetic clues make it look like the family had a long, slow decline in the distant past. It's like looking at a forest from far away and only seeing the bare, old trees, missing the new saplings.
  • Large Sample Size (A sharp, zoomed-in lens): If you look at hundreds of people, the story flips. The clues suddenly scream, "We just had a baby boom!" It's like walking into that same forest and suddenly seeing thousands of new, young trees that were invisible from afar.

The Experiment: A Tale of Two Epochs

To test this, the scientists created a "fake" family history with two distinct chapters:

  1. Chapter 1 (Ancient Times): A terrible famine caused the population to crash (a bottleneck).
  2. Chapter 2 (Recent Times): The survivors recovered and the population grew rapidly.

They then ran their detective work on this fake history using different numbers of "witnesses" (sample sizes).

  • With few witnesses: The detectives concluded, "This family has been shrinking for a long time." They missed the recent boom.
  • With many witnesses: The detectives concluded, "This family is exploding with new life!" They missed the ancient crash.

The "Why": The Branch Length Analogy

Why does the lens change the story? The authors explain this using a family tree analogy.

Imagine the history of a population as a giant, upside-down tree. The roots are the ancient past, and the branches are the present.

  • Ancient events (like the famine) affect the thick, main trunk of the tree.
  • Recent events (like the boom) affect the tiny, delicate twigs at the very tips.

When you have a small sample, you are mostly grabbing the thick trunk. You see the old, heavy history clearly, but the tiny new twigs are too small to notice.
When you have a large sample, you are grabbing so many branches that you finally catch all those tiny, new twigs. The "recent boom" becomes the loudest signal, drowning out the quiet history of the old trunk.

The Takeaway

The paper teaches us that there is no single "correct" sample size that tells the whole truth.

  • If you only look at a few people, you might think a population is dying out.
  • If you look at many, you might think it's thriving.

The Solution: Don't just pick one number and stick with it. The best way to understand a population's full history is to look at the data through multiple lenses (different sample sizes). By doing this, you can see both the ancient tragedy (the bottleneck) and the modern triumph (the expansion), piecing together the complete, complex story of the family's past.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →