Assembling a fully-dated complete tree of life

This paper introduces novel linear-time algorithms to overcome previous computational limitations, enabling the creation of a fully-dated, time-scaled phylogenetic tree of 2.3 million species to significantly expand the scope of evolutionary analyses.

Duke, J. D., Guo, J., Forest, F., Gumbs, R., McTavish, E. J., Rosindell, J.

Published 2026-03-20
📖 5 min read🧠 Deep dive
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you have a massive, ancient family album containing every single living creature on Earth—2.3 million species, from the tiniest bacteria to the blue whale. You can see who is related to whom (the family tree), but the album is missing the most important part: dates. You don't know when your great-great-grandparents were born, or when the family first split into different branches. Without these dates, the album tells a story of who, but not when or how fast things happened.

This paper is about fixing that missing timeline. The authors created a brand-new, fully-dated "Family Album of Life" that covers every known species, and they did it by inventing a super-fast way to fill in the gaps.

Here is the story of how they did it, broken down into simple concepts:

1. The Problem: A Tree with Missing Pages

Scientists already had a "Supertree" called the Open Tree of Life. Think of this as a giant, messy draft of the family album.

  • The Good News: It includes almost every known species (2.3 million of them!).
  • The Bad News: It's full of blank spaces. Many branches are just fuzzy blobs (called polytomies) because we don't have DNA data for every species yet. Worse, most of the branches have no dates. It's like having a map of the world where you know the cities are connected, but you don't know how long the roads are or when they were built.

Previously, trying to fill in these dates was like trying to count every grain of sand on a beach using a teaspoon. The old computer methods were too slow and required too much memory (like needing a warehouse full of hard drives) to handle a tree this big.

2. The Solution: The "Lightning-Fast" Calculator

The authors invented new computer algorithms (mathematical recipes) that act like a high-speed express train instead of a slow, winding bus.

  • The Old Way: If you doubled the size of the tree, the time it took to calculate the dates quadrupled. It was impossible to run on the whole tree.
  • The New Way: Their new method scales linearly. If you double the tree, it only takes double the time.
  • The Result: They calculated dates for all 2.3 million species in just 26 seconds. To put that in perspective, the old method would have taken 14 days and crashed your computer because it ran out of memory.

3. How They Filled the Gaps: The "Equal Splits" Trick

Since they didn't have dates for every node (branching point), they had to guess. But they didn't just guess randomly; they used logic.

Imagine you know your great-grandfather was born in 1900 and your father was born in 1950. You don't know exactly when your grandfather was born, but you know it was somewhere in between.

  • The Method: They looked at the known dates (like 1900 and 1950) and "stretched" the time evenly across the unknown branches in between.
  • The Innovation: They tested five different ways to stretch this time. They found that a specific mix of methods (which they called EQS-LS) was the most accurate. It was like finding the perfect recipe that balanced the "longest path" and "shortest path" guesses to get the most realistic timeline.

4. Dealing with Uncertainty: The "Cloud of Possibilities"

Science is rarely about having one single "truth." It's about understanding the range of possibilities.

  • Topological Uncertainty: Sometimes we aren't sure exactly how two species are related (the shape of the branch). The authors created 501 different versions of the tree, each with slightly different shapes, to see how much that changed the results.
  • Temporal Uncertainty: Sometimes different studies give different dates for the same event (e.g., Study A says 100 million years ago, Study B says 110 million). They randomly picked different dates from these studies to create a "cloud" of possible timelines.

By combining these, they didn't just give you one answer; they gave you a distribution of answers, showing the most likely timeline and the range of error.

5. The Big Reveal: How Much "Evolutionary History" Do We Have?

Once they had the dates, they could calculate Phylogenetic Diversity (PD). Think of PD as the total amount of "evolutionary time" represented by a group of species. It's like measuring the total length of all the branches in the family tree.

  • The Finding: The total evolutionary history of all life on Earth is roughly 39.8 trillion years.
  • The Comparison: This is a massive number. It helps conservationists understand that losing a species isn't just losing one animal; it's erasing millions of years of unique evolutionary history. For example, losing a unique type of shark might erase more evolutionary history than losing a common bird.

Why This Matters

Before this paper, scientists could only study the "rich" parts of the tree of life (like birds and mammals) where they had enough DNA data. Now, they have a fully-dated map for everything.

  • For Conservation: It helps prioritize which species to save to protect the most unique evolutionary history.
  • For Biology: It allows researchers to study how traits (like wings or venom) evolved over time across the entire planet.
  • For the Future: The authors made all their data and code free for everyone. It's like handing the keys to the library to the whole world, so anyone can explore the history of life without needing a supercomputer.

In a nutshell: The authors built a super-fast engine that took a messy, undated sketch of the entire tree of life and turned it into a precise, time-stamped movie of evolution, allowing us to see the full story of life on Earth for the first time.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →