High-resolution retrospective single cell lineage tracing with mutable homopolymers

The paper introduces RETrace2, a high-resolution single-cell dual-omic method that leverages highly mutable homopolymers and sparse methylation profiling to simultaneously reconstruct detailed cell lineage trees and identify cell types with unprecedented accuracy in both in vitro and in vivo models.

Cheng, P.-C., Kamenev, D., Kameneva, P., Fitzpatrick, C., Adameyko, I., Kharchenko, P. V., Zhang, K.

Published 2026-03-12
📖 5 min read🧠 Deep dive
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine trying to figure out the family history of a massive, bustling city. You want to know who moved in first, which families stayed in the same neighborhood, and how the city grew from a single house into a sprawling metropolis.

In biology, this "city" is your body, and the "families" are the trillions of cells that make up your organs. For a long time, scientists could only guess at this history or had to tag cells with artificial "name tags" that often ran out or got confusing.

This paper introduces RETrace2, a new, super-powered detective tool that lets scientists read the natural "diaries" written inside our cells to reconstruct their entire family tree.

Here is how it works, explained through simple analogies:

1. The Problem: Reading a Faded Diary

Every time a cell divides (makes a copy of itself), it makes tiny, accidental typos in its DNA. These are called mutations. Over time, these typos accumulate, creating a unique history for every cell.

  • The Old Way: Previous tools tried to read these typos, but they were like trying to read a diary written in pencil that had been left in the rain. The "typos" were too few, too hard to find, or the tools made so many mistakes reading them that the family tree looked like a mess.
  • The New Tool (RETrace2): The authors realized that some parts of the DNA are like sticky notes that get messed up much faster than others. They call these homopolymers (long strings of the same letter, like "AAAAAA"). Because these "sticky notes" change so often, they act like a high-speed camera, capturing a new "snapshot" of the cell's history almost every time it divides.

2. The Upgrade: From a Flashlight to a Super-Scanner

The researchers didn't just find better "sticky notes"; they built a whole new way to read them.

  • The "Sticky Note" Discovery: They found that these single-letter repeats (homopolymers) are nearly twice as informative as the old types of repeats they used before. It's like switching from reading a book with big, spaced-out letters to reading a book with dense, detailed paragraphs.
  • Fixing the Noise: Homopolymers are tricky to read because they are so repetitive (like trying to count a long line of identical ants). The team invented a new protocol to stop the "reading errors." They used better enzymes (the "scissors" that cut the DNA) and a new type of sequencer (the "camera") that doesn't get confused by long strings of the same letter.
  • The Result: They increased the number of "clues" they could find in a single cell by 21 times and the number of shared clues between cells by 98 times. This is the difference between trying to solve a mystery with 5 clues versus having 500.

3. The Test: The "Ground Truth" Game

To prove their tool works, they played a game with a known answer.

  • The Lab Test: They took a single cell, let it multiply into a huge family in a petri dish, and then used RETrace2 to build the family tree. Because they knew exactly how the cells were related (the "ground truth"), they could check if the tool got it right.
  • The Result: With the new tool, they reconstructed the family tree with 100% accuracy. Even in "normal" cells (which don't mutate as fast), they could still trace the lineage back about 60 generations. In "fast-mutating" cells (like cancer cells or mice with a specific gene defect), they could theoretically trace back fewer than 5 cell divisions. That's like being able to tell exactly which generation great-great-grandparent you are descended from, just by looking at a single cell today.

4. The Real-World Adventure: The Mouse City

Finally, they took the tool into the wild. They used a special mouse that has a "hyper-active" mutation rate (making the DNA typos happen faster, like a diary being written in fast-forward).

  • The Mission: They collected cells from the mouse's brain, kidney, and liver.
  • The Discovery: They built a massive family tree showing how these organs developed.
    • They found that the brain, kidney, and liver are like three different neighborhoods in the city. While some early "founding families" spread out to build all three neighborhoods, other families stayed strictly in one.
    • They even figured out what kind of cell each one was (e.g., a brain cell vs. a liver cell) by reading a second layer of data (methylation) from the same sample. It's like reading the family tree and the cell's ID card at the same time.

Why This Matters

Think of RETrace2 as a time machine for biology.

  • Before: We could only see the "present day" of our cells or get a blurry, low-resolution picture of the past.
  • Now: We can watch the movie of development in high definition. We can see exactly how a single fertilized egg splits and specializes into a complex human being.

This technology could help us understand:

  • Development: How we grow from a baby to an adult.
  • Disease: How cancer starts and spreads (clonal evolution).
  • Aging: How our cells change over a lifetime.

In short, the authors turned a blurry, static photo of our cellular history into a crisp, high-definition movie, allowing us to finally see the full story of how we are built.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →