Recursive Repeat Extender (RRE): A recursive approach to automatically extend repeat element models

The paper introduces the Recursive Repeat Extender (RRE), a novel algorithm that utilizes profile hidden Markov models and a recursive extension strategy to automatically improve de novo repeat libraries by generating longer, more complete models and successfully reconstructing highly degenerate and fragmented repeat elements that previous methods often miss.

Original authors: Falcon, F., Tanaka, E. M., Rodriguez-Terrones, D.

Published 2026-04-17
📖 5 min read🧠 Deep dive
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine your genome (your body's instruction manual) is a massive library. But here's the catch: a huge chunk of this library isn't filled with unique stories about you; it's filled with copy-paste errors, repeated slogans, and ancient, faded graffiti left by "jumping genes" (transposable elements) that invaded your DNA millions of years ago.

Scientists call these "repetitive elements." To understand how your body works, they need to find and catalog these repeats. But there's a problem: over millions of years, these repeats have been shredded, mutated, and scattered. They are like a shredded newspaper that has been blown across a field, with pieces missing, torn, and covered in mud.

The Old Way: The "Flashlight" Approach

Previously, scientists tried to reconstruct these shredded repeats using a method called BEEA (BLAST-Extend-Extract-Align).

Think of this like trying to reassemble a shredded newspaper using a flashlight.

  1. The Search: You shine a bright flashlight (BLAST) on the field. You can only see pieces that look exactly like the original text. If a piece is too faded or muddy (highly mutated), the flashlight misses it.
  2. The Extension: Once you find a piece, you guess what the neighbors might look like and grab them.
  3. The Limitation: You only shine the flashlight once. If the next piece of the newspaper is too muddy for the flashlight to see, you stop. You end up with a short, incomplete sentence instead of the full paragraph.

This worked okay for "young" repeats (the ones that haven't changed much), but for "ancient" repeats (the ones that have been mutating for hundreds of millions of years), the flashlight was too weak. It left huge gaps in the library.

The New Solution: RRE (Recursive Repeat Extender)

The authors of this paper, Francisco Falcon and his team, built a new tool called RRE. Instead of a flashlight, they used a super-sensitive metal detector and a never-ending scavenger hunt.

Here is how RRE works, using simple analogies:

1. The Super-Sensitive Metal Detector (HMMER)

Instead of a flashlight that needs a perfect match, RRE uses HMMER, which is like a metal detector that can sense the shape and magnetic field of the metal, even if it's rusted, bent, or half-buried in mud.

  • Why it matters: It can find the "faded" ancient repeats that the old flashlight missed. It understands that even if the letters have changed, the structure of the word is still there.

2. The Recursive Scavenger Hunt (The "Recursive" Part)

This is the magic trick. The old method took one step and stopped. RRE takes a step, then uses what it just found to take the next step.

  • Round 1: You find a piece of the newspaper.
  • Round 2: You use that piece to look for the next piece. Even if the next piece is very different from the original, it looks similar to the piece you just found.
  • Round 3: You use the new piece to find the next one.
  • The Loop: You keep doing this, hopping from fragment to fragment, like a frog jumping from lily pad to lily pad across a pond. You keep going until you can't find any more pads.

This allows RRE to bridge gaps that were previously impossible to cross, reconstructing the full story of ancient repeats.

What Did They Discover?

The team tested RRE on five different species (from worms to humans) and compared it to the old methods.

  • Longer Stories: The old methods produced short, broken sentences. RRE produced long, complete paragraphs.
  • Fewer, Better Books: The old methods created thousands of tiny, fragmented models. RRE merged them into fewer, higher-quality models.
  • More of the Library Found: Because RRE found the "muddy" pieces, it could identify more of the genome as repetitive. In humans, it found 10.9% more repetitive DNA than the previous best method.

The "Ancient" Breakthrough

The most exciting part? They used RRE to reconstruct a specific ancient repeat called CR1_Mam, which is a "fossil" from a mammal ancestor that lived 180 million years ago.

  • They started with a tiny, broken seed (just a fragment of the repeat).
  • RRE's recursive hopping found the rest of the fossil.
  • The Result: It didn't just rebuild the known part; it found 131 extra base pairs (letters) that were missing from the official reference library. It essentially "discovered" new parts of the human genome that scientists didn't know existed before.

The Bottom Line

Think of the genome as a giant, damaged jigsaw puzzle.

  • Old tools could only connect the pieces that looked exactly the same, leaving huge holes in the picture.
  • RRE is a smart assistant that can look at a piece, guess what the next piece might look like based on its shape, find it, and then use that new piece to find the next one.

This tool allows scientists to finally read the "ancient history" written in our DNA, helping us understand how our genes are regulated and how we evolved. It turns a library of shredded, unreadable papers back into a coherent, readable story.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →