Verticall: A fast and robust tool for recombination detection in large-scale bacterial genomic datasets

Verticall is a fast, non-parametric, open-source tool designed to efficiently detect recombination and generate recombination-free phylogenies in large-scale bacterial genomic datasets (ranging from hundreds to thousands of genomes), demonstrating superior or comparable performance to existing methods like Gubbins and ClonalFrameML across diverse evolutionary scales.

Original authors: Odih, E. E., Wick, R. R., Holt, K. E.

Published 2026-04-24
📖 4 min read☕ Coffee break read
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you are trying to draw a family tree for a massive group of bacteria. You want to know who is related to whom based on their ancestors. But here's the catch: bacteria are notorious "copycats." They don't just inherit traits from their parents; they often swap DNA with neighbors, strangers, or even distant relatives. This is called recombination.

If you try to draw a family tree without accounting for these swaps, your map will be a mess. It's like trying to figure out a human family tree if half the people suddenly swapped their eye color, height, and fingerprints with random strangers on the street. The result would be a confusing tangle that doesn't show the true history.

The Problem with Old Tools

Scientists have had tools to clean up this mess before, but they are like small, delicate fishing nets. They work great if you are trying to catch a few fish (a few hundred bacteria) in a small pond. But if you try to use them to drag a net through the entire ocean (thousands of bacteria from different species), the net gets tangled, breaks, or takes forever to pull in.

Enter Verticall: The Heavy-Duty Crane

The paper introduces a new tool called Verticall. Think of Verticall not as a fishing net, but as a high-speed, industrial crane designed to sort through massive piles of cargo.

Here is how it works, using simple analogies:

  1. The "Copy-Paste" Detector:
    Verticall looks at the DNA of thousands of bacteria at once. It asks a simple question: "Does this piece of DNA look like it came from a close relative, or did it just get borrowed from a stranger?"

    • Vertical inheritance is like a family heirloom passed down from grandmother to mother to daughter.
    • Horizontal recombination is like buying a cheap knock-off shirt from a different store and wearing it.
      Verticall is smart enough to spot the "knock-off" shirts and set them aside, keeping only the true "family heirlooms."
  2. Two Ways to Clean the Data:
    The tool offers two different strategies to get a clean family tree, depending on what you need:

    • The "Distance Map" Approach: Imagine measuring the distance between every single person in a crowd to see who stands close to whom, ignoring the people who just walked in from outside. Verticall does this mathematically to build a tree based only on the "true" family connections.
    • The "Reference Check" Approach: Imagine lining up every bacterium next to a "Gold Standard" reference model. If a bacterium has a weird patch of DNA that doesn't match the reference (because it was borrowed), Verticall puts a "Do Not Use" sticker on that patch and ignores it when drawing the tree.

Why This Matters

The researchers tested Verticall on four different groups of bacteria, ranging from small groups (154) to massive crowds (nearly 5,000).

  • Speed: While old tools might take days to process a large group, Verticall does it in a fraction of the time. It's the difference between walking across a field and taking a high-speed train.
  • Accuracy: The family trees it builds make more sense biologically. They show the true history of the bacteria rather than a confused mix of borrowed traits.
  • Time Travel: Because the trees are accurate, scientists can use them to guess when these bacteria evolved or spread, acting like a reliable time machine.

The Bottom Line

Verticall is a new, free, and open-source tool that allows scientists to study the evolutionary history of bacteria on a massive scale. Whether you are looking at a small family of bacteria or an entire genus, Verticall can quickly sort out the "family heirlooms" from the "borrowed clothes," giving us a clear, accurate picture of how these microscopic organisms have evolved and spread over time.

You can find this "industrial crane" for free on GitHub, ready to help scientists untangle the world's most complex bacterial family trees.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →