evo3D R package: a spatial haplotype framework for structure-informed analysis of molecular evolution

The paper introduces evo3D, an R package that overcomes the limitations of existing structure-informed evolutionary analysis by providing a flexible framework to extract spatial haplotypes and compute diverse statistics across complex protein structures, thereby enabling more comprehensive identification of evolutionary patterns within 3D contexts.

Original authors: Broyles, B. K., He, Q.

Published 2026-04-16
📖 5 min read🧠 Deep dive
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

The Big Idea: Why "Flat" Maps Fail

Imagine you are trying to understand a complex machine, like a car engine. If you only looked at a flat list of parts (a spreadsheet), you might see that the spark plug is next to the fuel injector on the list. But in reality, inside the engine, the spark plug is actually touching the piston, which is far away on the list.

For a long time, scientists studying how viruses and proteins evolve have been looking at flat lists (linear DNA sequences). They assume that if two DNA letters are next to each other in the code, they are neighbors.

The Problem: Proteins don't work like flat lists. They fold up into complex 3D shapes (like origami). Two parts of a protein that are far apart in the DNA code might be touching each other in 3D space. If a virus mutates one part, it might break the connection with the part it's touching, even if they are miles apart on the "flat list."

The Solution: The authors built a new tool called evo3D. Think of it as a 3D GPS for evolution. Instead of looking at a flat list, it looks at the protein's 3D shape to find out which parts are actually neighbors.


How evo3D Works: The "Neighborhood" Analogy

Imagine a protein is a giant, crowded city.

  • The Old Way (Linear Windows): Scientists used to pick a neighborhood by looking at a street address. "Let's study houses 10 through 20." But in a 3D city, house #10 might be on the top floor of a skyscraper, and house #20 might be in the basement. They aren't neighbors at all!
  • The evo3D Way (Spatial Haplotypes): evo3D asks, "Who is actually standing next to whom?" It draws a circle around a specific person (a residue) and grabs everyone within arm's reach, regardless of their address on the list. It calls this group a "Spatial Haplotype."

Once it gathers this real-life neighborhood, it checks the DNA of everyone in that group to see how much they are changing (evolving). This tells scientists if that specific 3D cluster is under attack by the immune system or if it's a critical part of the machine that can't afford to change.

Key Features of the Tool

  1. It Handles the "Fold":
    Proteins often come in teams (multimers), like a soccer team of 8 players holding hands. evo3D is smart enough to know that if you are looking at Player A, you also need to look at Player B, C, and D who are holding hands with them, even if they are coded by different genes. It can merge these views or keep them separate, depending on what you want to study.

  2. It's a "Universal Adapter":
    Before this, scientists had to use different, clunky tools for different jobs. Some tools only worked on Windows, some only worked for cancer, and some only worked for viruses. evo3D is like a universal power strip. It takes your DNA data and your 3D structure file, plugs them in, and runs the analysis with a single button click (run_evo3d()). It's designed to be easy for anyone to use, not just computer experts.

  3. It Finds Hidden Secrets:
    The paper tested this on two viruses: Hepatitis C and Chikungunya.

    • The Hepatitis C Discovery: Scientists were looking for a "vaccine target"—a part of the virus that doesn't change much so a vaccine can stick to it. The old flat-list method missed a very stable, unchanging spot on the virus's surface. evo3D found it immediately because it saw that the surrounding 3D neighborhood was locked down tight.
    • The Chikungunya Discovery: They looked at how the virus interacts with human cells. They found that the "lock" (the binding site) is very stable, but the "doorframe" right next to it is chaotic and changing. This helps explain how the virus tricks the immune system: it keeps the important part steady while the surroundings distract the body's defenses.

Why This Matters

This tool removes the "technical barrier." Previously, you needed to be a master coder to combine DNA data with 3D protein shapes. Now, evo3D does the heavy lifting.

The Bottom Line:
Evolution doesn't happen on a piece of paper; it happens in 3D space. evo3D is the first tool that makes it easy to study evolution exactly where it happens: inside the folded, twisting, 3D world of proteins. It helps us find better vaccine targets, understand how viruses hide from our immune systems, and generally see the "big picture" of how life evolves.

Where to find it: The tool is free and open-source on GitHub, ready for any scientist to download and start exploring the 3D world of evolution.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →