OrthoGather: a local platform for orthology-based proteome and proteomics comparisons and Gene Ontology enrichment

OrthoGather is a locally hosted web application designed to streamline comparative proteomic analysis and Gene Ontology enrichment across species by integrating orthology inference with flexible functional annotation, thereby lowering technical barriers for researchers without computational expertise.

Vivas-Rodriguez, C., Matallanas, D., Ryan, C. J., McClean, S., Dennler, O., Drabinska, J.

Published 2026-03-11
📖 4 min read☕ Coffee break read
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you are trying to understand how different families (species) react to the same storm (a stimulus, like a drug or disease). You have a list of people who got sick in Family A, and another list for Family B. You want to know: Did they get sick for the same reason? Did they use the same tools to fight back?

The problem is that Family A speaks a language you understand perfectly (they have detailed manuals for every person), but Family B has very few manuals, and many of their people are listed only as "Unknown." Comparing them directly is like trying to match a detailed resume with a blank piece of paper.

OrthoGather is a new, easy-to-use tool that solves this problem. Think of it as a universal translator and family reunion planner for proteins.

Here is how it works, broken down into simple steps:

1. The "Family Tree" Builder (Orthogroups)

In biology, proteins that share a common ancestor are like cousins. Even if they look slightly different or have different names in different species, they are related.

  • The Old Way: Researchers had to use complex computer code (like a command line) to build these family trees. It was like trying to build a house using only a hammer and a blueprint written in a language you don't speak.
  • The OrthoGather Way: You just type in the names of the species you want to study (like E. coli or Mycobacterium). OrthoGather automatically downloads their "family photos" (proteomes) and builds the family tree for you. It groups all the "cousin" proteins together into Orthogroups.

2. The "Smart Inference" Trick

This is the magic part.

  • The Problem: If a protein in a poorly studied species (let's call him "Bob") has no manual, you don't know what he does.
  • The Solution: OrthoGather looks at Bob's cousin, "Steve," who lives in a well-studied species. Steve has a detailed manual saying, "I am a firefighter." Because Bob and Steve are in the same family group (Orthogroup), OrthoGather assumes Bob is also a firefighter.
  • The Result: You can now understand the function of proteins in "mystery" species by borrowing the knowledge from their well-known relatives.

3. The "Party Planner" (Comparative Analysis)

Once the families are grouped, OrthoGather helps you see who is at the party and who is missing.

  • It creates visual charts (called UpSet plots) that look like a Venn diagram on steroids. They show you:
    • Which proteins are unique to one species?
    • Which proteins are shared by everyone?
    • Which proteins are shared only by specific groups?
  • You can filter this list. For example, "Show me only the proteins that changed when we gave the bacteria a specific drug."

4. The "Theme Detector" (Gene Ontology Enrichment)

Now that you know who changed, you want to know what they are doing.

  • OrthoGather takes your list of "changed" proteins and asks: "Are these proteins mostly firefighters, or mostly chefs?"
  • It runs a statistical test to see if certain themes (like "fighting antibiotics" or "building cell walls") appear more often in your list than you would expect by random chance.
  • It spits out easy-to-read bar charts and downloadable reports that you can put straight into a scientific paper.

Why is this a big deal?

Before OrthoGather, doing this kind of comparison required you to be a computer expert, know how to code, and spend hours cleaning up messy data files. It was like needing a degree in mechanics just to change a tire.

OrthoGather is like a self-driving car for this process:

  • It's Local: You run it on your own computer, so your data stays private.
  • It's Visual: You click buttons and see pretty graphs, not lines of code.
  • It's Smart: It fills in the gaps for poorly studied organisms using the knowledge of well-studied ones.

In a nutshell: OrthoGather takes the messy, confusing task of comparing biological "families" across different species and turns it into a simple, visual story, allowing scientists to discover how life adapts to challenges without needing a PhD in computer science.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →