Disentangling mitochondrial copy number variation and PCR amplification bias in DNA metabarcoding

This study investigates the limitations of quantitative DNA metabarcoding by demonstrating that high variation in mitochondrial copy numbers and stable PCR amplification biases prevent accurate biomass estimation, ultimately revealing fundamental constraints and the need for new methodological approaches despite the development of a mathematical model to correct for these biases.

Wolany, L., Klinkenborg, K., Leese, F., Buchner, D.

Published 2026-04-09
📖 6 min read🧠 Deep dive
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

The Big Picture: Counting Bugs with a Flawed Magnifying Glass

Imagine you are a detective trying to figure out how many different types of bugs are in a jar. You can't count them one by one, so you decide to use a special "DNA magnifying glass" (metabarcoding). You take a tiny drop of water from the jar, zoom in on a specific genetic tag (a barcode) that every bug has, and count how many times that tag appears in your results.

The hope is: More tags = More bugs.

But this paper is a reality check. The authors found that this "DNA magnifying glass" is actually a bit broken. It doesn't just count bugs; it distorts the truth in two major ways.


The Two Big Problems

1. The "Battery Pack" Problem (Mitochondrial Copy Number)

Think of every bug as a house. Inside every house, there is a power plant (the mitochondria) that generates energy.

  • The Issue: Some bugs have tiny power plants with just one battery. Others have massive power plants with 200 batteries.
  • The Result: If you count the batteries (DNA copies) to guess how many houses (bugs) are there, you will be fooled. A bug with 200 batteries will look like 200 bugs, while a bug with 1 battery looks like just one.
  • The Finding: The researchers found that even within the same species, the number of "batteries" varies wildly depending on the bug's age, health, and tissue type. You can't just assume a 1:1 ratio between DNA and actual bug weight.

2. The "Unfair Amplifier" Problem (PCR Bias)

To see the DNA, you have to copy it millions of times using a machine called a PCR (like a photocopier for genes).

  • The Issue: The "photocopier" isn't perfect. It has a favorite. If the DNA of a specific bug matches the machine's settings perfectly, it copies it super fast. If the DNA has even a tiny typo (a mismatch), the machine struggles and copies it slowly.
  • The Result: The bugs with the "perfect match" DNA end up with millions of copies in the final count, drowning out the bugs that were actually there in equal numbers but had a "typo" in their code.
  • The Finding: This bias is specific to each species. Some bugs get amplified 100 times more than others, completely skewing the data.

The Failed Fix: "Just Run It Longer"

The researchers tried a clever trick to fix the "Unfair Amplifier." They thought: "If we run the photocopier for fewer cycles, maybe the bias won't have time to grow."

  • The Analogy: Imagine a race where one runner is fast and one is slow. You thought that if you stopped the race early, the gap between them wouldn't be so huge.
  • The Reality: They found that the bias happens in the very first two seconds of the race. Once the race starts, the fast runner is already way ahead, and running the race longer (more PCR cycles) doesn't change the ratio. The "photocopier" fixes its own mistakes after the first two rounds, locking in the bias immediately. So, changing the number of cycles didn't help.

The New Solution: The "Correction Factor" Cheat Sheet

Since they couldn't stop the bias, they decided to calculate exactly how biased the machine was and correct for it mathematically.

  • The Analogy: Imagine you are weighing apples on a scale that is broken. The scale always adds 5 pounds to every apple. You can't fix the scale, but if you know it adds 5 pounds, you can write a rule: "Whatever the scale says, subtract 5 pounds to get the real weight."
  • The Study's Method:
    1. They created "Mock Communities" (fake jars with known amounts of 5 specific bugs).
    2. They measured exactly how much each bug's DNA got amplified compared to a "Reference Bug" (Blattodea, a type of cockroach).
    3. They created a Correction Factor for each bug.
      • Example: If the machine amplifies the "Ant" DNA 30 times better than the "Cockroach," the math says: "Divide the Ant's count by 30 to get the real number."

The Result: When they applied this math, they could accurately predict how much DNA was actually in the jar, even though the machine had distorted it.


The Catch: Why This Isn't a Magic Bullet Yet

Even though they solved the math, the authors warn that we still can't perfectly count bugs in the wild yet. Here is why:

  1. The "Battery" Problem is still there: Even if we fix the photocopier bias, we still don't know how many "batteries" (mitochondria) a wild bug has. A sick bug might have fewer batteries than a healthy one. So, we can count the DNA copies accurately, but we still can't perfectly translate that into "how many bugs" or "how much bug weight."
  2. The "Unknowns" Problem: To use their math, you need to know every single species in the jar and have a correction factor for all of them. In a real forest or river, there are thousands of unknown species. If you miss even one, your math gets messy.
  3. The Cost: To get these correction factors, you have to do expensive lab tests for every new species you want to study.

The Bottom Line

This paper is like a mechanic showing us exactly how a car engine is misfiring.

  • Good News: We now understand why DNA counts are wrong (battery variation + photocopier bias) and we have a mathematical formula to fix the photocopier part.
  • Bad News: We still can't perfectly turn those counts into a real "bug census" because nature is too variable and we don't have data for every bug on Earth yet.

In short: DNA metabarcoding is great for making a list of who is there, but using it to count how many are there is still a work in progress. We are getting the math right, but the biological reality is still a bit messy.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →