Accuracy of occurrence and abundance estimates from insect metabarcoding

This study demonstrates that while both mild lysis and homogenization introduce distinct taxon-specific biases in insect occurrence data, combining homogenization with biological spike-ins enables reasonably accurate abundance estimates, advancing the potential of DNA metabarcoding for robust quantitative biodiversity monitoring.

Iwaszkiewicz-Eggebrecht, E., Granqvist, E., Nowak, K. H., Valdivia, C., Buczek, M., Srivathsan, A., Hartop, E., Miraldo, A., Roslin, T., Tack, A. J. M., Lukasik, P., Meier, R., Ronquist, F.

Published 2026-02-22
📖 5 min read🧠 Deep dive
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you have a giant bowl of mixed-up fruit salad, but instead of fruit, it's filled with thousands of tiny insects from a forest. Your goal is to answer two questions:

  1. Who is in the bowl? (Which species are there?)
  2. How many of each are there? (Is there one beetle or a thousand?)

For a long time, scientists have used a high-tech method called DNA metabarcoding to answer these questions. It's like taking a tiny sip of the "insect soup," reading the genetic code of everything in it, and identifying the guests. But there's a big problem: the method isn't perfect. Sometimes it misses the small guests, and sometimes it thinks there are more of a certain bug than there actually are.

This paper is like a massive "quality control test" to figure out how to get the most accurate results possible. The researchers tested different ways of making the "soup" and different ways of measuring the ingredients.

Here is the breakdown of their findings, using some everyday analogies:

1. The Two Ways to Make the Soup: "Steeping" vs. "Blending"

The researchers compared two main methods for getting DNA out of the bugs:

  • Method A: The "Tea Bag" (Mild Lysis/Non-destructive).
    Imagine putting the insects in a cup of hot water (a chemical solution) and letting them sit. The DNA leaks out like tea from a tea bag, but the bugs stay whole.

    • The Good News: This is great for finding tiny, soft bugs (like tiny flies or parasitic wasps) because they dissolve easily. Plus, you can still look at the bugs under a microscope later if you want to study them.
    • The Bad News: It misses the tough, armored bugs (like beetles or ants with hard shells). Their DNA doesn't leak out well, so they might be invisible in the "tea."
  • Method B: The "Smoothie" (Homogenization/Destructive).
    Imagine throwing the bugs into a blender and turning them into a complete mush.

    • The Good News: This gets DNA out of everything, especially the tough, armored bugs. It gives a very accurate count of how many of each bug are in the bowl.
    • The Bad News: You destroy the bugs. You can't look at them later, and it's harder to find the tiny ones because they get lost in the "smoothie" of DNA from the big bugs.

The Verdict: Neither method is perfect. "Tea" is better for finding rare, tiny species and saving the bugs. "Smoothie" is better for counting the tough ones accurately. The best strategy? Do both! Use "Tea" for most samples to save the bugs, and "Smoothie" for a few samples to get accurate counts.

2. The "Calibration Weights" (Spike-ins)

One of the biggest headaches in this science is that reading the DNA doesn't tell you the number of bugs. A big bug might release 100 times more DNA than a small bug, making it look like there are 100 big bugs when there's actually only one.

To fix this, the researchers added "Calibration Weights" (called spike-ins) to every bowl.

  • Biological Weights: They added a known number of specific, foreign insects (like 5 fruit flies and 5 crickets) to every sample before processing.
  • Synthetic Weights: They added tiny, artificial DNA strands that don't exist in nature.

The Analogy: Imagine you are weighing apples on a scale that is broken and fluctuates wildly. If you put a known 1kg weight on the scale first, you can see how much the scale is off and adjust your apple weight accordingly.

The Discovery:

  • The Biological Weights (real bugs) were the champions. Because they went through the whole process (the "tea" or "smoothie" making), they told the scientists exactly how much DNA was lost or gained during the process.
  • The Synthetic Weights (fake DNA) were okay, but they were added after the bugs were processed. They couldn't tell the scientists about the messiness of the "blending" or "steeping" steps.
  • Result: Using real bugs as weights made the abundance estimates (counting the bugs) much more accurate.

3. The "Crystal Ball" (Bayesian Modeling)

Even with the weights, counting is still tricky. The researchers built a sophisticated computer model (a "Crystal Ball") that uses math to predict the number of bugs based on the DNA reads and the calibration weights.

  • The Result: This model was surprisingly good. For about 73% of the bug species, the model guessed the number of individuals within just one of the actual number.
    • Example: If there were actually 5 beetles, the model guessed 4, 5, or 6. That's a huge improvement over just guessing based on raw DNA numbers!

The Big Takeaway

This paper is a roadmap for the future of insect monitoring. It tells us:

  1. Don't just destroy everything: Keep some bugs intact ("Tea method") to find the tiny, rare ones and preserve them for the future.
  2. Use real "weights": Add real insects to your samples to calibrate the machine. Don't rely on fake DNA alone.
  3. Math helps: Use smart computer models to turn messy DNA data into accurate bug counts.

By combining these methods, scientists can finally get a clear, accurate picture of the insect world, helping us understand if insect populations are crashing or thriving, which is vital for our planet's health.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →