Assessing diagnostic accuracy of Ov16 rapid diagnostic tests for onchocerciasis using Bayesian latent class models

By applying Bayesian latent class models to pooled data from Mozambique, Ghana, and Benin, this study demonstrates that while the novel GADx Ov16 rapid test achieves the sensitivity required for stopping mass drug administration, none of the evaluated tests meet the necessary specificity thresholds to support elimination decisions without confirmatory strategies.

Original authors: Norman, J., Bassabi-Alladjie, N.-M., Boko-Collins, P. M., de Souza, D. K., Gass, K., Hamill, L., Langa, J., Moore, C., Nala, R., Sullivan, S. M., Giorgi, E.

Published 2026-01-22
📖 4 min read☕ Coffee break read

Original authors: Norman, J., Bassabi-Alladjie, N.-M., Boko-Collins, P. M., de Souza, D. K., Gass, K., Hamill, L., Langa, J., Moore, C., Nala, R., Sullivan, S. M., Giorgi, E.

Original paper licensed under CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/). ⚕️ This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you are trying to find a very shy, invisible guest (a parasite called Onchocerca volvulus) who causes river blindness. To catch them, health workers use special "sniffer dogs" (rapid diagnostic tests) that look for evidence the guest has been there, specifically a scent marker called "Ov16."

For years, the only sniffer dog available was the SD Bioline test. But there was a problem: this dog was a bit unreliable. Sometimes it missed the guest (low sensitivity), and sometimes it barked at a squirrel instead of the guest (low specificity). Also, it was picky about the food it ate; it worked best on dried blood samples but struggled with fresh blood, making field work slow and complicated.

Recently, two new sniffer dogs were trained: the GADx test and the DDTD test. The big question was: Are these new dogs better? And can they be trusted to tell health officials, "Okay, the guest is gone, we can stop the search and stop giving medicine"?

The Challenge: No Perfect "Truth"

Usually, to test a new dog, you compare it to a "perfect" dog that never makes mistakes. But in this disease, there is no perfect dog. The old "gold standard" (looking at skin under a microscope) is painful for people and often misses the guest if the infection is low.

So, the researchers couldn't just say, "New Dog A is better than Perfect Dog B." Instead, they used a clever statistical trick called a Bayesian Latent Class Model.

Think of this like a detective solving a mystery without a confession. The detective has three witnesses (the three different tests) who all saw the same scene but might have different opinions. The detective doesn't know who is telling the absolute truth, but by listening to all three and seeing where they agree or disagree, they can mathematically guess who is the most reliable witness and how often they are right.

The Investigation

The researchers gathered data from three different countries (Mozambique, Ghana, and Benin) where the parasite is found. They tested thousands of people with all three "sniffer dogs" at the same time and ran them through their detective math.

Here is what they found:

1. The "Sniffing" Ability (Sensitivity)
This measures how good the test is at finding the parasite when it is actually there. The World Health Organization (WHO) says a test needs to be at least 89% good to be trusted for stopping medicine programs.

  • GADx (The New Star): This test was the best. It found the parasite about 92% of the time. It consistently passed the 89% threshold.
  • SD Bioline (The Old Guard): This one was hit-or-miss. In some calculations, it passed the 89% mark, but in others, it dropped to about 83%. It wasn't reliable enough to be counted on every time.
  • DDTD (The Multi-Antigen Dog): This test looked for multiple scent markers. It did okay, finding the parasite about 87% of the time, but it consistently fell just short of the 89% goal.

2. The "False Alarm" Problem (Specificity)
This measures how good the test is at not barking when there is no guest. The WHO requires a test to be 99.8% perfect here. If a test barks at a squirrel (false positive), the program might keep giving medicine to a village that doesn't need it, wasting money and effort.

  • The Bad News: None of the three tests reached the 99.8% goal.
  • The best performer was GADx at 98.8%, followed closely by DDTD and SD Bioline.
  • While 98.8% sounds high, in a large population, that small gap means some people who don't have the parasite will still test positive.

The Conclusion

The paper concludes that while the new GADx test is a significant improvement and is very good at finding the parasite (sensitivity), none of the tests are perfect enough to be used alone to decide when to stop giving medicine.

Because they all have a slightly higher "false alarm" rate than the WHO demands, using them alone could lead to the wrong decision: either stopping medicine too early (if the test misses the parasite) or continuing medicine too long (if the test falsely alarms).

The researchers say we need either a new, even better test or a strategy that uses these tests as a first step, followed by a second check, before making the final call to stop the fight against river blindness.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →