Standardised Human Phenotype Ontology Annotation Enables High Quality Phenotypic Data Capture in a Real-World Common Variable Immunodeficiency Cohort

This study demonstrates that implementing a standardized Human Phenotype Ontology (HPO) framework with clinician training enables high-quality, granular phenotypic data capture in a large UK CVID cohort, successfully linking specific immune profiles and genetic variants to distinct clinical subgroups to advance genotype-phenotype understanding and therapeutic strategies.

Original authors: Campos, L. C., Favreau, E., Greene, D., Blach, J., Thomas, M., Alsehaim, K., Mutlu, L., Elhadari, S., Herwadkar, A., Payne, J., Lever, C., Mahmoud, D., Moreira, F., O'Sullivan, M., Berry, M., Twigg, G
Published 2026-04-29
📖 4 min read☕ Coffee break read

Original authors: Campos, L. C., Favreau, E., Greene, D., Blach, J., Thomas, M., Alsehaim, K., Mutlu, L., Elhadari, S., Herwadkar, A., Payne, J., Lever, C., Mahmoud, D., Moreira, F., O'Sullivan, M., Berry, M., Twigg, G., Hart, A. C. J., Joshi, N., Fuller, S., INTREPID Consortium,, Smith, K. G. C., Turro, E., Cook, M. C., Wallace, C., Burns, S. O.

Original paper licensed under CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/). ⚕️ This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you are trying to describe a complex recipe to a friend, but everyone uses their own slang. One person says "a pinch of spice," another says "a dash of heat," and a third says "something spicy." If you try to compare their notes later, it's a mess. You can't tell if they actually made the same dish or if they just used different words for the same thing.

This is exactly the problem doctors faced with Common Variable Immunodeficiency (CVID). It's a condition where the immune system is weak, but it looks different in every single patient. Some get lots of infections, while others get infections plus autoimmune problems, swollen organs, or lung issues. Because doctors described these symptoms in different ways, it was hard to see the big picture or group patients who were actually similar.

The Solution: A Universal "Medical Dictionary"
The researchers in this paper decided to fix this by using a tool called the Human Phenotype Ontology (HPO). Think of HPO as a massive, standardized dictionary for human diseases. Instead of writing "my tummy hurts," a doctor using HPO would select the exact term "Abdominal pain" from a list. Instead of "bad lungs," they'd pick "Bronchiectasis."

The team, led by the INTREPID consortium, built a special digital tool (a "Phenotype Capture Tool") to help doctors use this dictionary. But they knew that just giving doctors a dictionary isn't enough; they needed to teach them how to use it properly. So, they trained 28 clinicians across 11 UK hospitals on how to describe patients using these specific terms.

The Experiment: Training Makes Perfect
The researchers tested their idea in two ways:

  1. The Test Drive: They gave 10 doctors the same fake patient case and asked them to describe it using the HPO dictionary. Before training, their descriptions were all over the place. After training, they were almost identical. It was like teaching a group of chefs to measure ingredients with the same cup instead of guessing with their hands.
  2. The Real World: They looked at 526 real CVID patients. They compared the notes doctors wrote before the training to the notes written after.
    • Before: The notes were sparse. On average, doctors listed about 7 symptoms per patient.
    • After: The notes were rich and detailed. The average jumped to 19 symptoms per patient.
    • The Result: The doctors didn't just write more; they wrote better. They stopped using vague terms and started using precise ones, capturing the full complexity of the disease.

What They Discovered: Sorting the "Infection" vs. "Complex" Groups
With this high-quality data, the researchers could finally sort the patients into two distinct groups, like sorting a mixed bag of marbles by color:

  • Group A (Infection-Only): Patients who mostly just struggled with getting sick.
  • Group B (Complex): Patients who had infections plus other messy complications like autoimmune attacks, enlarged spleens, or liver issues.

They found that 58% of the patients fell into the "Complex" group.

Connecting the Dots: Genes and Symptoms
Because the data was so clean, they could finally draw clear lines between what was happening inside the patients' bodies (their genes and blood cells) and what was happening on the outside (their symptoms).

  • The "Complex" Clue: Patients in the "Complex" group were much more likely to have specific genetic mutations (specifically in a gene called NFKB1) and specific abnormalities in their immune cells (like a lack of "switched memory" B cells).
  • Specific Matches:
    • If a patient had a mutation in the NFKB1 gene, they were highly likely to have autoimmune neutropenia (where the body attacks its own infection-fighting white blood cells).
    • If a patient had a specific mutation in the TACI gene, they were more likely to get repeated yeast (Candida) infections.
    • Patients with high levels of a specific type of immune cell (CD21low) were more likely to have autoimmune thrombocytopenia (low platelet counts).

The Bottom Line
This study proves that if you teach doctors to speak the same "language" (HPO) and give them the right tools, you can turn a messy, confusing pile of patient notes into a clear, organized map.

By doing this, they didn't just count symptoms; they discovered that the "Complex" type of CVID is biologically different from the "Infection-Only" type. They found that certain genetic errors are directly linked to specific, severe complications. This means that in the future, looking at a patient's specific genetic code and detailed symptom list could help doctors understand exactly what kind of CVID a patient has, rather than just treating them all the same way.

In short: They built a better filing system, taught the staff how to use it, and in doing so, found hidden patterns that explain why some patients get sicker than others.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →