Uncertainty-aware graph representation learning with positive-unlabeled classification for biomarker discovery in peripheral artery disease

This paper presents an uncertainty-aware graph representation learning framework that integrates positive-unlabeled classification and ensemble methods to prioritize novel and well-calibrated biomarkers for peripheral artery disease, demonstrating superior predictive performance and biological relevance compared to existing baselines.

Original authors: Ayyalasomayajula, V. S. R. K., Senders, M. L., Wolterink, J. M., Yeung, K. K.

Published 2026-05-13
📖 3 min read☕ Coffee break read

Original authors: Ayyalasomayajula, V. S. R. K., Senders, M. L., Wolterink, J. M., Yeung, K. K.

Original paper licensed under CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/). ⚕️ This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine Peripheral Artery Disease (PAD) as a massive, tangled city of roads (our blood vessels) where some streets are blocked, but we don't have a complete map of why they are blocked. Scientists know a few key culprits (the "positive" proteins), but for most of the city, the traffic reports are missing or incomplete. This makes it very hard to find new clues to fix the problem.

Most computer programs trying to solve this are like overconfident tour guides. They point at a random building and say, "This is definitely the problem!" without admitting they might be guessing. They don't know when they are unsure, and they often miss new, strange buildings that don't look like the ones they've seen before.

The New Approach: A Cautious Detective with a Crystal Ball

The researchers in this paper built a smarter system, like a cautious detective who carries a "confidence meter" and a "novelty radar." Here is how they did it:

  1. Mapping the City (Graph Learning): First, they used a special type of AI (a Graph Neural Network) to create a 3D map of how all the proteins in the body connect to each other. Think of this as drawing a subway map where the distance between stations represents how closely related different proteins are.
  2. The "Yes, Maybe, No" Team (Ensemble Prediction): Instead of relying on one single detective, they hired a whole team of different experts (five different classifiers) and asked them to vote. They also taught these experts to say, "I'm not sure," when the data was fuzzy. This created a "confidence meter" that tells us how sure the system is about its answer.
  3. The Two Buckets (Uncertainty & Novelty): The system sorted the potential clues into two piles:
    • The "Safe Bets": These are candidates that look very similar to the known troublemakers. The system is very confident about these.
    • The "New Discoveries": These are candidates that live in strange, unexplored parts of the map. The system flags these as "structurally novel" because they don't fit the usual patterns, suggesting they might be new types of culprits we haven't thought of yet.

What They Found

The team tested this system and found it was much better than the old methods. While the old "overconfident" guides were right about 82% of the time, this new team was right about 92% of the time.

  • The Safe Bets: The proteins the system was most confident about clustered together with known PAD proteins. They were involved in familiar tasks like building the road walls (extracellular matrix) and managing blood clotting (coagulation).
  • The New Discoveries: The "novel" candidates lived in different neighborhoods on the map. These were linked to different kinds of traffic control, like cell signaling and immune system responses (G protein-coupled receptors and NF-kappaB pathways).

The Bottom Line

By teaching the computer to admit when it's unsure and to look for things that are different from the norm, the researchers successfully identified 100 new potential biomarkers for PAD. They proved that mixing "confidence" with "curiosity" helps scientists find both the obvious suspects and the hidden ones, leading to a much clearer picture of the disease.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →