Peak-Based Nuclide Identification in HPGe γ\gamma-Spectrometry with Machine Learning and SHAP

This paper presents a supervised machine learning approach for automating nuclide identification in HPGe gamma spectrometry, which achieves a superior F1 score of 0.97 compared to traditional software (0.84) and utilizes SHAP values to confirm that the model relies on physically relevant photopeaks for its predictions.

Original authors: Samuel Emmons, Kelly Truax, Maurice Lonsway, Bruce Pierson, Brian Archambault

Published 2026-06-16
📖 4 min read☕ Coffee break read

Original authors: Samuel Emmons, Kelly Truax, Maurice Lonsway, Bruce Pierson, Brian Archambault

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

The Problem: Finding Needles in a Haystack

Imagine you have a giant box of mixed-up Lego bricks. Some are red, some are blue, some are tiny, and some are huge. Your job is to look at a pile of these bricks and tell a friend exactly which specific colors and shapes are in that pile.

In the real world, scientists use special detectors (called HPGe detectors) to look at radioactive samples. These detectors produce a "spectrum," which is like a complex graph of peaks and valleys. Each "peak" represents a specific type of radioactive atom (a nuclide) emitting energy.

The problem is that these spectra are messy. Peaks overlap, some are very faint, and there are hundreds of different types of atoms to look for. Traditionally, human experts have to sit down, look at the graph, fit the peaks perfectly, and use standard software (like Genie 2000) to guess which atoms are present. This is slow, tiring, and sometimes the software gets it wrong, suggesting atoms that aren't actually there (false alarms).

The Solution: Training a Smart Assistant

The authors of this paper wanted to build a "smart assistant" using Machine Learning (ML) to help the experts. Instead of feeding the computer the whole messy graph, they gave it a "shopping list" of the most important peaks that experts had already identified and measured.

They taught two types of AI "students" to look at this list and decide which atoms are present:

  1. XGBoost: Think of this as a team of detectives who ask a series of "Yes/No" questions to narrow down the suspects.
  2. DNN (Deep Neural Networks): Think of this as a super-brain that looks for complex patterns and connections across the whole list at once.

The Results: Who Won the Contest?

The team tested these AI models against the traditional software (Genie 2000) using about 1,600 real-world examples of radioactive samples.

  • The Score: The AI models were much better at the job. The best AI model (XGBoost) got a score of 0.97 (out of 1.0), while the traditional software only got 0.84.
  • The Main Victory: The biggest win wasn't just finding the right atoms; it was not finding the wrong ones. The traditional software was like a security guard who yells "Intruder!" every time a shadow moves. The AI models were smarter; they only yelled "Intruder!" when they were actually sure. This means fewer false alarms for the human experts.

The "Why": The Magic Mirror (SHAP)

One of the most important parts of this paper is that the authors didn't just say, "The AI works." They wanted to know how it works. They used a tool called SHAP (which acts like a magic mirror) to see exactly which clues the AI was using to make its decisions.

They found that the AI wasn't just guessing; it was using physics-based logic:

  • The Main Clue: If the AI thinks "Cadmium-109" is there, it's mostly looking at the specific peak for Cadmium.
  • The Context Clue: The AI also looks at the "family." For example, if it sees a short-lived atom called Niobium-97, it checks to see if its "parent" atom (Zirconium-97) is also there. If the parent is missing, the AI knows the child probably shouldn't be there either. Traditional software often misses this family connection.
  • The "Context" Clue: The AI understands that a peak at a certain height might mean one thing if it's alone, but something else if it's surrounded by other specific peaks.

The Limitations: When the AI Gets Confused

The paper admits the AI isn't perfect.

  • Rare Items: If a specific atom appears very rarely in the training data (like a rare Lego piece that only shows up 3% of the time), the AI sometimes struggles to identify it correctly.
  • Bad Labels: If the human experts made mistakes when labeling the training data (saying "This is Atom A" when it was actually "Atom B"), the AI gets confused. It learns from the mistakes it was taught.

The Bottom Line

This paper shows that by teaching computers to look at the "shopping list" of radioactive peaks, we can create a tool that is faster and more accurate than current methods. It doesn't replace the human expert; instead, it acts like a highly skilled assistant that filters out the noise and false alarms, letting the human focus on the real work. The AI learned to think like a physicist, using both the main clues and the surrounding context to make the right call.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →