NuBench: An Open Benchmark for Deep Learning-Based Event Reconstruction in Neutrino Telescopes

This paper introduces NuBench, an open benchmark comprising seven large-scale simulated datasets across six detector geometries, designed to facilitate the development and comparative evaluation of deep learning-based event reconstruction methods for neutrino telescopes.

Rasmus F. Orsoe, Stephan Meighen-Berger, Jeffrey Lazar, Jorge Prado, Ivan Mozun-Mateo, Aske Rosted, Philip Weigel, Arturo Llorente Anaya

Published 2026-03-02
📖 5 min read🧠 Deep dive

Imagine you are trying to solve a massive, three-dimensional puzzle in the middle of a pitch-black ocean. You can't see the pieces, and you can't see the picture. All you have are a few scattered, glowing fireflies that blink at specific times. Your job? To figure out exactly where the fireflies came from, how fast they were moving, and what kind of creature they were.

This is essentially what scientists do with neutrino telescopes. Neutrinos are ghost-like particles that zip through the Earth almost without touching anything. When they do hit something deep underwater or under the ice, they create a tiny flash of blue light (Cherenkov radiation). Detectors like IceCube (in Antarctica) or KM3NeT (in the Mediterranean) are huge grids of light sensors waiting to catch these flashes.

The problem is that figuring out what caused the flash is incredibly hard. It's like trying to guess the shape, speed, and origin of a car just by hearing the sound of its horn echo off a canyon wall.

The Problem: Too Many Puzzles, Too Few Rules

For a long time, every team building a neutrino telescope (IceCube, KM3NeT, Baikal-GVD, etc.) has been trying to solve this puzzle in their own way. They have their own secret recipes (algorithms) and their own private datasets. It's like if every chef in the world was trying to make the perfect soup but refused to share their recipes or ingredients. This makes it hard to know who is actually making the best soup.

The Solution: NuBench (The "Neutrino Gym")

This paper introduces NuBench, which is essentially a giant, open-source "gym" for artificial intelligence (AI) to train on.

Think of NuBench as a massive, shared video game level that anyone can download. It contains 130 million simulated neutrino events.

  • The Simulations: Instead of waiting for real neutrinos (which are rare), the scientists used a supercomputer to simulate what would happen if neutrinos hit six different types of detectors.
  • The Detectors: They didn't just copy one detector. They built digital versions of six different "layouts," ranging from dense forests of sensors (like a crowded city) to sparse deserts of sensors (like a wide-open field). This tests if the AI can learn the rules of physics or if it just memorized the specific layout of one detector.
  • The Data: For every simulated event, the AI gets the "raw data" (the timing and brightness of the light flashes) and the "answer key" (the true energy, direction, and type of the neutrino).

The Challenge: The AI Olympics

The authors didn't just build the gym; they put four different AI "athletes" in it to see who could solve the puzzles best. These athletes are:

  1. ParticleNet & DynEdge: The current champions used by real-world experiments. They are like expert detectives who look at the clues locally (neighbor by neighbor).
  2. DeepIce: A winner from a previous public contest. It uses a "Transformer" architecture (the same tech behind modern chatbots) that looks at the whole picture at once.
  3. GRIT: A new hybrid athlete that tries to combine the best of both worlds.

They tested these AIs on five different tasks:

  1. Energy: How much power did the neutrino have? (Like guessing the speed of a car).
  2. Direction: Where did it come from? (Like pointing to the source of a sound).
  3. Shape (Track vs. Cascade): Did it leave a long trail (like a bullet) or a short puff (like a firework)?
  4. Location: Exactly where did the crash happen?
  5. Inelasticity: A fancy physics term for how much energy was "lost" in the crash versus kept by the particle.

The Results: What Did We Learn?

The paper found some fascinating things, which can be summarized with a few metaphors:

  • Density Matters: If you want to know exactly where a crash happened (Vertex) or how the energy was split (Inelasticity), you need a dense detector (lots of sensors close together). It's like trying to find a needle in a haystack; if the hay is packed tight, you find it easier. The AI did much better on the "dense" detector simulations.
  • Volume Matters: If you want to know the direction of a high-speed neutrino, a large detector is better. Even if the sensors are far apart, a huge volume gives the AI enough context to see the "line" the particle traveled.
  • The "Global" View Wins: For figuring out direction, the AI that looked at the whole picture at once (DeepIce and GRIT) beat the ones that looked at clues one by one. It's like the difference between trying to solve a maze by looking at one square at a time versus seeing the whole map.
  • No Single Winner: There wasn't one "perfect" AI for everything. Sometimes the old-school detective (DynEdge) was best; sometimes the new global thinker (DeepIce) won. It depends on the specific puzzle and how much energy the neutrino had.

Why This Matters

This paper is a huge step forward because it breaks down the walls between different scientific teams. By providing a common benchmark, anyone in the world can now download the data, train their own AI, and say, "My new method is 5% better than the current standard."

It turns neutrino physics from a collection of isolated experiments into a collaborative global sport, where the goal is to build the smartest, most accurate "ghost-hunting" AI possible. The datasets and the winning models are all open-source, meaning the door is wide open for the next generation of scientists to push the boundaries of what we know about the universe.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →