Machine-learned particle flow as a foundation model for collider physics

This paper establishes machine-learned particle flow (MLPF) as a foundation model for collider physics by demonstrating that its learned latent representations serve as a shared, information-rich bridge between low-level detector data and diverse high-level analysis tasks, significantly improving performance and efficiency compared to traditional modular approaches.

Original authors: Farouk Mokhtar, Joosep Pata, Michael Kagan, Javier Duarte

Published 2026-06-15✓ Author reviewed
📖 5 min read🧠 Deep dive

Original authors: Farouk Mokhtar, Joosep Pata, Michael Kagan, Javier Duarte

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine a massive, high-speed collision happening inside a particle accelerator. When particles smash together, they shatter into a chaotic spray of smaller fragments. To understand what happened, physicists need to rebuild the story from the debris.

Traditionally, this reconstruction process is like a factory assembly line with disconnected stations.

  1. Station A looks at the raw, messy signals from the detectors and creates a basic list of "what particles are here."
  2. Station B takes that list and tries to answer specific questions, like "Was this a heavy particle?" or "How much energy did it have?"

The problem is that once Station A finishes its job and hands over the list, it throws away all the subtle, messy details it saw in the raw data. Station B has to start from scratch, often having to manually invent new tools (called "features") to guess what it missed.

The Big Idea: The "Foundation Model"
This paper proposes a new way to run the factory. Instead of just handing over a simple list, the first station (a machine learning model called MLPF) keeps a "secret notebook" of high-level insights it learned while doing its job.

Think of this notebook as a universal translator or a rich internal memory. Even though the machine wasn't explicitly taught to answer the specific questions in Station B, its internal memory contains the raw physics of the event in a compressed, intelligent format.

The researchers took this "secret notebook" (called latent representations) and handed it to three different experts (the downstream tasks) to see if it helped them do their jobs better.

The Three Tests

The team tested this idea on three very different jobs:

1. Identifying the "Flavor" of a Jet (The Detective)

  • The Job: Particles often clump together into "jets." Physicists need to know if a jet came from a heavy "beauty" quark, a "charm" quark, or a lighter particle. This is like a detective trying to identify a suspect's nationality based on their clothing.
  • The Old Way: The detective only had a photo of the suspect's outfit (standard data).
  • The New Way: The detective was given the photo plus the secret notebook from the first station.
  • The Result: The detective became much better at spotting the heavy "beauty" quarks, even when they looked very similar to the others. The secret notebook contained clues about the suspect's history that the photo alone didn't show.

2. Measuring Jet Energy (The Accountant)

  • The Job: Calculating exactly how much energy a jet carries.
  • The Old Way: The accountant used standard math on the photo.
  • The New Way: The accountant used the photo plus the secret notebook.
  • The Result: The accountant's numbers were much more precise, especially for very high-energy jets. The notebook helped correct small errors that the standard math missed.

3. Finding "Missing" Momentum (The Balance Sheet)

  • The Job: Sometimes particles (like neutrinos) escape the detector unseen. Physicists have to calculate where they went by seeing what is "missing" from the total balance.
  • The Old Way: The balance sheet was often off because the individual numbers were slightly fuzzy.
  • The New Way: The balance sheet was updated using the secret notebook, which understood the reliability of every single piece of data.
  • The Result: This was the biggest win. The new method found the missing momentum with 35 times fewer parameters (a much simpler, lighter model) than the previous best method, and it was significantly more accurate.

The "Linear Probe" Surprise

The most surprising part of the paper is a test they called the "Linear Probe."

Imagine you have a super-complex, 2048-page secret notebook. Usually, you'd need a huge team of analysts to read it and find the answer. But the researchers asked: "Can a single, simple line of math read this notebook and still get a good answer?"

Yes.
Even with just a single, simple line of math (a linear layer), the model could extract useful physics information from the notebook.

  • For the "Missing Momentum" test, this simple line of math beat the complex, industry-standard models.
  • For the "Flavor" test, it did surprisingly well, even though the notebook was never explicitly trained to look for flavors. This proves the notebook naturally organizes the physics information in a way that is easy to read.

The Takeaway

The paper concludes that reconstruction and analysis don't need to be separate steps.

By using a machine learning model that learns a "shared language" (the latent representations) during the reconstruction phase, we can feed that language directly into analysis tasks. It's like if the factory worker didn't just hand you a box of parts, but also handed you a manual that explained exactly how those parts fit together, making the assembly process faster, cheaper, and more accurate.

This establishes the reconstruction model as a "Foundation Model" for particle physics: a powerful, pre-trained brain that can be easily adapted to solve many different problems without needing to be retrained from scratch.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →