AI-assisted modeling and Bayesian inference of unpolarized quark transverse momentum distributions from Drell-Yan data

This paper presents a global Bayesian analysis of unpolarized quark transverse-momentum-dependent parton distribution functions using Drell-Yan data at N3LO{\rm N^3LO} and N4LL{\rm N^4LL} accuracy, leveraging AI-driven functional form selection and machine-learning emulators to enable efficient Markov Chain Monte Carlo sampling and quantify uncertainties.

Original authors: Zhong-Bo Kang, Luke Sellers, Congyue Zhang, Curtis Zhou

Published 2026-04-16
📖 5 min read🧠 Deep dive

This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

The Big Picture: Mapping the Invisible Proton

Imagine the proton (the tiny particle inside an atom's nucleus) not as a solid marble, but as a busy, chaotic city filled with smaller particles called quarks and gluons.

For a long time, physicists have had a very good map of this city showing how the traffic moves forward and backward (longitudinal motion). But they were missing a crucial part of the map: how the traffic swerves side-to-side (transverse motion). This side-to-side movement is called the "Transverse Momentum."

This paper is about creating a high-definition, 3D map of that side-to-side swerving. The authors used data from massive particle collisions (like the Drell-Yan process, where particles smash together and create pairs of leptons) to figure out exactly how these quarks move.

The New Toolkit: AI as the Architect and the Speedster

The authors didn't just crunch numbers; they used Artificial Intelligence (AI) in two clever ways to solve a problem that is usually too hard for humans to do alone.

1. The AI Architect (Designing the Map)

Usually, when scientists try to map something they can't see directly, they have to guess the shape of the curve that fits the data. It's like trying to draw a smooth line through a scatter of dots on a piece of paper.

  • The Old Way: A human would guess a shape (maybe a curve, maybe a wave), draw it, and see if it fits. If it doesn't, they try a different shape. This is slow and biased by what the human thinks the shape should look like.
  • The New Way (This Paper): The authors used an AI agent as an Architect. They told the AI, "Here are the rules of physics (the building codes). Now, go design 100 different blueprints for the curve that fits our data best." The AI generated, tested, and ranked hundreds of mathematical shapes automatically. It found the best design much faster and more creatively than a human could, ensuring they didn't miss a better shape just because it looked "weird."

2. The AI Speedster (The Emulator)

Once they had the best design, they needed to test it against thousands of data points.

  • The Problem: Calculating the physics for every single data point is like trying to bake a cake from scratch every time you want to check if the frosting is right. It takes forever. If you need to check it a million times (which you do in statistics), you'd be baking for years.
  • The Solution: They trained a Machine Learning Emulator. Think of this as a super-fast food critic who has tasted a million cakes. Instead of baking a new cake (doing the full physics calculation), the critic looks at the ingredients and instantly says, "This will taste like a 9.5/10."
  • This "emulator" learned the relationship between the inputs and the results so well that it could predict the outcome in a split second. This allowed the scientists to run their complex statistical tests in days instead of years.

The Detective Work: Two Ways to Measure Uncertainty

The paper compares two different ways of measuring how "sure" they are about their map.

  1. The "Clone Army" Method (Replica Analysis):
    Imagine you have a blurry photo of a suspect. To figure out who it is, you create 100 slightly different versions of the photo (adding random noise to the pixels) and ask 100 different detectives to identify the person. If 95 of them say "It's Bob," you are pretty sure it's Bob. This is the traditional method used in physics.

  2. The "Bayesian Detective" Method (Bayesian Inference):
    This is a more modern, probabilistic approach. Instead of making clones of the photo, the detective starts with a "gut feeling" (a prior belief) about who the suspect might be. As they look at the evidence, they update their belief mathematically. It's like a detective saying, "I thought it was Bob, but the new evidence makes me 90% sure it's Bob, with a 10% chance it's Charlie."

The Result: The authors found that both methods agreed on who the suspect was (the central values of the map were the same). However, the Bayesian method gave a slightly wider range of "maybe" (larger uncertainty bands). This is actually good news! It means the Bayesian method is being more honest about the unknowns, rather than pretending to be more certain than the data allows.

The Findings: What Did They Learn?

  • The "Swerve" is Real: They successfully mapped how quarks move side-to-side.
  • The "Collins-Soper Kernel": This is a fancy name for the "engine" that drives how the quarks' movement changes as the energy of the collision changes. Their new map for this engine matches up very well with other recent discoveries (like those from the Electron-Energy Correlator) and even with supercomputer simulations (Lattice QCD).
  • Consistency: Whether they used the "Clone Army" or the "Bayesian Detective," the final picture of the proton's interior looked very similar.

Why Does This Matter?

Think of the proton as a complex machine. If you want to build a better engine (or understand the universe's fundamental forces), you need to know exactly how every gear turns.

By using AI to design the best mathematical models and to speed up the calculations, this paper provides the most precise map yet of how quarks move sideways inside a proton. It also proves that AI and Bayesian statistics are powerful new tools that can work together to solve problems that were previously too difficult or too slow to tackle.

In short: They used AI to design the best possible map, used a digital "speedster" to draw it quickly, and compared two different ways of measuring the map's accuracy to ensure it's reliable. The result is a clearer, more honest picture of the building blocks of our universe.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →