AgentRivet: an automated system for producing Rivet routines from journal publications

This paper presents AgentRivet, an automated workflow utilizing Large Language Models to extract physics analysis details from journal publications and generate Rivet routines, thereby addressing the significant gap in coverage for model-independent measurements in particle physics.

Original authors: Antonio J. Costa, Caterina Doglioni, Christian Gütschow, Andrew D. Pilkington, Sukanya Sinha

Published 2026-06-12
📖 4 min read🧠 Deep dive

Original authors: Antonio J. Costa, Caterina Doglioni, Christian Gütschow, Andrew D. Pilkington, Sukanya Sinha

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine the world of particle physics as a massive, high-stakes cooking competition. Scientists at giant machines (like the Large Hadron Collider) cook up complex "dishes" (collisions of particles) and write detailed recipes in scientific papers. They also provide a list of ingredients (data) so other chefs can try to recreate the dish.

However, there's a problem: To truly taste and compare these dishes, other scientists need a specific, standardized kitchen tool called Rivet. Think of Rivet as a specialized, high-tech measuring cup that ensures everyone is measuring the soup the exact same way. Without it, you can't fairly compare your soup to someone else's.

The trouble is, only about 40% of the published recipes come with this special measuring cup. The rest are just written descriptions, which are hard to turn into the precise code needed for the tool.

Enter AgentRivet: The AI Sous-Chef

The authors of this paper built a new system called AgentRivet. Think of it as a team of AI robots designed to read those messy, text-only recipes and automatically build the missing Rivet measuring cups (computer code) for you.

Here is how their "kitchen team" works, using a simple workflow:

  1. The Analyst (The Reader): This AI robot reads the scientific paper and acts like a very careful sous-chef. It doesn't just read; it extracts the exact instructions: "Use 2 lemons," "Chop the onions this way," "Cook for 10 minutes." It turns the messy text into a clean, structured shopping list.
  2. The Coder (The Builder): This robot takes the shopping list and tries to build the actual Rivet tool (which is written in a specific computer language called C++). It's like a robot arm trying to assemble a complex machine based on the instructions.
  3. The Reviewers (The Inspectors): Before the tool is finished, two inspectors check the work.
    • The Code Reviewer checks for technical errors, like using the wrong type of screw or a broken part (syntax errors).
    • The Physics Reviewer checks if the instructions actually match the recipe. Did the robot measure the onions correctly? Did it follow the cooking time?

The "Taste Test" (The Results)

The team tested this AI team on two very recent and complex recipes from the ATLAS and CMS experiments (two major particle physics labs). They asked the AI to build the Rivet tools from scratch.

  • The Good News: The AI team was surprisingly good at the job. They built working tools with very few technical glitches. When they used the tools to measure simulated particle collisions, the results looked very similar to what the human scientists expected.
  • The Bad News (The "Hallucinations"): Sometimes, the AI got confused by vague parts of the recipe.
    • If the paper said, "Do something special with the sauce," but didn't explain exactly how, the AI would guess. Sometimes it guessed right; sometimes it guessed wrong.
    • One AI model (Gemini) sometimes forgot to follow specific instructions about "neutrinos" (a type of invisible particle), while another (Claude) sometimes got stuck in a loop or wrote down its own "thoughts" instead of just the code.
    • The AI struggled the most with the most complex, abstract parts of the recipes, like measuring the "shape" of the event or using complex math formulas that weren't clearly defined.

The Verdict

The paper concludes that AgentRivet is a promising new tool. It can successfully turn about 40% of the "missing" recipes into working code, which is a huge help to the physics community.

However, it's not perfect yet. It still needs a human to look over its shoulder, especially when the original recipe is vague. The authors suggest that in the future, they will teach the AI better by training it on more examples and adding automatic checks to catch errors before a human even sees them.

In short: AgentRivet is an automated team that reads science papers and builds the missing software tools scientists need to compare their data. It works well, but it still makes mistakes when the instructions are unclear, so human experts are still needed to double-check the work.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →