Agentic Exploration of Physics Models

This paper introduces SciExplorer, a large language model-based agent capable of autonomously discovering the laws of unknown physical systems through iterative experimentation and code execution without requiring domain-specific tuning or instructions.

Original authors: Maximilian Nägele, Florian Marquardt

Published 2026-03-17
📖 6 min read🧠 Deep dive

This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are handed a mysterious, locked box. You can't see inside, and you don't know what's making the noises coming from it. Your job is to figure out exactly how the machine inside works just by shaking it, listening to the sounds, and watching how it reacts when you push it.

This is essentially what scientists do when they study the universe: they observe nature, guess the rules (hypotheses), test those guesses, and refine them until they find the "law" that explains everything.

This paper introduces SciExplorer, a new kind of AI scientist designed to do this detective work entirely on its own.

The Problem: The "Specialist" vs. The "Generalist"

In the past, we've built AI tools that are like specialized apprentices.

  • If you want to predict the weather, you use a weather AI.
  • If you want to design a new drug, you use a chemistry AI.
  • If you want to solve a specific math equation, you use a math AI.

These tools are great at their one job, but if you ask a weather AI to design a drug, it's completely lost. They need to be taught exactly what to do for every single new task.

SciExplorer is different. It's like a curious, super-smart intern who has read every physics textbook ever written but has never seen the specific machine in front of them. You don't give it a manual or a step-by-step guide. You just say, "Figure out how this thing works," and it has to figure out the rest.

How SciExplorer Works: The "Try, Fail, Learn" Loop

The paper describes SciExplorer as an "agentic" system. Think of it as a robot with a brain (a Large Language Model, or LLM) and a set of hands (computer tools).

Here is the cycle it goes through, using a simple analogy: The Detective's Notebook.

  1. The Plan (Hypothesis Generation):
    The AI looks at the mystery system. It says, "Okay, I think this might be a swinging pendulum. Or maybe it's a wave. Let's test that."

    • Analogy: The detective looks at a crime scene and says, "It looks like a robbery. Let's check for fingerprints."
  2. The Experiment (Tool Use):
    The AI writes a computer program to simulate an experiment. It might say, "I'm going to push the system hard from the left and see what happens." It runs this code.

    • Analogy: The detective sets up a trap or interviews a witness to see if their theory holds up.
  3. The Analysis (Observation):
    The AI looks at the results. Did the system swing? Did it stop? Did it explode? It draws graphs and plots to see patterns.

    • Analogy: The detective looks at the evidence. "Hmm, the witness said it was raining, but the ground is dry. My theory about the robbery is wrong."
  4. The Pivot (Self-Correction):
    If the theory was wrong, the AI doesn't give up. It says, "Okay, it's not a pendulum. Maybe it's a spring? Let's try a different experiment." It writes new code, runs new tests, and updates its notebook.

    • Analogy: The detective realizes the suspect is actually a spy, not a robber, and starts looking for different clues.
  5. The Solution (Discovery):
    Eventually, the AI finds a set of rules (an equation) that perfectly predicts what the system will do in any situation. It writes down the final formula.

What Did It Actually Do?

The researchers tested SciExplorer on three very different types of "mystery boxes":

  • Mechanical Systems (The Swinging Things): They gave it data from things like double pendulums (two swings attached) and particles moving in 2D. The AI had to figure out the exact equations of motion (like Newton's laws) just by watching the movement.

    • Result: It successfully "rediscovered" the laws of physics for these systems, often getting a perfect score.
  • Wave Systems (The Ripples): They gave it data on how waves move through a grid of connected points (like ripples in a pond or light waves). The AI had to guess the complex equations that govern these waves.

    • Result: It figured out complex wave equations, including ones with "non-linear" effects (where the wave changes its own shape).
  • Quantum Systems (The Tiny Spins): This is the hardest part. They gave it data on quantum particles (spins) and asked it to find the "Hamiltonian" (the master rulebook that dictates how these tiny particles interact).

    • Result: Even though quantum physics is notoriously difficult and counter-intuitive, the AI successfully identified the correct interaction rules for these systems.

Why Is This a Big Deal?

  1. No "Cheating": The AI wasn't told what kind of system it was looking at. It didn't know "This is a pendulum." It just knew "Here is some data, find the rule."
  2. It's a Generalist: It didn't need to be retrained for each new task. The same AI solved the pendulum, the wave, and the quantum problems using the same "brain."
  3. It Handles Noise: Real-world data is messy. The researchers added "noise" (random errors) to the data, and SciExplorer could still figure out the correct laws, just like a human scientist would.
  4. It's Fast (for an AI): While it takes the AI a few minutes to an hour to solve a problem, the researchers estimate that a human expert would take much longer to do the same "open-ended" exploration from scratch.

The Limitations

The paper is honest about where the AI struggles. Sometimes it gets "stubborn." If it guesses a model early on, it might stick to it even when the data says it's wrong. It also sometimes misses subtle visual clues in the graphs that a human might spot immediately. It's not perfect yet, but it's a massive leap forward.

The Bottom Line

SciExplorer is a proof-of-concept that we are moving toward a future where AI can act as a true scientific partner. Instead of just crunching numbers for us, it can look at a mystery, design its own experiments, analyze the results, and discover new laws of physics—all without needing a human to hold its hand and tell it exactly what to do next.

It's like giving a robot a library of all human knowledge and a set of tools, then saying, "Go explore the universe and tell us what you find." And for the first time, the robot is actually starting to find things on its own.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →