Discovering Symbolic Differential Equations with Symmetry Invariants

This paper introduces a novel framework for discovering symbolic differential equations from data by utilizing symmetry invariants as atomic building blocks, thereby ensuring that the resulting equations inherently respect physical laws while improving the accuracy and efficiency of existing discovery methods.

Jianke Yang, Manu Bhat, Bryan Hu, Yadi Cao, Nima Dehmamy, Robin Walters, Rose Yu

Published Wed, 11 Ma
📖 4 min read☕ Coffee break read

Imagine you are a detective trying to solve a mystery. The mystery is how the universe works. You have a pile of clues (data) showing how things move, change, or react—like water flowing through sand, chemicals mixing, or waves crashing. Your goal is to write down the "rulebook" (a mathematical equation) that explains these behaviors perfectly.

This is the job of Symbolic Regression: using computers to find the secret math formulas hidden inside data.

The Problem: The "Infinite Library"

The problem is that the library of possible math formulas is infinite. It's like trying to find the one correct sentence in a library containing every possible combination of words in the English language.

  • If you just let the computer guess randomly, it might find a formula that fits your data perfectly but is actually nonsense (like x + y = z when the real rule is x² - y = z).
  • It might also find a formula that is so incredibly complicated and messy that no human can understand it.
  • Worse, it might find a formula that breaks the laws of physics (like creating energy out of nothing).

The Solution: The "Symmetry" Superpower

This paper introduces a clever trick called Symmetry Invariants.

Think of Symmetry like a rule that says, "If I turn this system around, or move it to a different spot, the rules shouldn't change."

  • Example: If you rotate a circle, it still looks like a circle. The rule describing a circle doesn't care about the angle; it only cares about the distance from the center.

The authors realized that instead of asking the computer to guess formulas using all the raw variables (like x, y, u, v), we should force it to guess formulas using Symmetry Invariants.

The Analogy: The Shape-Shifting Detective
Imagine you are trying to describe a spinning top.

  • The Old Way (Raw Variables): You try to describe the top by listing the exact position of every atom at every second. If the top spins, the positions change wildly. Your formula has to be incredibly complex to track every tiny shift.
  • The New Way (Symmetry Invariants): You realize the top is spinning. So, you stop looking at the spinning atoms and start looking at the distance from the center. No matter how much the top spins, the distance from the center stays the same. This "distance" is the Invariant.

By forcing the computer to only use these "unchanging distances" (invariants) to build its formula, you instantly:

  1. Shrink the Library: You throw away millions of impossible formulas that don't respect the symmetry.
  2. Guarantee Physics: The resulting formula must obey the laws of physics (like conservation of energy or rotation) because it's built from pieces that never change under those laws.
  3. Find Simpler Answers: The formulas become shorter, cleaner, and easier for humans to read.

How It Works in Practice

The researchers took existing AI tools (like Sparse Regression and Genetic Programming) and gave them a new set of "building blocks."

  • Instead of giving the AI blocks labeled x, y, u, and v, they gave it blocks labeled "Distance from Center," "Rotation Speed," or "Total Energy."
  • The AI then builds the equation using only these special blocks.

The Results: A Supercharged Detective

They tested this on three different "mysteries":

  1. Water Waves (Boussinesq Equation): The old AI failed to find the rule. The new AI found it instantly.
  2. Water Flowing Through Sand (Darcy Flow): The old AI got lost in the complexity. The new AI found the simple, elegant rule.
  3. Chemical Reactions (Reaction-Diffusion): Even when the data was noisy (dirty clues) or the symmetry wasn't perfect (the top was slightly wobbly), the new method was much more robust and accurate than the old ones.

Why This Matters

This isn't just about math; it's about trust.
In the past, AI might give you a "black box" answer that works but makes no sense. With this method, the AI gives you a white box answer. It gives you a formula that:

  • Is short and simple.
  • Respects the fundamental laws of nature.
  • Is easy for a human scientist to read and understand.

In a nutshell: This paper teaches AI to stop guessing randomly in the dark and start looking for the "unchanging truths" (invariants) that nature uses. It's like giving the detective a map that only shows the roads that actually exist, rather than letting them drive off a cliff.