DreamSAC: Learning Hamiltonian World Models via Symmetry Exploration

DreamSAC is a framework that enhances extrapolative generalization in physics simulations by combining an unsupervised symmetry exploration strategy, which actively probes conservation laws via a Hamiltonian-based curiosity bonus, with a Hamiltonian-based world model that learns invariant physical states from raw observations through a novel contrastive objective.

Jinzhou Tang, Fan Feng, Minghao Fu, Wenjun Lin, Biwei Huang, Keze Wang

Published 2026-03-10
📖 4 min read☕ Coffee break read

Imagine you are trying to teach a robot how to play pool.

The Old Way (Passive Learning):
Most current AI robots learn by watching thousands of hours of pool videos. They are like a student who memorizes every single shot they've ever seen. If the robot sees a red ball hit a blue ball, it knows exactly what happens because it's seen that specific scene a million times.

But here's the problem: If you put a heavier blue ball on the table, or change the friction of the felt, or move the camera to a weird angle, the robot gets confused. It fails. Why? Because it didn't learn the rules of physics (like momentum or gravity); it just learned to recognize patterns in the pixels. It's like memorizing the answer key to a test without understanding the math.

The New Way (DreamSAC):
The paper introduces DreamSAC, a robot that doesn't just watch; it plays. It treats the world like a playground and uses a special "curiosity" to figure out the laws of physics.

Here is how it works, broken down into three simple steps:

1. The "Physics Detective" (Symmetry Exploration)

Instead of waiting for data, the robot actively tries to break things.

  • The Analogy: Imagine a child in a dark room. A passive learner sits still and waits for a light to turn on. A DreamSAC robot is like a child who starts throwing balls at the walls, jumping on furniture, and banging pots.
  • Why? The robot has a special "curiosity bonus." It gets a reward for doing things that cause a big change in energy. It wants to find out: "If I push this heavy box, how much harder do I have to push compared to a light box?"
  • The Result: By actively "breaking symmetry" (doing things that change the system), it gathers the exact kind of data needed to understand the underlying rules, not just the surface appearance.

2. The "Invisible Backpack" (Hamiltonian World Model)

Once the robot gathers this data, it builds a mental model of the world. But this isn't a normal model; it's built on Hamiltonian Physics.

  • The Analogy: Think of a normal AI model as a video game character that just remembers the map. DreamSAC's model is like a character who carries an invisible backpack filled with the laws of physics (conservation of energy, momentum, etc.).
  • The Magic: Even if the robot looks at the pool table from a weird angle (a new camera view), the "backpack" tells it, "Hey, the ball still has mass, and gravity still pulls down." It separates the visual noise (is the camera tilted? is it sunny?) from the physical truth (how heavy is the ball?).
  • The Contrastive Trick: To make sure the robot ignores the camera angle, the researchers use a "spot the difference" game. They show the robot two pictures of the same scene from different angles and say, "Ignore the angle; tell me what's the same about the physics." This forces the robot to learn the invariant (unchanging) laws.

3. The "Dreaming" Phase (Imagination)

The robot doesn't just learn from real life; it dreams.

  • The Analogy: After playing in the real world, the robot goes to sleep and runs simulations in its head. It imagines, "What if I hit the ball with double the force?" or "What if gravity was 50% stronger?"
  • The Benefit: Because its "backpack" contains the actual laws of physics, these dreams are accurate. It can predict what happens in a world it has never visited before.

Why This Matters

The paper shows that DreamSAC is a chameleon of physics.

  • If you change the gravity, it adapts instantly.
  • If you change the camera angle, it doesn't get confused.
  • If you add a new object, it knows how to interact with it.

In a nutshell:
Old AI is like a parrot that repeats what it hears. DreamSAC is like a curious scientist who pokes, prods, and experiments until it figures out why things happen. It doesn't just memorize the world; it learns the code that runs the universe, allowing it to handle situations it has never seen before.