Learning interpretable and stable dynamical models via mixed-integer Lyapunov-constrained optimization

This paper proposes a mixed-integer quadratically constrained optimization framework that enforces Lyapunov stability constraints on data-driven dynamical models to simultaneously discover interpretable system equations and their corresponding Lyapunov functions, achieving superior predictive accuracy and true model recovery even in noisy conditions.

Zhe Li, Ilias Mitrai

Published 2026-04-10
📖 4 min read☕ Coffee break read

Imagine you are trying to teach a robot how to drive a car. You have a video of a human driving, and you want the robot to learn the rules of the road just by watching.

Most modern AI methods are like a "black box" wizard. They look at the video, guess the rules, and say, "I think the car should turn left here." They might get the answer right 99% of the time, but they can't explain why, and sometimes, they make a dangerous mistake that looks fine in the video but causes a crash in real life.

This paper proposes a different way to teach the robot. Instead of a black box, they want a transparent, rule-following student who also learns a "safety manual" at the same time.

Here is the breakdown of their approach using simple analogies:

1. The Goal: Finding the "True Story"

The authors want to discover the mathematical "story" (the equations) that explains how a system moves. But they have two strict requirements:

  • It must be readable: The story shouldn't be a confusing mess of code; it should look like a clear sentence (e.g., "Speed equals acceleration times time").
  • It must be safe: The story must guarantee that the system eventually settles down and doesn't go crazy (like a pendulum eventually stopping, rather than spinning forever).

2. The Ingredients: LEGO Bricks

Instead of using a giant, complex neural network, the authors build their model using LEGO bricks (called "basis functions").

  • Imagine you have a box of bricks: some are straight lines, some are curves, some are squares, and some are sine waves.
  • The computer's job is to pick the right bricks and snap them together to build the equation that describes the system.
  • They also build a second structure at the same time: a "Safety Net" (called a Lyapunov function). This isn't part of the car's engine; it's a mathematical tool that proves the car will never crash.

3. The Challenge: The "Safety Net" Constraint

Usually, when you teach a computer, you just say, "Make your guess match the video as closely as possible."

  • The Old Way: The computer might guess a model that fits the video perfectly but is physically impossible (like a car that accelerates infinitely). It's accurate on the video but dangerous in reality.
  • The New Way (This Paper): The authors say, "Make your guess match the video, AND you must prove that your 'Safety Net' holds up."

They force the computer to solve a giant puzzle where:

  1. The model must look like the data.
  2. The model must be built only from the selected LEGO bricks (to keep it simple and readable).
  3. The "Safety Net" must show that the system is always losing energy and moving toward a stop (stability).

4. The "Mixed-Integer" Puzzle

This is the hardest part. The computer has to make two types of decisions at once:

  • Continuous decisions: "How big should this brick be?" (e.g., 1.5 or 0.2).
  • Binary decisions: "Do we use this brick at all?" (Yes/No).

This turns the problem into a Mixed-Integer Quadratically Constrained Optimization problem.

  • Analogy: Imagine you are a chef trying to create a new recipe. You have to decide which ingredients to use (Yes/No) and how much of each to add (Continuous numbers), but you also have to prove mathematically that the dish won't explode in the oven.
  • The authors use a super-smart solver (Gurobi) to find the perfect combination of ingredients that satisfies all these rules.

5. The Results: Why It Matters

The authors tested this on two systems: a swinging pendulum (like a clock) and a coupled oscillator (two things vibrating together).

  • Without Noise: The method found the exact correct equations and the correct safety manual instantly.
  • With Noise (The Real World): Real data is messy (like a shaky camera).
    • Old methods (Baselines): When the data was noisy, they got confused. Their models became wildly inaccurate, and their "safety nets" failed.
    • The New Method: Because it was forced to respect the "Safety Net" rules during training, it ignored the noise better. It stayed accurate and kept the system stable, even when the data was messy.

The Bottom Line

This paper is like teaching a student to drive not just by showing them videos, but by forcing them to write down the traffic laws and prove that following those laws will keep them safe.

Even if the video is blurry (noisy data), the student who understands the rules of safety will drive much better than the student who just memorized the video frame-by-frame. The result is a model that is not only accurate but also trustworthy, simple to read, and mathematically guaranteed to be stable.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →