Learning of Population Dynamics: Inverse Optimization Meets JKO Scheme

This paper introduces iJKOnet\texttt{iJKOnet}, a novel approach that combines the JKO scheme with inverse optimization to learn population dynamics via end-to-end adversarial training, offering theoretical guarantees and improved performance without requiring restrictive architectural constraints like input-convex neural networks.

Mikhail Persiianov, Jiawei Chen, Petr Mokrov, Alexander Tyurin, Evgeny Burnaev, Alexander Korotin

Published 2026-03-04
📖 5 min read🧠 Deep dive

Imagine you are a detective trying to figure out how a crowd of people moves through a city, but you have a major problem: you can't watch them move.

You only have a series of "snapshots" (photos) taken at different times.

  • Photo 1: People are gathered in a park.
  • Photo 2: People are scattered near a coffee shop.
  • Photo 3: People are clustered around a bus stop.

You don't know who went where. You don't know if the person in the red shirt in Photo 1 is the same person in the red shirt in Photo 3. You just see the shape of the crowd changing over time.

Your goal is to figure out the invisible rules (the "energy") that are pushing and pulling these people. Is there a magnet pulling them to the coffee shop? Is there a wind blowing them toward the bus? Is there a natural tendency for them to spread out?

This is the problem of Learning Population Dynamics.

The Old Way: The "Perfect Map" Problem

Previous methods tried to solve this by assuming they could draw a perfect, continuous map connecting every single person from Photo 1 to Photo 2.

  • The Flaw: In the real world (like studying cells in a lab or stock prices), you often destroy the sample to measure it, or you can't track individuals. You can't draw that perfect map.
  • The Result: Old methods were either too slow, required impossible assumptions (like "everyone must move in a straight line"), or needed you to pre-calculate the map before you could even start learning the rules.

The New Way: iJKOnet (The "Smart Guessing Game")

The authors of this paper introduce a new method called iJKOnet. Think of it as a high-stakes game of "Hot and Cold" played between two AI agents.

The Players

  1. The Rule-Maker (The Energy Function): This AI tries to invent a set of invisible rules (like "gravity" or "attraction") that explain why the crowd moves the way it does.
  2. The Map-Maker (The Transport Map): This AI tries to figure out the most efficient way to move the crowd from the "Park" shape to the "Coffee Shop" shape according to the rules invented by the Rule-Maker.

The Game Loop

  1. The Setup: The Rule-Maker says, "I think the crowd moves because they are attracted to coffee."
  2. The Test: The Map-Maker tries to move the crowd based on that rule. It pushes the people from the Park toward the Coffee Shop.
  3. The Reality Check: The system compares the Map-Maker's result with the actual Photo 2 (the real Coffee Shop crowd).
    • If they match: Great! The Rule-Maker's guess was good.
    • If they don't match: The system says, "Nope, your rules are wrong. The crowd didn't move like that."
  4. The Twist (Inverse Optimization): Here is the clever part. Instead of just fixing the map, the system punishes the Rule-Maker. It forces the Rule-Maker to change its rules until the "cost" of moving the crowd matches the reality perfectly.

It's like trying to guess the recipe for a soup.

  • You taste the soup (the real data).
  • You guess the ingredients (the rules).
  • You cook a batch based on your guess.
  • If the taste is wrong, you don't just tweak the cooking; you realize your recipe was wrong and rewrite it entirely.

Why is this a Big Deal?

1. No "Input-Convex" Headaches
Old methods forced the AI to use very specific, rigid types of neural networks (like forcing a chef to only use square pans). This paper says, "Nope, use whatever tools you want." This makes the AI much more flexible and powerful.

2. It Works Without Tracking
Because it looks at the shapes of the crowds rather than individual people, it works perfectly for situations where you can't track individuals, like:

  • Biology: Watching how stem cells turn into different tissues (you kill the cell to look at it, so you can't watch it grow).
  • Finance: Predicting how stock prices move based on the distribution of prices at different times.
  • Traffic: Figuring out how pedestrians flow through a station without tracking every single person.

3. It's End-to-End
Old methods were like building a car engine, then building a transmission, then trying to bolt them together. If they didn't fit, you had to start over. This new method builds the engine and transmission at the same time, learning from each other instantly.

The Bottom Line

iJKOnet is a new detective tool. It looks at a series of blurry, disconnected photos of a moving crowd and figures out the invisible laws of physics that are driving them. It does this by playing a smart guessing game where it learns from its mistakes, without needing to know the identity of every single person in the crowd.

This allows scientists to understand complex systems—from how diseases spread to how cells develop—using data that was previously too messy or incomplete to analyze.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →