Equilibrium Propagation for Non-Conservative Systems

This paper proposes a framework that extends Equilibrium Propagation to arbitrary non-conservative systems by modifying the learning-phase dynamics to account for non-reciprocal interactions, thereby enabling the exact computation of cost function gradients and achieving superior performance compared to previous methods.

Original authors: Antonino Emanuele Scurria, Dimitri Vanden Abeele, Bortolo Matteo Mognetti, Serge Massar

Published 2026-06-02
📖 6 min read🧠 Deep dive

Original authors: Antonino Emanuele Scurria, Dimitri Vanden Abeele, Bortolo Matteo Mognetti, Serge Massar

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

The Big Picture: Teaching a Machine Without a "Backward Pass"

Imagine you are trying to teach a robot to recognize a cat in a photo. In the standard way we do this today (called "Backpropagation"), the robot looks at the photo, makes a guess, realizes it's wrong, and then sends a "correction signal" all the way back through its brain, layer by layer, to fix its mistakes.

The problem is that this "backward pass" is very hard to build in real physical machines (like biological brains or silicon chips) because it requires sending information backward in time or across long distances instantly.

Equilibrium Propagation (EP) is a smarter, more physical way to learn. Instead of a backward pass, the robot just relaxes into a "calm state" (equilibrium). It tries two slightly different scenarios:

  1. Free State: The robot looks at the picture and guesses naturally.
  2. Nudged State: Someone gently pushes the robot's final guess toward the correct answer.

By comparing how the robot's brain changed between these two calm states, it can figure out exactly how to adjust its internal settings to get better next time. It's like learning by feeling the difference between "what I thought" and "what I was nudged to think."

The Problem: The "Symmetry" Rule

The original version of this learning method (EP) only worked for systems that follow a strict rule: Symmetry.

Think of a conservative system like a ball rolling on a smooth hill. If the ball rolls from point A to point B, the path it takes is determined by the shape of the hill. If you reverse the path, the physics are the same. In a computer brain, this means if Neuron A talks to Neuron B, Neuron B must talk back to Neuron A with the exact same strength.

However, many real-world systems (and modern AI models) are not like a smooth hill. They are like a river with a current or a one-way street.

  • Non-Conservative Systems: Information flows one way (like in a feedforward network where data goes Input → Hidden → Output, but never backward).
  • The Issue: The old EP method breaks in these systems. It tries to use the "hill" math on a "river," and the learning calculations become wrong. The robot learns the wrong lessons.

The Solution: Two New Methods

The authors propose two new ways to fix this, allowing the "Equilibrium Propagation" method to work on these one-way, non-symmetric systems.

1. Asymmetric EP (AsymEP): The "Local Fix"

Imagine you are trying to balance a scale, but someone keeps secretly adding weight to one side (the non-symmetric part). The old method just ignores this and tries to balance it anyway, which fails.

AsymEP adds a tiny, local "counter-weight" to the scale.

  • How it works: During the "Nudged" phase (when the robot is being pushed toward the right answer), the algorithm adds a special correction term. This term is calculated based on exactly how "lopsided" or "non-symmetric" the connections are.
  • The Analogy: It's like a cyclist riding a bike with a flat tire. The old method just tells them to pedal harder. AsymEP adds a small, local adjustment to the handlebars to compensate for the flat tire, allowing them to ride straight and learn correctly.
  • Result: This allows the system to calculate the exact correct gradient (the right lesson) even when the connections are one-way.

2. Dyadic EP: The "Double-Brain" Approach

If AsymEP is a local fix, Dyadic EP is a bigger architectural change.

  • The Analogy: Imagine you have a complex machine that only works if you have two identical copies of it running side-by-side. One copy represents the "forward" flow, and the other represents a "backward" flow.
  • How it works: The algorithm doubles the number of variables in the system. It creates a new, larger "energy landscape" where the two copies interact. In this doubled space, the messy, one-way river of the original system transforms into a smooth, symmetrical hill again.
  • The Result: Because the math now works on this "doubled" system, the learning is perfect. It's a bit like using a mirror to make a one-way street look like a two-way street so you can apply standard traffic rules.

What They Tested (The Experiments)

The authors didn't just do math; they tested these ideas on real image recognition tasks (like identifying handwritten digits or clothes).

  1. Symmetric Start: They started with networks that were symmetrical (like the old EP). AsymEP learned faster and got better results than the old methods.
  2. Forced Asymmetry: They forced the networks to be very "one-way" (highly asymmetric).
    • The old method (Vector Field) failed miserably, getting results no better than random guessing.
    • AsymEP kept working perfectly, even when the network was completely one-way.
  3. Feedforward Networks: This is the big win. Modern AI (like the ones in your phone) are usually "feedforward" (strictly one-way). The old EP couldn't train these at all. AsymEP successfully trained these networks, proving it can handle the architecture used in most modern AI.
  4. Deep Learning: They tested on a complex dataset (CIFAR-10) with a deep network. AsymEP and Dyadic EP performed almost exactly as well as the standard "Backpropagation" method, which is the gold standard.

Summary

  • The Problem: The cool "Equilibrium Propagation" learning method only worked on symmetrical systems, but real AI and physical systems are often asymmetrical (one-way).
  • The Fix: The authors created AsymEP (which adds a local correction to the learning rule) and Dyadic EP (which doubles the system size to make the math work).
  • The Outcome: These new methods allow this physical, brain-friendly learning style to work on the same types of networks used in modern AI, achieving results just as good as the standard, difficult-to-implement methods.

In short, they figured out how to teach a physical machine using "relaxation" and "local nudges" even when the machine's internal wiring is strictly one-way.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →