Absence of poor local minima in matrix product states

This paper resolves the paradox of Matrix Product States (MPS) being highly trainable despite the general trainability issues of quantum circuits by proving that the gauge freedom in MPS induces effective local overparametrization, which eliminates poor local minima and concentrates them near the global minimum.

Original authors: Hao-Kai Zhang, Chenghong Zhu, Shuo Liu, Shi-Xin Zhang, Tao Xiang

Published 2026-06-10
📖 5 min read🧠 Deep dive

Original authors: Hao-Kai Zhang, Chenghong Zhu, Shuo Liu, Shi-Xin Zhang, Tao Xiang

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

The Big Problem: Getting Stuck in the Mud

Imagine you are trying to find the lowest point in a massive, foggy mountain range. This is what scientists do when they try to train quantum computers to solve problems. They use an algorithm called "gradient descent," which is like a hiker blindly feeling their way downhill, step by step, hoping to reach the very bottom (the best solution).

In most modern quantum circuits (specifically ones called "brickwork circuits"), this hiker often gets stuck in a poor local minimum.

  • The Analogy: Imagine the hiker is walking down a mountain but gets trapped in a small, deep valley surrounded by high walls. They think they are at the bottom because they can't go any lower, but in reality, there is a much deeper valley (the true solution) just over the next ridge.
  • The Result: The quantum computer gets stuck, thinks it has found the answer, but the answer is actually terrible. This is a major reason why training quantum computers is so difficult.

The Mystery: Why Do MPS Work So Well?

For decades, scientists have used a different method called Matrix Product States (MPS) to solve quantum problems. It's like a very successful, old-school hiking technique that has worked perfectly for 30 years.

  • The Paradox: MPS can be built using the exact same type of "steps" (quantum circuits) as the brickwork circuits that get stuck. Yet, MPS almost never gets stuck in those bad valleys. It always finds the true bottom.
  • The Question: Why does this specific arrangement of steps work so reliably, while others fail?

The Discovery: The "Magic Compass" (Gauge Freedom)

The authors of this paper solved the mystery. They found that MPS has a special hidden feature called gauge freedom.

  • The Analogy: Imagine you are navigating a maze. In a standard maze (brickwork circuits), the walls are fixed. If you hit a dead end, you are stuck.
    In an MPS maze, the walls are made of sliding glass panels. You can slide these panels left or right without changing the actual path you need to take to get to the exit. This is the "gauge freedom."
  • The Insight: Because you can slide these panels, you can always rearrange the maze so that the part of the path you are currently looking at is over-parameterized.
    • Over-parameterization is like having 100 different keys for a single lock. Even if you pick the wrong key, you have so many other options nearby that you can easily wiggle your way out of a bad spot.
    • In MPS, the ability to slide the "orthogonality center" (the part of the calculation you are focusing on) means that no matter where you are, you can always rearrange the view so that you have too many keys for the lock. This creates a "safe zone" where the landscape is smooth and convex, making it impossible to get stuck in a bad valley.

The Proof: It's All About the View

The paper proves two main things mathematically:

  1. The View Doesn't Matter: Whether you look at the MPS from the left, the right, or the middle (moving the orthogonality center), the statistical "map" of the landscape looks exactly the same. The bad valleys don't appear just because you changed your perspective.
  2. The "Good" Valleys: Because of this sliding ability, the "bad valleys" (poor local minima) are mathematically forced to concentrate right next to the "true bottom" (the global minimum).
    • The Analogy: In a bad circuit, the bad valleys are scattered everywhere like landmines. In an MPS circuit, the bad valleys are all clustered together right next to the treasure chest. So, even if you think you found a "bad" spot, you are actually standing right next to the solution.

The Experiment: The Race

To prove this, the authors ran a race between three types of circuits:

  1. Sequential Circuits (MPS): The "sliding panel" method.
  2. Brickwork Circuits: The standard, rigid method.
  3. Sloping Brickwork Circuits: A hybrid version.

They gave them all a random, difficult mountain range to climb (random Hamiltonians).

  • The Result: The Sequential (MPS) circuits always found the bottom. The Brickwork circuits got stuck in the shallow, bad valleys, especially as the mountains got bigger.

The Takeaway

The paper concludes that the secret to making quantum algorithms trainable isn't just making the circuits bigger or deeper. It's about structure.

By using a structure (MPS) that allows for "sliding panels" (gauge freedom), you create a situation where the computer is effectively "over-equipped" with options at every single step. This ensures that the computer never gets truly stuck in a bad spot, making it a much more reliable tool for solving quantum problems.

In short: MPS works because it has a built-in "undo" button that lets it rearrange its own path to avoid getting stuck, ensuring it always finds the best solution.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →