Learning in Low-Dimensional Subspaces: Orthogonal Bottlenecks for Reinforcement Learning

This paper introduces orthogonal bottlenecks, a lightweight, architecture-agnostic mechanism that constrains reinforcement learning representations to low-dimensional subspaces via fixed orthonormal projections, demonstrating both theoretically and empirically that task-relevant value functions can be preserved and often improved upon with minimal dimensionality while stabilizing feature geometry.

Original authors: Aleksandar Todorov, Matthia Sabatelli

Published 2026-05-26✓ Author reviewed
📖 5 min read🧠 Deep dive

Original authors: Aleksandar Todorov, Matthia Sabatelli

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are trying to teach a robot to play a video game or walk across a room. Usually, we give these robots "brains" (neural networks) that are massive and over-engineered, like using a supercomputer to solve a simple math problem. They have millions of connections, processing huge amounts of data, even though the actual task might only require a few simple rules.

This paper asks a simple question: Do these robots actually need such huge brains, or are they just carrying around a lot of unnecessary baggage?

The authors found that the "thoughts" (representations) a robot needs to solve a task are often much simpler and smaller than we think. They discovered a way to force the robot's brain to think in a tiny, efficient space without losing its ability to learn.

Here is the breakdown of their discovery using everyday analogies:

1. The Problem: The Over-Cluttered Desk

Imagine a robot's brain is like a giant, messy desk with thousands of drawers. When the robot tries to figure out what to do, it has to search through all these drawers. Even though the robot only needs three specific tools (a hammer, a screwdriver, and a wrench) to fix a toy, the desk is so big that it wastes time and energy searching through empty drawers.

In technical terms, deep learning agents use high-dimensional representations (huge "desks") even when the task is intrinsically simple.

2. The Solution: The "Orthogonal Bottleneck"

Note on Prior Work: Other researchers have tried to shrink robot brains before, but most of those methods still train the full huge brain first and only compress it afterwards. This paper does something different — it forces the robot to LEARN DIRECTLY IN THE TINY SPACE from the start, so the huge brain never has to be built and trained at all.

The authors propose a clever architectural trick they call an Orthogonal Bottleneck.

Think of this as placing a special, rigid funnel between the robot's eyes (the encoder that sees the world) and its brain (the part that decides what to do).

  • The Funnel: This funnel is fixed; it doesn't move or change shape. It is designed perfectly (mathematically "orthogonal") so that it doesn't squish or distort the information passing through it.
  • The Effect: It forces all the robot's thoughts to pass through a very narrow channel. If the robot's brain was a 1,000-dimensional room, this funnel shrinks it down to a 2-dimensional hallway.

Why "Orthogonal"?
Imagine trying to pour water through a funnel. If the funnel is crooked or lumpy, the water splashes, spills, or gets stuck. But if the funnel is perfectly smooth and straight (orthogonal), the water flows through cleanly without losing any volume or changing its shape. This ensures the robot doesn't lose important information just because the channel is narrow.

3. The Big Discovery: "Small is Enough"

The paper proves two main things:

  • The Theory: If a task has a "true" complexity of, say, 5 dimensions (like needing 5 specific tools), then as long as your funnel is at least 5 units wide, the robot can still solve the task perfectly. It doesn't matter how big the original desk was; the robot can do everything it needs to do inside that small hallway.
  • An Important Caveat: This "small is enough" guarantee only holds because the funnel is ORTHOGONAL. If the funnel were crooked or lumpy (non-orthogonal), the information would get squished and distorted on the way through, and the robot would no longer be able to actually learn the task in the small hallway — even if the hallway is technically wide enough to fit the task. The orthogonality from section 2 isn't a nice-to-have polish; it's what makes the whole theorem work.
  • The Reality Check: They tested this on many different games and robot tasks (from simple balance beams to complex video games like Atari and robot walking simulations).
    • Result: In almost every case, they could shrink the robot's brain down to a tiny size (sometimes just 2 or 3 dimensions!) and the robot performed just as well as the giant-brained version.
    • The "Tipping Point": There is a specific "minimum size" for each task. If the funnel is too small (smaller than the task's true complexity), the robot fails. But as soon as the funnel gets just a little bit bigger than that minimum, the robot's performance snaps back to 100%.

4. Why This Matters: Stability and Clarity

The authors also noticed something interesting about how the robot thinks with this funnel.

  • Without the funnel: The robot's internal "thoughts" can get messy. Some parts of the brain might get huge and loud, while others go silent. This is like a choir where one person is screaming and everyone else is whispering; it's unstable.
  • With the funnel: The robot's thoughts stay balanced. Every part of the small hallway is used equally. This makes the learning process more stable and prevents the robot from "breaking" or forgetting things.

They also tried making the funnel learnable (teaching the robot to build its own funnel), but found that a fixed, pre-made funnel was actually more reliable. It's like giving the robot a pre-fabricated, perfect hallway rather than asking it to build its own while it's trying to walk.

Summary

The paper shows that deep learning agents are often carrying around massive, unnecessary brains. By inserting a simple, fixed, and mathematically perfect "funnel" that forces the agent to think in a tiny, low-dimensional space, we can:

  1. Keep performance high: The robot learns just as well.
  2. Stabilize learning: The robot's internal thoughts stay organized and balanced.
  3. Reveal the truth: It proves that the "true" complexity of many tasks is surprisingly small, hidden inside the massive neural networks we usually build.

Essentially, the authors found a way to tell the robot: "You don't need a mansion to live in; a perfectly designed tiny apartment works just fine."

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →