From the Linear Quadratic Regulator (LQR) to the (Deterministic) Kalman Filter in Two Easy Steps

This paper presents a two-step tutorial demonstrating how to derive the deterministic Kalman filter from the Linear Quadratic Regulator (LQR) by first converting the state estimation problem into a purely quadratic LQR formulation using homogeneous coordinates, and then partitioning the resulting solution to recover the traditional Kalman filter dynamics and Riccati equation.

Original authors: Bassam Bamieh

Published 2026-06-11✓ Author reviewed
📖 6 min read🧠 Deep dive

Original authors: Bassam Bamieh

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are trying to figure out exactly where a lost hiker is in a dense forest. You have two sources of information, but both are flawed:

  1. Your Map (The Model): You know the hiker's general path and speed, but the terrain is tricky, and they might stumble or take a detour.
  2. Your Binoculars (The Measurements): You can see the hiker occasionally, but the trees block your view, and the image is blurry.

The Kalman Filter is the mathematical tool that combines these two imperfect sources to guess the hiker's true location. Usually, this is taught as a complex statistical problem involving "noise" and "probability."

This paper by Bassam Bamieh offers a different, simpler way to look at it. It argues that you don't need to think about random chance at all. Instead, you can treat this as a deterministic puzzle: "What is the simplest possible story that explains what we saw?"

Here is the paper's "Two Easy Steps" to solve this puzzle, explained with everyday analogies.

The Core Idea: "Occam's Razor" for Math

The paper starts with a principle called the Minimal Uncertainty Principle. Imagine you are a detective trying to reconstruct a crime scene. There are infinite ways the crime could have happened.

  • Story A: The suspect ran 5 miles, tripped 10 times, and the witness was hallucinating.
  • Story B: The suspect walked 1 mile, stumbled once, and the witness was slightly blurry-eyed.

The paper says: Choose Story B. Why? Because it requires the least amount of "weirdness" (uncertainty) to make the facts fit. In math terms, we want the story where the "errors" (the stumbling and the blurry vision) are as small as possible.

Step 1: The "Homogeneous Coordinates" Trick

The first hurdle is that the math for this "simplest story" problem is messy. It has a mix of squared terms (like "distance squared") and straight-line terms (like "distance"). It's like trying to bake a cake where the recipe calls for "2 cups of flour" and "a pinch of salt," but the mixing bowl only accepts ingredients in a specific "squared" format.

The Solution: The paper suggests a magic trick called Homogeneous Coordinates.

  • The Analogy: Imagine you have a 2D drawing on a piece of paper. To make the math work, you add a third dimension—a "1" attached to the side of your drawing. Suddenly, your 2D problem becomes a 3D problem where everything fits perfectly into a neat, symmetrical box.
  • What it does: By adding this extra "1" to the system, the messy "mixed" math problem transforms into a perfectly clean, purely "squared" math problem.
  • The Result: This clean problem is exactly the same as a Linear Quadratic Regulator (LQR). If you know how to solve an LQR problem (which is like finding the most fuel-efficient way to drive a car), you can now solve this messy estimation problem.

Why this matters: The paper points out a cool insight here. In control problems (like driving a car), the "extra" math usually represents a pre-planned feedforward signal. In estimation problems (like tracking the hiker), that same "extra" math represents the observer—the part of the system that learns and updates its guess over time.

Step 2: The "Time Reversal" and "Final Guess"

Now that we have a clean, squared problem, we need to solve it. But there's a catch: In a standard driving problem, you know where you started. In this estimation problem, we don't know where the hiker started. We only know where they are now (or rather, we are trying to figure out where they are now based on past data).

The Solution: The paper uses a clever two-part maneuver:

  1. Assume the End: Pretend for a moment that you do know where the hiker ended up at the final moment. If you know the start and the end, the "simplest path" between them is easy to calculate.
  2. Time Reversal: The math for "starting at A and ending at B" is the mirror image of "starting at B and ending at A." The paper flips the problem in time. Instead of asking "How do we get from start to finish?", it asks "If we are at the finish, how did we get here?"
  3. Optimize the Guess: Since we don't actually know the final position, we take the answer from step 2 and ask: "Which final position makes the total 'weirdness' (uncertainty) the smallest?"

The Result: When you do this optimization, the messy equations magically simplify into the famous Kalman Filter equations.

  • The "Observer Gain" (how much you trust the map vs. the binoculars) pops out naturally.
  • The "Riccati Equation" (the complex math that updates the filter) appears as the solution to this "cost-to-arrive" problem.

The Big Picture: Certainty vs. Information

The paper concludes with a fascinating re-interpretation of the math.

  • In the traditional (stochastic) view, the filter calculates a "Covariance Matrix," which tells you how uncertain you are. A big number means "I have no idea."
  • In this paper's view, the math calculates an "Information Matrix" (or "Certainty Matrix").
    • The Analogy: Think of a bowl. If the bowl is very steep and deep, a marble placed inside will roll quickly to the bottom. This means you are very certain about the bottom's location. If the bowl is flat, the marble can roll anywhere; you are uncertain.
    • The paper argues that the matrix SS in their equations measures the steepness of the bowl. A large SS means the "bowl" is steep, meaning the filter is very confident in its estimate.

Summary

This paper doesn't invent a new filter; it rewrites the recipe.

  1. It says: "Stop thinking about random noise. Think about finding the simplest, least-error explanation for your data."
  2. It uses a mathematical trick (homogeneous coordinates) to turn a messy problem into a clean, standard control problem.
  3. It uses time reversal to solve that problem, revealing that the Kalman Filter is just the optimal way to minimize uncertainty in a deterministic world.

It's a "tutorial" that strips away the scary probability theory to show that the Kalman Filter is fundamentally about efficiency and simplicity: choosing the path that requires the fewest assumptions.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →