KnowVal: A Knowledge-Augmented and Value-Guided Autonomous Driving System

KnowVal is a novel autonomous driving system that enhances planning performance and safety by synergistically integrating visual-language reasoning with a comprehensive driving knowledge graph and a value-guided trajectory assessment model, achieving state-of-the-art results on multiple benchmarks.

Zhongyu Xia, Wenhao Chen, Yongtao Wang, Ming-Hsuan Yang

Published 2026-03-13
📖 4 min read☕ Coffee break read

Imagine you are teaching a brand-new driver how to navigate the world.

The Old Way (Current AI):
Most self-driving cars today learn like a student who only watches hours of driving videos. They mimic exactly what a human driver did in a specific situation.

  • The Problem: If the human driver in the video made a mistake, the AI learns that mistake. If the AI encounters a weird situation it hasn't seen before (like a cow on the road in a snowstorm), it panics because it has no "common sense" or "rules" to fall back on. It's like a student who memorized the answers to a math test but doesn't understand why the math works.

The New Way (KnowVal):
The paper introduces KnowVal, a self-driving system that acts less like a mimic and more like a wise, experienced driver who carries a rulebook and a moral compass.

Here is how KnowVal works, broken down into three simple parts:

1. The "Eagle Eye" that Talks to a "Librarian"

Most cars just look at the road with cameras. KnowVal has two superpowers that talk to each other:

  • The Eagle Eye (Perception): It sees everything, even the weird stuff. It notices a puddle, a pedestrian in a dark coat, or that it's raining at night.
  • The Librarian (Knowledge Retrieval): Instead of just guessing what to do, the Eagle Eye asks the Librarian: "Hey, I see a puddle and a pedestrian. What does the rulebook say?"
  • The Magic: The Librarian pulls up the exact traffic law and the "defensive driving principle" (e.g., "Slow down near puddles so you don't splash the pedestrian").
  • The Loop: If the Librarian says, "I need more info to be sure," the Eagle Eye zooms in to look closer. They work together to understand the scene perfectly.

2. The "Rulebook" (The Knowledge Graph)

Think of this as a massive, organized library of everything a driver should know. It's not just a list of laws; it's a web of connections.

  • It contains Traffic Laws (e.g., "Stop at red lights").
  • It contains Defensive Driving (e.g., "If it's foggy, increase your distance").
  • It contains Ethics (e.g., "Never swerve into a crowd to avoid a small obstacle").
  • The Innovation: Unlike other systems that might "hallucinate" (make things up) when asked a question, KnowVal retrieves the exact original text of the rule. It doesn't summarize it; it quotes the law directly to ensure accuracy.

3. The "Conscience" (The Value Model)

This is the most important part. Once the car sees the road and reads the rules, it needs to decide: "Is this a good idea?"

  • Imagine the car generates three possible paths:
    1. Speed up to beat the light.
    2. Slow down and wait.
    3. Swerve slightly.
  • The Value Model acts like a strict but fair judge. It looks at the "Conscience" (the retrieved rules) and scores each path.
    • Path 1: "You might make it, but you risk hitting a pedestrian. Score: -1 (Bad)."
    • Path 2: "Safe, legal, and polite. Score: +1 (Good)."
  • The car then picks the path with the highest score. This ensures the car isn't just efficient; it's safe, legal, and kind.

Why is this a big deal?

The researchers tested KnowVal in real-world simulations and found:

  • Fewer Crashes: It had the lowest collision rate on a major test dataset (nuScenes).
  • Better Decisions: It handled tricky situations (like driving through a tunnel or avoiding splashing pedestrians) much better than current top-tier systems.
  • Explainable: If you ask the car, "Why did you stop?" it can tell you: "I stopped because I saw a pedestrian near a puddle, and Rule 8 says to slow down to avoid splashing them."

In a nutshell:
Current self-driving cars are like parrots that repeat what they've seen. KnowVal is like a thoughtful human who sees the world, checks the rulebook, thinks about what is right, and then makes a safe decision. It combines the eyes of a machine with the wisdom of a human.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →