HorizonForge: Driving Scene Editing with Any Trajectories and Any Vehicles

HorizonForge is a unified framework that enables photorealistic, controllable driving scene generation with arbitrary trajectories and vehicles by combining editable Gaussian-Mesh representations and noise-aware video diffusion, significantly outperforming existing methods in fidelity and consistency while introducing the HorizonSuite benchmark for standardized evaluation.

Yifan Wang, Francesco Pittaluga, Zaid Tasneem, Chenyu You, Manmohan Chandraker, Ziyu Jiang

Published 2026-03-03
📖 4 min read☕ Coffee break read

Imagine you are watching a movie of a car driving down a busy street. Now, imagine you want to change the story: maybe the main car needs to take a sharp left turn instead of going straight, or you want to magically insert a bright red sports car into the traffic, or perhaps you want to make a slow-moving truck disappear entirely.

Doing this in real life is impossible. Doing it in a computer simulation is usually very hard because the computer either makes the scene look fake (like a cartoon) or it breaks the physics (the new car floats through the air).

HorizonForge is a new tool that solves this problem. Think of it as a "Magic Editing Kit" for driving videos.

Here is how it works, using some simple analogies:

1. The "Digital Clay" (3D Assets)

Most video editors just work with flat pictures (2D). If you try to move a car in a flat picture, it looks weird and stretched.

HorizonForge is different. First, it takes the original video and turns it into 3D "Digital Clay."

  • The Analogy: Imagine taking a photo of a room and turning it into a virtual room made of millions of tiny, glowing dust particles (called Gaussian Splats). These particles hold all the colors and shapes of the real world.
  • Why it matters: Because the scene is now 3D "clay," you can reach in, grab a car, move it to a new spot, or spin it around, and the computer knows exactly how the light should hit it from every angle. It's like sculpting with clay instead of painting on a canvas.

2. The "Ghost Writer" (Language & Trajectories)

How do you tell the computer what to do? You don't need to be a coder.

  • The Analogy: You can just talk to the scene or draw a line on a map.
    • Voice: "Put a gray sedan right in front of that SUV."
    • Map: "Make the main car turn left here."
  • The system understands these instructions. If you ask for a specific car, it can even generate a 3D model of that car from scratch (like a 3D printer in the computer) and drop it into the scene perfectly.

3. The "Reality Polisher" (Video Diffusion)

When you move the 3D clay around, the raw result might look a little glitchy or "digital." It needs a final polish to look like a real movie.

  • The Analogy: Think of a Video Diffusion Model as a super-smart film editor and special effects artist combined.
  • It looks at your "glitchy" 3D edit and says, "Okay, I see you moved the car. Now, let me fix the shadows, smooth out the motion, and make sure the background doesn't flicker."
  • Crucially, it remembers the flow of time. It ensures that if a car turns left, the wheels spin and the background blurs naturally, just like in a real movie. It doesn't just fix one frame; it fixes the whole sequence so it feels real.

Why is this a big deal?

Currently, testing self-driving cars is dangerous and expensive. You can't easily create a "what if" scenario where a child runs into the street or a truck brakes suddenly.

  • The Old Way: You had to build a fake city in a video game (which looks fake) or wait years to catch a rare accident on camera.
  • The HorizonForge Way: You take a normal video, tell the computer "Make the truck brake hard," and it instantly creates a photorealistic, physics-perfect video of that exact event.

The "Magic" Results

The paper tested this against other methods and found:

  • It looks real: Humans preferred HorizonForge over other methods 91% of the time.
  • It's consistent: The cars don't flicker or change shape as they move.
  • It's flexible: You can change the path of the driver, add new cars, or remove obstacles, all with simple instructions.

In short: HorizonForge is like having a time machine and a director's chair for driving videos. It lets you rewrite the rules of the road safely, instantly, and with a level of realism that makes it look like you were actually there.