UrbanGS: A Scalable and Efficient Architecture for Geometrically Accurate Large-Scene Reconstruction

UrbanGS is a scalable reconstruction framework for large-scale urban environments that enhances geometric accuracy through Depth-Consistent D-Normal Regularization and improves memory efficiency via Spatially Adaptive Gaussian Pruning and a unified partitioning scheme.

Changbai Li, Haodong Zhu, Hanlin Chen, Xiuping Liang, Tongfei Chen, Shuwei Shao, Linlin Yang, Huobin Tan, Baochang Zhang

Published 2026-02-23
📖 5 min read🧠 Deep dive

Imagine you are trying to build a perfect, life-sized digital twin of an entire city using only thousands of photos taken from different angles. This is the dream of 3D reconstruction.

For a long time, the best tools for this job were like trying to build a city out of fog. They looked good from a distance, but if you walked up close, the buildings were blurry, the roads were wobbly, and the memory on your computer would explode, causing the whole thing to crash.

Enter UrbanGS. Think of it as a new, super-smart construction crew that can build a massive, hyper-realistic city model without running out of space or time. Here is how they did it, explained through simple analogies:

1. The Problem: The "Foggy City"

Previous methods (like standard 3D Gaussian Splatting) are great for small rooms or single objects. But when you try to scale them up to a whole city, two things go wrong:

  • The Geometry is Wobbly: The buildings look like they are melting. The computer knows what color the wall is, but it doesn't know exactly where the wall is in 3D space.
  • The Memory Explosion: To make the city look detailed, the computer tries to put millions of tiny "pixels" (called Gaussians) everywhere, even in empty sky or far-away mountains. This is like trying to fill a swimming pool with water using a garden hose while also trying to fill the entire ocean with it. Your computer runs out of memory (OOM) and crashes.

2. The Solution: UrbanGS

The authors of this paper built a framework called UrbanGS that fixes these issues using three main "superpowers."

Superpower #1: The "Double-Check" GPS (Depth-Consistent D-Normal Regularization)

Imagine you are blindfolded and trying to find a wall.

  • Old Method: Someone tells you, "The wall is to your left." You turn left, but you don't know how far away it is. You might walk right into it or stop too far away.
  • UrbanGS Method: They use two guides.
    1. The Normal Guide: Tells you the direction of the wall (like a compass).
    2. The Depth Guide: Tells you exactly how far the wall is (like a laser rangefinder).

UrbanGS combines these two. It forces the computer to not only get the direction right but also the distance right. It's like having a GPS that corrects your steering and your speed simultaneously. This ensures the buildings are straight, the roads are flat, and the geometry is rock-solid.

Superpower #2: The "Smart Gardener" (Spatially Adaptive Gaussian Pruning)

Imagine you are painting a giant mural of a city.

  • Old Method: You paint every single brick on every building, even the ones on the far horizon that no one will ever see clearly. You also paint the empty sky with the same density as the busy downtown. This wastes a ton of paint (memory) and time.
  • UrbanGS Method: They hired a "Smart Gardener."
    • If a part of the city is complex (like a busy intersection with trees and cars), the gardener plants dense, detailed Gaussians.
    • If a part is simple (like a clear blue sky or a distant mountain), the gardener prunes (cuts away) the extra Gaussians.
    • The Result: The computer only spends its energy where it matters. It keeps the high-definition details for the things you look at and removes the clutter from the background. This stops the computer from running out of memory.

Superpower #3: The "Puzzle Master" (Partitioning Strategy)

Building a city on one computer is like trying to solve a 10,000-piece puzzle on a tiny coffee table. It's impossible.

  • Old Method: They tried to force the whole puzzle onto the table, or they cut it into pieces but left gaps where the pieces didn't fit together, creating ugly cracks in the middle of the city.
  • UrbanGS Method: They cut the city into manageable "neighborhoods" (blocks) and assigned a different computer (GPU) to build each one.
    • The Secret Sauce: They made sure the "fences" between the neighborhoods overlap slightly. This way, when the computers stitch the neighborhoods back together, there are no cracks or seams. It's like building a city block by block, but ensuring the sidewalks connect perfectly so you can walk from one block to the next without tripping.

The Result

When you put all these together, UrbanGS can:

  • Build Faster: It trains in about 2 hours instead of 20.
  • Look Better: The buildings are sharp, the roads are smooth, and the details (like trees and windows) are crisp.
  • Run Smaller: It fits on a standard high-end gaming card (RTX A5000) without crashing, whereas older methods would crash immediately.

In a Nutshell

If previous methods were like trying to build a city with a leaky bucket and a blurry map, UrbanGS is like having a team of architects with laser scanners, a smart pruning shears, and a perfect puzzle-solving strategy. They built a digital city that is so accurate and efficient, it feels like you could actually walk through it.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →