DreamAnywhere: Object-Centric Panoramic 3D Scene Generation

DreamAnywhere is a modular system that generates immersive, editable 360° panoramic 3D scenes from text by synthesizing a panoramic image, decomposing it into background and objects, and reconstructing detailed 3D representations, thereby enabling rapid prototyping and high-fidelity scene exploration for applications like low-budget film production.

Edoardo Alberto Dominici, Jozef Hladky, Floor Verhoeven, Lukas Radl, Thomas Deixelberger, Stefan Ainetter, Philipp Drescher, Stefan Hauswiesner, Arno Coomans, Giacomo Nazzaro, Konstantinos Vardis, Markus Steinberger

Published 2026-02-18
📖 4 min read☕ Coffee break read

Imagine you want to build a magical, fully explorable 3D world just by typing a sentence like, "A mystical beach with giant crabs wearing hats."

In the past, AI could only paint a flat picture of this scene. If you tried to walk around the giant crabs in your imagination, the picture would warp, glitch, or fall apart. Other methods could build 3D rooms, but they often felt like cardboard cutouts that looked great from the front but collapsed if you looked at them from the side.

DreamAnywhere is a new system that solves this by acting like a master architect and a team of specialized construction workers working together. Here is how it works, broken down into simple steps:

1. The Blueprint: The 360° Panorama

Instead of trying to build the whole world at once, the system first draws a 360-degree panoramic photo (like a giant, seamless wallpaper that wraps around you).

  • The Magic Trick: Usually, AI struggles to make these wide photos look consistent. DreamAnywhere uses a special "style guide" (a perspective image) to teach the AI exactly how the light, colors, and mood should look, ensuring the whole world feels like it belongs together.

2. The Separation: Cutting Out the Stars

Once the panoramic wallpaper is done, the system acts like a skilled editor with a pair of scissors.

  • It identifies the "stars" of the show (the giant crabs, the trees, the rocks) and carefully cuts them out.
  • This leaves behind a clean, empty background (the beach, the sky, the sand).
  • Why do this? It's much easier to build a perfect 3D crab if you focus on just the crab, rather than trying to build the crab and the beach simultaneously.

3. The "Rescue Mission": Fixing the Crabs

Sometimes, the cut-out crab looks a bit weird—maybe it's blurry, or the hat is missing a piece because of the cut.

  • DreamAnywhere doesn't just use the blurry cut-out. It sends the crab to a "Resynthesis Station."
  • It asks a smart AI: "Based on the text description and the shape we have, what does a perfect, high-definition version of this crab look like?"
  • It generates a brand new, crystal-clear image of the crab from all angles, then turns that into a 3D model. This ensures the crab looks solid and real, not like a flat sticker.

4. Filling the Holes: The 3D Inpainting

Now, imagine you have a 360° photo of a beach, but you've cut out the crabs. You have big holes in the sand where they used to be.

  • The system uses a technique called "3D Inpainting." Think of this like a magical painter that doesn't just paint over the hole on the flat photo, but actually builds new sand and rocks in 3D space to fill the gap.
  • It ensures that if you walk around the spot where the crab used to be, the sand looks real and consistent, not like a flat painting.

5. The Assembly: Putting It All Together

Finally, the system takes the high-quality 3D crabs and places them back onto the 3D beach.

  • It uses "gravity" logic to make sure the crabs sit on the sand, not floating in the air.
  • It even adds shadows so the crabs look like they are actually touching the ground.

The Result: A World You Can Walk Through

The final output is a 3D Gaussian Splat (a fancy term for a cloud of millions of tiny, colored dots that look like a photo but act like a 3D object).

Why is this a big deal?

  • No More "Flat" Worlds: You can walk 100 meters away from the giant crab, look back, and it still looks 3D and correct. The world doesn't warp or glitch.
  • Easy Editing: Because the system built the world in parts (background + individual objects), you can easily swap the "crab wearing a hat" with a "crab wearing a crown" without rebuilding the whole beach.
  • Speed: It does all this in about 15 minutes, which used to take human artists days or weeks.

In short: DreamAnywhere is like a LEGO set for the AI world. Instead of trying to mold a giant statue out of clay (which cracks easily), it builds the world by creating a perfect base, crafting perfect individual pieces, and snapping them together so you can explore the result from any angle.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →