CustomTex: High-fidelity Indoor Scene Texturing via Multi-Reference Customization

CustomTex is a novel framework that leverages a dual-distillation approach within a Variational Score Distillation (VSD) optimization to generate high-fidelity, instance-level 3D scene textures from reference images, effectively overcoming the limitations of text-driven methods by ensuring precise semantic alignment and superior visual quality with minimal artifacts.

Weilin Chen, Jiahao Rao, Wenhao Wang, Xinyang Li, Xuan Cheng, Liujuan Cao

Published 2026-03-20
📖 5 min read🧠 Deep dive

Imagine you have a digital 3D model of a room—like a video game level or a virtual reality house. Right now, this room is like a ghost house: it has the walls, the sofa, and the cabinets, but they are all painted in a boring, flat gray. They have no personality, no wood grain, no fabric weave, and no style.

Your goal is to "decorate" this ghost house to look real. This is what the paper calls texturing.

The Problem: The "Bad Painter" and the "Vague Client"

Previously, trying to decorate these 3D rooms was like hiring a painter who only speaks in vague riddles or tries to copy a messy collage.

  1. The Text Problem: If you tell a computer, "Make the sofa look like a vintage leather chair," the computer gets confused. It might make the whole room look like leather, or the sofa might look like a weird, blurry blob. It lacks precision.
  2. The Reference Problem: If you show the computer a photo of a nice sofa and say, "Copy this," older methods would try to paste that photo over the whole room. The result? The sofa looks okay, but the chair next to it suddenly looks like a sofa too, or the whole room gets weird shadows baked into the paint (like the sun is permanently stuck in one spot).

The Solution: CustomTex (The "Master Decorator")

The authors created CustomTex, a new tool that acts like a master interior designer with a very specific set of skills.

Here is how it works, using a simple analogy:

1. The "Reference Board" (Instance-Level Control)

Instead of giving the designer one big, confusing instruction, you give them a reference board with specific photos for specific items.

  • You point to the sofa in the 3D room and say, "Here is a photo of the exact leather sofa I want."
  • You point to the cabinet and say, "Here is a photo of the dark wood I want."
  • You point to the wall and say, "Here is the wallpaper pattern."

CustomTex understands that the sofa photo only applies to the sofa, and the wood photo only applies to the cabinet. It doesn't mix them up.

2. The "Two-Step Painting Process" (Dual-Distillation)

This is the secret sauce. The paper says CustomTex uses a "dual-distillation" approach. Let's break that down into two painters working together:

  • Painter A (The Architect): This painter looks at your reference photos and says, "Okay, I understand the idea." They ensure the sofa looks like a sofa, the wood looks like wood, and the patterns match. They handle the meaning and the shape.
    • Analogy: This is like sketching the outline of the furniture to make sure it fits the room and matches the style you asked for.
  • Painter B (The Detail Artist): This painter is obsessed with crispness. They take the sketch from Painter A and zoom in to 400%. They add the tiny scratches on the wood, the weave of the fabric, and the subtle shine. They make sure the image isn't blurry or "mushy."
    • Analogy: This is like taking a low-resolution photo and using a magic lens to make it look like a high-definition 4K photograph.

3. The "No-Baked-Shadows" Rule

Old methods often painted the shadows into the texture. If you moved the virtual sun, the shadows on the wall would look wrong because they were "baked in" (frozen) like a tattoo.

CustomTex is smart enough to paint only the material (the color and pattern), leaving the shadows for the lighting engine to handle later. This means you can change the lighting in your virtual room, and the textures will still look perfect.

Why is this a big deal?

Think of it like the difference between stamping a sticker on a wall versus hand-painting a mural.

  • Old methods were like stamping a sticker: It looked okay from far away, but up close, it was blurry, the edges were messy, and it didn't fit the specific shape of the object.
  • CustomTex is like hand-painting a mural: It fits the sofa perfectly, the wood grain is sharp, the fabric feels real, and you can change the lighting without ruining the look.

In Summary

CustomTex is a new way to decorate 3D rooms. Instead of guessing what you want with words, you show it pictures of exactly what you want for each piece of furniture. It uses a smart two-step process to ensure the room looks semantically correct (the right objects have the right look) and visually stunning (crisp, high-definition details without weird shadows).

It turns a gray, boring 3D model into a photorealistic, customizable home in a matter of minutes, making it much easier for designers, game developers, and architects to create beautiful virtual worlds.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →