Monocular Endoscopic Tissue 3D Reconstruction with Multi-Level Geometry Regularization

This paper proposes a novel 3D Gaussian Splatting framework for monocular endoscopic tissue reconstruction that integrates surface-aware mesh constraints and multi-level geometric regularization to achieve both real-time rendering and high-quality, physically plausible deformable surface reconstruction.

Yangsen Chen, Hao Wang

Published 2026-02-25
📖 4 min read☕ Coffee break read

Imagine you are a surgeon performing a delicate operation inside a patient's body. You are looking through a tiny camera (an endoscope) that gives you a 2D view of squishy, moving organs. To perform robot-assisted surgery safely, the robot needs a perfect, real-time 3D map of what it's seeing. But here's the problem: human tissue is soft, it stretches, it folds, and it gets covered by surgical tools. Creating a 3D movie of this from a single camera angle is like trying to guess the shape of a squishy balloon just by looking at its shadow.

This paper introduces a new "magic trick" to solve that problem. The authors, Yangsen Chen and Hao Wang, have built a system that creates a smooth, real-time 3D model of moving tissue using a technology called 3D Gaussian Splatting.

Here is how they did it, explained with everyday analogies:

1. The Problem: The "Floating Cloud" Issue

Previous methods tried to build these 3D models using "NeRFs" (Neural Radiance Fields). Think of NeRFs like a very slow, high-quality painter. They can make a beautiful picture, but it takes hours to paint one frame, and the robot can't wait that long.

Other newer methods use "3D Gaussian Splatting." Imagine this as a cloud of millions of tiny, fuzzy balloons (Gaussians) that float in space to form the shape of the tissue. It's incredibly fast (like a video game), but because the balloons are fuzzy, they often don't stick together perfectly. The result looks like a wobbly, glitchy cloud rather than a smooth skin. Sometimes, parts of the tissue look like they are floating in mid-air (artifacts) or the surface looks bumpy and unrealistic.

2. The Solution: "The Mesh Skeleton"

The authors realized that to make the fuzzy balloons look like smooth skin, they needed a skeleton to hold them in place.

  • Step 1: Build a Wireframe (The Mesh): First, they take the very first frame of the video and build a rigid 3D wireframe mesh (like a digital chicken wire) of the tissue surface.
  • Step 2: The "Velcro" Trick: They then attach their fuzzy balloons (Gaussians) directly to this wireframe. Imagine gluing the balloons onto the chicken wire. Now, the balloons can't float away or drift into weird shapes; they are forced to follow the smooth surface of the wireframe. This solves the "bumpy surface" problem.

3. The Challenge: The "Moving Jello" Problem

Once the tissue starts moving (because the surgeon is pulling or cutting it), the wireframe needs to move too. But tissue isn't a rigid rock; it's like Jello. It bends in some places and stays stiff in others.

If you just let the balloons move freely, the 3D model might twist into impossible shapes. The authors added two "rules of physics" to keep the movement realistic:

  • Rule A: The "Local Stiffness" (Semi-Rigidity): They identified key spots on the tissue (like where blood vessels cross) and told the balloons: "Hey, the area right around these spots shouldn't stretch too much. Stay stiff like a rock." This is like putting a little splint on a specific part of a bending finger.
  • Rule B: The "Global Flow" (Non-Rigidity): For the rest of the tissue, they told the balloons: "You can bend and stretch, but you must move smoothly with your neighbors. Don't let one balloon fly off in a different direction than the one next to it." This ensures the whole piece of tissue moves like a single, cohesive unit of Jello, not a bag of marbles.

4. Handling the "Blind Spots"

During surgery, tools often block the camera's view, creating black holes in the video. The authors added a "video inpainting" feature. Think of this as a smart AI artist that looks at the parts of the tissue hidden by the tool and guesses what the tissue looks like underneath based on how it was moving a second ago. It fills in the missing pieces so the 3D model doesn't have holes.

The Result: Fast, Smooth, and Real

The result is a system that is:

  • Fast: It renders the 3D scene in real-time (over 100 times faster than the old "painter" methods).
  • Smooth: The tissue looks like real skin, not a glitchy cloud.
  • Accurate: It handles the stretching and bending of organs without breaking the 3D model.

In summary: The authors took a fast but messy 3D technology (the fuzzy balloons) and gave it a rigid skeleton (the mesh) and a set of smart movement rules (local stiffness and global flow). This allows a surgical robot to see a perfect, real-time 3D map of squishy, moving organs, making surgery safer and more precise.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →