Automated Assessment of Kidney Ureteroscopy Exploration for Training

This paper presents a novel, automated ureteroscope video-based framework that generates reference reconstructions to localize camera poses and identify missed calyces during kidney phantom explorations, thereby enabling effective out-of-the-operating-room training with objective feedback without requiring expert supervision.

Fangjie Li, Nicholas Kavoussi, Charan Mohan, Matthieu Chabanas, Jie Ying Wu

Published 2026-02-19
📖 4 min read☕ Coffee break read

Imagine you are learning to drive a very tricky, winding mountain road. The road is inside a giant, dark cave (your kidney), and your goal is to visit every single little side-cave (called a calyx) to make sure you don't leave any hidden treasures (kidney stones) behind.

Right now, learning to drive this road is hard. You need a driving instructor sitting right next to you in the car, watching every move, and only giving you feedback after you've finished the whole trip. This is expensive, scary (because it happens on real patients), and there aren't enough instructors for everyone.

This paper introduces a "Smart Driving Coach" that works on a practice model (a fake kidney) and gives you instant, automatic feedback without needing a human to watch you.

Here is how it works, broken down into simple steps:

1. The "Perfect Map" (Stage 1)

Before any student tries to drive the road, the experts take a slow, super-careful tour of the fake kidney. They record this perfect journey.

  • The Analogy: Think of this like a cartographer slowly walking through a cave with a high-end 3D scanner, creating a perfect, digital "Google Earth" map of every nook and cranny.
  • The Tech: The computer uses this slow video to build a 3D model of the kidney's inside. It also marks exactly where the camera was at every second. This becomes the "Reference Model." Once this map is made for a specific fake kidney, it can be used forever for any student.

2. The "Student's Drive" (Stage 2)

Now, a trainee (a student surgeon) takes the camera and explores the same fake kidney at a normal, real-world speed. They might move fast, the camera might get blurry, or they might shake the camera.

  • The Problem: Usually, if the video is shaky or blurry, computers get lost and can't tell where they are.
  • The Solution: The system doesn't try to build a new map from the student's messy video. Instead, it acts like a GPS that matches your current view to the Perfect Map.
    • Even if the student's video is blurry, the computer grabs a clear frame, looks at the Perfect Map, and says, "Ah, that blurry spot looks like the entrance to the 'Left Upper Cave' on the map!"
    • It does this frame-by-frame, stitching the student's messy journey onto the perfect map.

3. The "Report Card" (The Result)

Once the student finishes, the system compares their journey against the Perfect Map.

  • The Output: It generates a visual report showing the kidney. It lights up the caves the student visited in Green and leaves the missed ones Red.
  • The Score: It tells the student exactly which "side-caves" they missed. No human needs to watch the video to tell them this; the computer does it automatically in about 10 minutes.

Why is this a big deal?

  • No Extra Hardware: You don't need expensive sensors attached to the camera (which can make the tool too thick to use). It just uses the video from the camera itself.
  • Safe Practice: Students can practice as much as they want on the fake kidney without risking a real patient.
  • Instant Feedback: Instead of waiting for a boss to say, "You missed a spot," the student gets a score immediately, like a video game.

Did it work?

The researchers tested this with 15 different student explorations.

  • Accuracy: The system correctly identified missed caves 93% of the time.
  • Precision: It knew where the camera was within 4 millimeters (about the width of a pencil eraser).
  • Speed: It took about 10 minutes to grade a 2-minute video.

The Bottom Line

This paper presents a new way to train surgeons. Instead of relying on a human teacher to watch every second of a practice session, this system acts like an automatic tour guide. It builds a perfect map of the terrain once, and then checks every student's journey against that map to tell them exactly where they went wrong. This could help more surgeons learn faster, safer, and more effectively.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →