SurgCUT3R: Surgical Scene-Aware Continuous Understanding of Temporal 3D Representation
SurgCUT3R is a framework for 3D reconstruction from monocular endoscopic video that addresses two challenges: data scarcity and pose drift. It combines a synthetic data generation pipeline, hybrid supervision, and a hierarchical inference strategy to achieve robust, accurate, and efficient 3D surgical scene understanding.