DrivePTS: A Progressive Learning Framework with Textual and Structural Enhancement for Driving Scene Generation
DrivePTS is a progressive learning framework that enhances autonomous driving scene generation by mitigating geometric condition inter-dependencies, enriching semantic context through multi-view hierarchical text descriptions, and improving structural fidelity via frequency-guided loss, thereby achieving state-of-the-art realism and controllability.