UniDrive-WM: Unified Understanding, Planning and Generation World Model For Autonomous Driving

UniDrive-WM is a unified vision-language model that integrates scene understanding, trajectory planning, and trajectory-conditioned future image generation in a single architecture. It achieves state-of-the-art performance on the Bench2Drive benchmark by using its generative predictions to iteratively refine planning and improve scene comprehension.

Zhexiao Xiong, Xin Ye, Burhan Yaman + 5 more · 2026-03-04 · 💻 cs

WristMIR: Coarse-to-Fine Region-Aware Retrieval of Pediatric Wrist Radiographs with Radiology Report-Driven Learning

WristMIR is a framework that combines radiology report-driven learning with a two-stage coarse-to-fine retrieval process to find pediatric wrist radiographs with analogous fracture patterns. It significantly outperforms existing baselines in retrieval accuracy, fracture classification, and clinical relevance, and it requires no manual image annotations.

Mert Sonmezer, Serge Vasylechko, Duygu Atasoy + 2 more · 2026-03-04 · 💻 cs