SoraNav: Adaptive UAV Task-Centric Navigation via Zeroshot VLM Reasoning
SoraNav is a novel framework that enables zero-shot Vision-Language Model reasoning for UAV task-centric navigation by integrating Multi-modal Visual Annotation to encode 3D geometric priors and an Adaptive Decision Making strategy to validate commands, thereby significantly outperforming existing methods in both 2.5D and complex 3D environments.