VLMFusionOcc3D: VLM Assisted Multi-Modal 3D Semantic Occupancy Prediction
VLMFusionOcc3D is a multi-modal framework for 3D semantic occupancy prediction in autonomous driving. It uses Vision-Language Models to resolve semantic ambiguities and a weather-aware adaptive fusion mechanism to improve prediction accuracy, particularly under adverse weather conditions.
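
The summary does not specify how the weather-aware adaptive fusion is implemented, so the following is only a minimal sketch of one common way to condition fusion on weather: a scalar gate blends two per-voxel feature vectors. Everything here is an assumption made for illustration, not the authors' method: the function names (`weather_gate`, `fuse_voxel_features`), the scalar `weather_score` in [0, 1], the sigmoid gate with fixed `weight` and `bias`, and the choice that a higher score shifts weight toward the LiDAR features.

```python
import math


def sigmoid(x: float) -> float:
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def weather_gate(weather_score: float, weight: float = 4.0, bias: float = -2.0) -> float:
    """Hypothetical gate: map a weather-severity score in [0, 1] to a blend weight in (0, 1).

    The weight and bias are fixed illustrative constants; in a trained model
    they would be learned parameters.
    """
    return sigmoid(weight * weather_score + bias)


def fuse_voxel_features(camera_feat, lidar_feat, weather_score):
    """Blend camera and LiDAR features for one voxel with a weather-dependent gate.

    Assumption: a larger gate value gives more weight to the LiDAR features.
    The direction is arbitrary here and is not taken from the source.
    """
    if len(camera_feat) != len(lidar_feat):
        raise ValueError("camera and LiDAR features must have the same length")
    g = weather_gate(weather_score)
    # Convex combination: the two weights (1 - g) and g always sum to 1.
    return [(1.0 - g) * c + g * l for c, l in zip(camera_feat, lidar_feat)]


if __name__ == "__main__":
    camera = [1.0, 0.0]
    lidar = [0.0, 1.0]
    for score in (0.0, 0.5, 1.0):
        print(score, fuse_voxel_features(camera, lidar, score))
```

Because the gate forms a convex combination, the fused feature always lies between the two inputs, which keeps the blend well-behaved even when the weather estimate is noisy. A learned version would typically predict the gate per voxel or per channel rather than as a single scalar.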