Dynamic Adversarial Reinforcement Learning for Robust Multimodal Large Language Models
This paper introduces AOT-SFT, a large-scale adversarial dataset, and AOT, a self-play framework that co-evolves an image-editing attacker with a defender MLLM to dynamically generate training data, thereby significantly enhancing the model's perceptual robustness and reducing hallucinations in complex visual scenes.