MIDAS: Multi-Image Dispersion and Semantic Reconstruction for Jailbreaking MLLMs
This paper proposes MIDAS, a multimodal jailbreak framework that bypasses safety mechanisms in advanced MLLMs by decomposing harmful semantics into risk-bearing subunits dispersed across multiple images and leveraging cross-image reasoning to reconstruct malicious intent, achieving an average attack success rate of 81.46% against closed-source models.