Dyslexify: A Mechanistic Defense Against Typographic Attacks in CLIP
This paper introduces Dyslexify, a training-free defense mechanism that selectively ablates specific attention heads in CLIP vision encoders to neutralize typographic attacks, significantly improving robustness against text-based manipulations while preserving standard recognition accuracy.