Harnessing Chain-of-Thought Reasoning in Multimodal Large Language Models for Face Anti-Spoofing
This paper addresses the generalization limitations of traditional Face Anti-Spoofing by introducing FaceCoT, the first large-scale Visual Question Answering dataset enriched with Chain-of-Thought reasoning and generated via reinforcement learning, alongside a CEPL training strategy that collectively enable Multimodal Large Language Models to achieve superior robustness and interpretability across diverse spoofing attacks.