Towards Robust Speech Deepfake Detection via Human-Inspired Reasoning
This paper introduces HIR-SDD, a novel speech deepfake detection framework that leverages Large Audio Language Models and a human-annotated dataset to achieve robust generalization across audio domains while providing interpretable, human-like reasoning for its predictions.