Beyond Word Error Rate: Auditing the Diversity Tax in Speech Recognition through Dataset Cartography
This paper proposes a robust auditing framework for automatic speech recognition systems that moves beyond traditional Word Error Rate by introducing the Sample Difficulty Index and semantic metrics to quantify and mitigate the "diversity tax" disproportionately affecting marginalized speakers.