PolyBench: A Benchmark for Compositional Reasoning in Polyphonic Audio
This paper introduces PolyBench, a new benchmark designed to evaluate compositional reasoning in polyphonic audio across five distinct tasks, revealing that current Large Audio Language Models consistently struggle with the complexity of concurrent sound events.