Training Dynamics-Aware Multi-Factor Curriculum Learning for Target Speaker Extraction

This paper proposes a training dynamics-aware multi-factor curriculum learning framework for target speaker extraction that jointly schedules multiple difficulty factors and utilizes the TSE-Datamap visualization tool to analyze training dynamics, thereby enabling data-driven progressive learning that significantly improves performance in complex multi-speaker scenarios.

Yun Liu, Xuechen Liu, Xiaoxiao Miao + 1 more2026-03-06💻 cs

Temporal Pooling Strategies for Training-Free Anomalous Sound Detection with Self-Supervised Audio Embeddings

This paper addresses the underexplored role of temporal pooling in training-free anomalous sound detection by proposing and evaluating adaptive strategies, specifically Relative Deviation Pooling (RDP) and a hybrid approach, which achieve state-of-the-art performance across multiple benchmarks and outperform previously reported trained systems.

Kevin Wilkinghoff, Sarthak Yadav, Zheng-Hua Tan2026-03-06💻 cs