An Approach to Simultaneous Acquisition of Real-Time MRI Video, EEG, and Surface EMG for Articulatory, Brain, and Muscle Activity During Speech Production

This paper presents a novel framework for the simultaneous acquisition of real-time MRI, EEG, and surface EMG to capture brain, muscle, and articulatory activity during speech, featuring a specialized artifact suppression pipeline to overcome technical challenges and enable unprecedented insights into speech neuroscience.

Jihwan Lee, Parsa Razmara, Kevin Huang + 16 more2026-03-06🤖 cs.AI

Temporal Pooling Strategies for Training-Free Anomalous Sound Detection with Self-Supervised Audio Embeddings

This paper addresses the underexplored role of temporal pooling in training-free anomalous sound detection by proposing and evaluating adaptive strategies, specifically Relative Deviation Pooling (RDP) and a hybrid approach, which achieve state-of-the-art performance across multiple benchmarks and outperform previously reported trained systems.

Kevin Wilkinghoff, Sarthak Yadav, Zheng-Hua Tan2026-03-06💻 cs

A Large-Scale Probing Analysis of Speaker-Specific Attributes in Self-Supervised Speech Representations

This study conducts a large-scale probing analysis of 11 self-supervised speech models to reveal a hierarchical encoding of speaker attributes, challenging the assumption that final layers are purely linguistic by showing that larger models recover speaker identity in deep layers while intermediate representations better capture dynamic prosody than specialized embeddings.

Aemon Yat Fei Chiu, Kei Ching Fung, Roger Tsz Yeung Li + 2 more2026-03-06💻 cs

ZeSTA: Zero-Shot TTS Augmentation with Domain-Conditioned Training for Data-Efficient Personalized Speech Synthesis

The paper proposes ZeSTA, a domain-conditioned training framework that effectively leverages zero-shot TTS synthetic data for low-resource personalized speech synthesis by distinguishing real and synthetic inputs via lightweight embeddings and real-data oversampling, thereby improving speaker similarity without compromising quality.

Youngwon Choi, Jinwoo Oh, Hwayeon Kim + 1 more2026-03-05🤖 cs.AI