FireRedASR2S: A State-of-the-Art Industrial-Grade All-in-One Automatic Speech Recognition System

The paper introduces FireRedASR2S, a state-of-the-art industrial-grade all-in-one automatic speech recognition system that unifies high-performance modules for speech transcription, voice activity detection, language identification, and punctuation prediction, achieving superior results across Mandarin, Chinese dialects, and English benchmarks compared to existing solutions.

Kaituo Xu, Yan Jia, Kai Huang, Junjie Chen, Wenpeng Li, Kun Liu, Feng-Long Xie, Xu Tang, Yao HuThu, 12 Ma⚡ eess

UAV-Based 3D Spectrum Sensing: Insights on Altitude, Bandwidth, Trajectory, and Effective Antenna Patterns on REM Reconstruction

This paper presents a comprehensive analysis of UAV-based 3D spectrum sensing and Radio Environment Map (REM) reconstruction, demonstrating that robust algorithms like simple Kriging and Gaussian Process Regression, combined with altitude-aware trajectory planning, increased bandwidth, and airframe-induced antenna pattern calibration, significantly enhance mapping accuracy even under sparse sampling and complex shadowing conditions.

Mushfiqur Rahman, Sung Joon Maeng, Ismail Guvenc, Chau-Wai Wong, Mihail Sichitiu, Jason A. Abrahamson, Arupjyoti BhuyanThu, 12 Ma⚡ eess

Multi-Modal Intelligent Channel Modeling: From Fine-tuned LLMs to Pre-trained Foundation Models

This paper proposes and compares two novel paradigms for multi-modal intelligent channel modeling in 6G systems—fine-tuned Large Language Models (LLM4CM) and a pre-trained Wireless Channel Foundation Model (WiCo)—both grounded in the Synesthesia of Machines concept to enable precise, scalable, and physically interpretable channel prediction across complex communication environments.

Lu Bai, Zengrui Han, Mingran Sun, Xiang ChengThu, 12 Ma⚡ eess

High-Fidelity Digital Twin Dataset Generation for Inverter-Based Microgrids Under Multi-Scenario Disturbances

This paper introduces a high-fidelity, labeled digital twin dataset generated from a MATLAB/Simulink electromagnetic transient model of a ten-inverter microgrid, capturing synchronized three-phase waveforms and power metrics across eleven distinct operating and disturbance scenarios to serve as a robust benchmark for training surrogate models and analyzing cyber-physical resilience.

Osasumwen Cedric Ogiesoba-Eguakun, Kaveh Ashenayi, Suman RathThu, 12 Ma⚡ eess

Over-the-Air Consensus-based Formation Control of Heterogeneous Agents: Communication-Rate and Geometry-Aware Convergence Guarantees

This paper proposes a formation control framework for heterogeneous autonomous agents that leverages the superposition property of wireless multiple access channels to compute convex combinations of neighbor signals, thereby guaranteeing convergence to a prescribed formation under time-varying graphs and unknown channel coefficients while significantly reducing communication overhead compared to traditional interference-avoiding protocols.

Michael Epp, Fabio Molinari, Jörg RaischThu, 12 Ma⚡ eess

In-Situ Timing Diagnosis of PDN and Configuration-Upset-Induced Routing Delay Degradation in SRAM-based FPGAs

This paper presents a scalable, non-intrusive in-situ timing diagnosis architecture for SRAM-based FPGAs that utilizes distributed phase-swept delay monitoring to probabilistically characterize and spatially differentiate between global power-distribution-network degradation and localized configuration-induced routing perturbations during normal operation.

Mostafa DarvishiThu, 12 Ma⚡ eess

Data-Driven Successive Linearization for Optimal Voltage Control

This paper proposes a data-driven successive linearization method for optimal voltage control in power distribution systems that overcomes the infeasibility of fixed linear approximations under nonlinear power flow constraints by dynamically updating linearizations around the most recent operating point, thereby ensuring fast convergence and adaptability to load variations.

Yiwei Dong, Wenqi Cui, Han Xu, Adam Wierman, Steven LowThu, 12 Ma⚡ eess

ParaS2S: Benchmarking and Aligning Spoken Language Models for Paralinguistic-aware Speech-to-Speech Interaction

This paper introduces ParaS2S, a reinforcement learning framework and corresponding benchmark (ParaS2SBench) that utilizes a novel PolyTone-trained automatic judge to effectively align speech-to-speech models with paralinguistic cues, achieving superior performance in response content and speaking style compared to supervised fine-tuning while requiring fewer paired demonstrations.

Shu-wen Yang, Ming Tu, Andy T. Liu, Xinghua Qu, Hung-yi Lee, Lu Lu, Yuxuan Wang, Yonghui WuMon, 09 Ma⚡ eess

SAAIPAA: Optimizing aspect-angles-invariant physical adversarial attacks on SAR target recognition models

This paper introduces SAAIPAA, a physics-based framework that optimizes the placement of corner reflectors to execute aspect-angle-invariant physical adversarial attacks against SAR target recognition models, achieving high fooling rates even when the attacker lacks knowledge of the SAR platform's viewing angles.

Isar Lemeire, Yee Wei Law, Sang-Heon Lee, William Meakin, Tat-Jun ChinMon, 09 Ma⚡ eess

CECGSR: Circular ECG Super-Resolution

This paper proposes Circular ECG Super-Resolution (CECGSR), a closed-loop framework that leverages negative feedback and a Plug-and-Play strategy to outperform existing open-loop methods in reconstructing high-resolution ECG signals from low-resolution, noisy inputs.

Honggui Li, Zhengyang Zhang, Dingtai Li, Sinan Chen, Nahid Md Lokman Hossain, Hantao Lu, Ruobing Wang, Xinfeng Xu, Yinlu Qin, Yuting Feng, Maria Trocan, Dimitri Galayko, Amara Amara, Mohamad SawanMon, 09 Ma⚡ eess