One Model To Enhance Them All: Array Geometry Agnostic Multi-channel Personalized Speech Enhancement
2021 Β· Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka, et al.
Abstract
With the recent surge of video conferencing tools usage, providing high-quality speech signals and accurate captions have become essential to conduct day-to-day business or connect with friends and families. Single-channel personalized speech enhancement (PSE) methods show promising results compared with the unconditional speech enhancement (SE) methods in these scenarios due to their ability to remove interfering speech in addition to the environmental noise. In this work, we leverage spatial information afforded by microphone arrays to improve such systems' performance further. We investigate the relative importance of speaker embeddings and spatial features. Moreover, we propose a new causal array-geometry-agnostic multi-channel PSE model, which can generate a high-quality enhanced signal from arbitrary microphone geometry. Experimental results show that the proposed geometry agnostic model outperforms the model trained on a specific microphone array geometry in both speech quality
Authors
(none)
Tags
Stats
Related papers
- Multi-geometry Spatial Acoustic Modeling For Distant Speech Recognition (2019)6.34
- Personalized Speech Enhancement Without A Separate Speaker Embedding Model (2024)5.24
- Real-time Joint Personalized Speech Enhancement And Acoustic Echo Cancellation (2022)4.52
- Neural Directed Speech Enhancement With Dual Microphone Array In High Noise Scenario (2024)0.00
- Exploring The Potential Of Data-driven Spatial Audio Enhancement Using A Single-channel Model (2024)0.00
- Efficient Multi-channel Speech Enhancement With Spherical Harmonics Injection For Directional Encoding (2023)3.58
- Learning-based Personal Speech Enhancement For Teleconferencing By Exploiting Spatial-spectral Features (2021)6.34
- Automatic Channel Selection And Spatial Feature Integration For Multi-channel Speech Recognition Across Various Array Topologies (2023)8.09