Learning-based Personal Speech Enhancement For Teleconferencing By Exploiting Spatial-spectral Features
2021 Β· Yicheng Hsu, Yonghan Lee, Mingsian R. Bai
Abstract
Teleconferencing is becoming essential during the COVID-19 pandemic. However, in real-world applications, speech quality can deteriorate due to, for example, background interference, noise, or reverberation. To solve this problem, target speech extraction from the mixture signals can be performed with the aid of the user's vocal features. Various features are accounted for in this study's proposed system, including speaker embeddings derived from user enrollment and a novel long-short-term spatial coherence feature pertaining to the target speaker activity. As a learning-based approach, a target speech sifting network was employed to extract the relevant features. The network trained with LSTSC in the proposed approach is robust to microphone array geometries and the number of microphones. Furthermore, the proposed enhancement system was compared with a baseline system with speaker embeddings and interchannel phase difference. The results demonstrated the superior performance of the pr
Authors
(none)
Tags
Stats
Related papers
- End-to-end Multi-channel Speaker Extraction And Binaural Speech Synthesis (2024)0.00
- SRIB-LEAP Submission To Far-field Multi-channel Speech Enhancement Challenge For Video Conferencing (2021)3.58
- Personalized Speech Enhancement Without A Separate Speaker Embedding Model (2024)5.24
- One Model To Enhance Them All: Array Geometry Agnostic Multi-channel Personalized Speech Enhancement (2021)0.00
- Spatialnet: Extensively Learning Spatial Information For Multichannel Joint Speech Separation, Denoising And Dereverberation (2023)13.88
- SE Territory: Monaural Speech Enhancement Meets The Fixed Virtual Perceptual Space Mapping (2023)0.00
- Efficient Multi-channel Speech Enhancement With Spherical Harmonics Injection For Directional Encoding (2023)3.58
- Audio-visual Target Speaker Enhancement On Multi-talker Environment Using Event-driven Cameras (2019)8.09