MVNet: Memory Assistance and Vocal Reinforcement Network for Speech Enhancement
2022 · Jianrong Wang, Xiaomin Li, Xuewei Li, et al.
Abstract
Speech enhancement improves speech quality and boosts the performance of various downstream tasks. However, most current speech enhancement work is devoted mainly to improving downstream automatic speech recognition (ASR), and only a relatively small amount of work focuses on the automatic speaker verification (ASV) task. In this work, we propose MVNet, which consists of a memory assistance module that improves the performance of downstream ASR and a vocal reinforcement module that boosts the performance of ASV. In addition, we design a new loss function to improve speaker vocal similarity. Experimental results on the Libri2Mix dataset show that our method outperforms baseline methods on several metrics, including speech quality, intelligibility, and speaker vocal similarity.
Related papers
- LSTMSE-Net: Long Short Term Speech Enhancement Network for Audio-visual Speech Enhancement (2024) 8.57
- How to Leverage DNN-based Speech Enhancement for Multi-channel Speaker Verification? (2022) 0.00
- Speech Enhancement Aided End-to-end Multi-task Learning for Voice Activity Detection (2020) 11.49
- SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models (2024) 0.00
- VSANet: Real-time Speech Enhancement Based on Voice Activity Detection and Causal Spatial Attention (2023) 5.24
- SVSNet: An End-to-end Speaker Voice Similarity Assessment Model (2021) 6.34
- MLNET: An Adaptive Multiple Receptive-field Attention Neural Network for Voice Activity Detection (2020) 3.58
- VC-ENHANCE: Speech Restoration with Integrated Noise Suppression and Voice Conversion (2024) 0.00