Super Denoise Net: Speech Super Resolution With Noise Cancellation In Low Sampling Rate Noisy Environments
2023 Β· Junkang Yang, Hongqing Liu, Lu Gan, et al.
Abstract
Speech super-resolution (SSR) aims to predict a high resolution (HR) speech signal from its low resolution (LR) corresponding part. Most neural SSR models focus on producing the final result in a noise-free environment by recovering the spectrogram of high-frequency part of the signal and concatenating it with the original low-frequency part. Although these methods achieve high accuracy, they become less effective when facing the real-world scenario, where unavoidable noise is present. To address this problem, we propose a Super Denoise Net (SDNet), a neural network for a joint task of super-resolution and noise reduction from a low sampling rate signal. To that end, we design gated convolution and lattice convolution blocks to enhance the repair capability and capture information in the time-frequency axis, respectively. The experiments show our method outperforms baseline speech denoising and SSR models on DNS 2020 no-reverb test set with higher objective and subjective scores.
Authors
(none)
Tags
Stats
Related papers
- Wave-u-mamba: An End-to-end Framework For High-quality And Efficient Speech Super Resolution (2024)3.58
- Neural Vocoder Is All You Need For Speech Super-resolution (2022)12.25
- Mdctgan: Taming Transformer-based GAN For Speech Super-resolution With Modified DCT Spectra (2023)3.65
- Noise-aware Speech Separation With Contrastive Learning (2023)6.77
- STSR: High-fidelity Speech Super-resolution Via Spectral-transient Context Modeling (2025)0.00
- Audio Super Resolution Using Neural Networks (2017)0.00
- Speech Denoising By Parametric Resynthesis (2019)7.16
- Spatialnet: Extensively Learning Spatial Information For Multichannel Joint Speech Separation, Denoising And Dereverberation (2023)13.88