Speech Bandwidth Expansion Via High Fidelity Generative Adversarial Networks
2024 · Mahmoud Salhab, Haidar Harmanani
Abstract
Speech bandwidth expansion extends the frequency range of low-bandwidth speech signals, thereby improving audio quality, clarity, and perceptibility in digital applications. Its applications span telephony, compression, text-to-speech synthesis, and speech recognition. This paper presents a novel approach using a high-fidelity generative adversarial network. Unlike cascaded systems, our system is trained end-to-end on paired narrowband and wideband speech signals. Our method integrates various bandwidth upsampling ratios into a single unified model specifically designed for speech bandwidth expansion applications. Our approach exhibits robust performance across various bandwidth expansion factors, including those not encountered during training, demonstrating zero-shot capability. To the best of our knowledge, this is the first work to showcase this capability. The experimental results demonstrate that our method outperforms previous end-to-end approaches, as well as in
Related papers
- HiFi++: A Unified Framework For Bandwidth Extension And Speech Enhancement (2022) · 11.93
- Towards High-quality And Efficient Speech Bandwidth Extension With Parallel Amplitude And Phase Prediction (2024) · 0.00
- UBGAN: Enhancing Coded Speech With Blind And Guided Bandwidth Extension (2025) · 0.00
- Bandwidth Extension On Raw Audio Via Generative Adversarial Networks (2019) · 0.00
- Joint Domain Adaptation And Speech Bandwidth Extension Using Time-domain GANs For Speaker Verification (2022) · 4.52
- HiFi-GAN: Generative Adversarial Networks For Efficient And High Fidelity Speech Synthesis (2020) · 0.00
- DSP-informed Bandwidth Extension Using Locally-conditioned Excitation And Linear Time-varying Filter Subnetworks (2024) · 2.26
- SEGAN: Speech Enhancement Generative Adversarial Network (2017) · 21.85