Postgan: A Gan-based Post-processor To Enhance The Quality Of Coded Speech
2022 Β· Srikanth Korse, Nicola Pia, Kishan Gupta, et al.
Abstract
The quality of speech coded by transform coding is affected by various artefacts especially when bitrates to quantize the frequency components become too low. In order to mitigate these coding artefacts and enhance the quality of coded speech, a post-processor that relies on a-priori information transmitted from the encoder is traditionally employed at the decoder side. In recent years, several data-driven post-postprocessors have been proposed which were shown to outperform traditional approaches. In this paper, we propose PostGAN, a GAN-based neural post-processor that operates in the sub-band domain and relies on the U-Net architecture and a learned affine transform. It has been tested on the recently standardized low-complexity, low-delay bluetooth codec (LC3) for wideband speech at the lowest bitrate (16 kbit/s). Subjective evaluations and objective scores show that the newly introduced post-processor surpasses previously published methods and can improve the quality of coded spee
Authors
(none)
Tags
Stats
Related papers
- Enhancement Of Coded Speech Using A Mask-based Post-filter (2020)8.82
- UBGAN: Enhancing Coded Speech With Blind And Guided Bandwidth Extension (2025)0.00
- Improving Opus Low Bit Rate Quality With Neural Speech Synthesis (2019)10.48
- Analysis By Adversarial Synthesis -- A Novel Approach For Speech Vocoding (2019)3.58
- A DNN Based Post-filter To Enhance The Quality Of Coded Speech In MDCT Domain (2022)6.34
- Boosting Objective Scores Of A Speech Enhancement Model By Metricgan Post-processing (2020)0.00
- Speech Quality Factors For Traditional And Neural-based Low Bit Rate Vocoders (2020)7.16
- Unetgan: A Robust Speech Enhancement Approach In Time Domain For Extremely Low Signal-to-noise Ratio Condition (2020)11.49