A Monaural Speech Enhancement Method For Robust Small-footprint Keyword Spotting
2019 Β· Yue Gu, Zhihao Du, Hui Zhang, et al.
Abstract
Robustness against noise is critical for keyword spotting (KWS) in real-world environments. To improve the robustness, a speech enhancement front-end is involved. Instead of treating the speech enhancement as a separated preprocessing before the KWS system, in this study, a pre-trained speech enhancement front-end and a convolutional neural networks (CNNs) based KWS system are concatenated, where a feature transformation block is used to transform the output from the enhancement front-end into the KWS system's input. The whole model is trained jointly, thus the linguistic and other useful information from the KWS system can be back-propagated to the enhancement front-end to improve its performance. To fit the small-footprint device, a novel convolution recurrent network is proposed, which needs fewer parameters and computation and does not degrade performance. Furthermore, by changing the input features from the power spectrogram to Mel-spectrogram, less computation and better performa
Authors
(none)
Tags
Stats
Related papers
- DCCRN-KWS: An Audio Bias Based Model For Noise Robust Small-footprint Keyword Spotting (2023)5.24
- A Separable Temporal Convolution Neural Network With Attention For Small-footprint Keyword Spotting (2021)0.00
- Small-footprint Keyword Spotting With Graph Convolutional Network (2019)10.48
- End-to-end Keyword Spotting Using Neural Architecture Search And Quantization (2021)8.60
- Small-footprint Keyword Spotting Using Deep Neural Network And Connectionist Temporal Classifier (2017)0.00
- Small-footprint Keyword Spotting With Multi-scale Temporal Convolution (2020)0.00
- Separable Temporal Convolution Plus Temporally Pooled Attention For Lightweight High-performance Keyword Spotting (2021)0.00
- Temporal Convolution For Real-time Keyword Spotting On Mobile Devices (2019)15.67