A Network Of Deep Neural Networks For Distant Speech Recognition
2017 Β· Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, et al.
Abstract
Despite the remarkable progress recently made in distant speech recognition, state-of-the-art technology still suffers from a lack of robustness, especially when adverse acoustic conditions characterized by non-stationary noises and reverberation are met. A prominent limitation of current systems lies in the lack of matching and communication between the various technologies involved in the distant speech recognition process. The speech enhancement and speech recognition modules are, for instance, often trained independently. Moreover, the speech enhancement normally helps the speech recognizer, but the output of the latter is not commonly used, in turn, to improve the speech enhancement. To address both concerns, we propose a novel architecture based on a network of deep neural networks, where all the components are jointly trained and better cooperate with each other thanks to a full communication scheme between them. Experiments, conducted using different datasets, tasks and acousti
Authors
(none)
Tags
Stats
Related papers
- Deep Learning For Distant Speech Recognition (2017)0.00
- Ensemble Of Jointly Trained Deep Neural Network-based Acoustic Models For Reverberant Speech Recognition (2016)0.00
- Batch-normalized Joint Training For Dnn-based Distant Speech Recognition (2017)8.82
- STC Speaker Recognition Systems For The Voices From A Distance Challenge (2019)7.81
- Contaminated Speech Training Methods For Robust DNN-HMM Distant Speech Recognition (2017)4.52
- Frequency Domain Multi-channel Acoustic Modeling For Distant Speech Recognition (2019)9.92
- Distributed Training Of Deep Neural Network Acoustic Models For Automatic Speech Recognition (2020)0.00
- Analyzing Large Receptive Field Convolutional Networks For Distant Speech Recognition (2019)5.84