Applying GPGPU To Recurrent Neural Network Language Model Based Fast Network Search In The Real-time LVCSR
2020 Β· Kyungmin Lee, Chiyoun Park, Ilhwan Kim, et al.
Abstract
Recurrent Neural Network Language Models (RNNLMs) have started to be used in various fields of speech recognition due to their outstanding performance. However, the high computational complexity of RNNLMs has been a hurdle in applying the RNNLM to a real-time Large Vocabulary Continuous Speech Recognition (LVCSR). In order to accelerate the speed of RNNLM-based network searches during decoding, we apply the General Purpose Graphic Processing Units (GPGPUs). This paper proposes a novel method of applying GPGPUs to RNNLM-based graph traversals. We have achieved our goal by reducing redundant computations on CPUs and amount of transfer between GPGPUs and CPUs. The proposed approach was evaluated on both WSJ corpus and in-house data. Experiments shows that the proposed approach achieves the real-time speed in various circumstances while maintaining the Word Error Rate (WER) to be relatively 10% lower than that of n-gram models.
Authors
(none)
Tags
Stats
Related papers
- Accelerating Recurrent Neural Network Language Model Based Online Speech Recognition System (2018)8.60
- Linguistic Search Optimization For Deep Learning Based LVCSR (2018)0.00
- Fpga-based Low-power Speech Recognition With Recurrent Neural Networks (2016)13.50
- Lpcnet: Improving Neural Speech Synthesis Through Linear Prediction (2018)0.00
- Lattice Rescoring Strategies For Long Short Term Memory Language Models In Speech Recognition (2017)9.76
- Exponential Moving Average Model In Parallel Speech Recognition Training (2017)0.00
- Memory Visualization For Gated Recurrent Neural Networks In Speech Recognition (2016)11.76
- Improved Neural Language Model Fusion For Streaming Recurrent Neural Network Transducer (2020)8.82