FPGA-based Low-power Speech Recognition With Recurrent Neural Networks
2016 · Minjae Lee, Kyuyeon Hwang, Jinhwan Park, et al.
Abstract
In this paper, a neural network based real-time speech recognition (SR) system is developed using an FPGA for very low-power operation. The implemented system employs two recurrent neural networks (RNNs); one is a speech-to-character RNN for acoustic modeling (AM) and the other is for character-level language modeling (LM). The system also employs a statistical word-level LM to improve the recognition accuracy. The results of the AM, the character-level LM, and the word-level LM are combined using a fairly simple N-best search algorithm instead of the hidden Markov model (HMM) based network. The RNNs are implemented using massively parallel processing elements (PEs) for low latency and high throughput. The weights are quantized to 6 bits to store all of them in the on-chip memory of an FPGA. The proposed algorithm is implemented on a Xilinx XC7Z045, and the system can operate much faster than real-time.
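The 6-bit weight storage mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the weights are quantized, so the symmetric uniform quantizer below is an assumption chosen for simplicity, and the names `quantize_weights`, `dequantize`, and `storage_bytes` are hypothetical helpers, not part of the described system.

```python
def quantize_weights(weights, bits=6):
    """Map float weights to signed integer codes (assumed symmetric uniform scheme)."""
    levels = 2 ** (bits - 1) - 1                 # 31 positive levels for 6 bits
    max_abs = max(abs(w) for w in weights)
    step = max_abs / levels if max_abs > 0 else 1.0
    # Round to the nearest level and clamp to the representable range.
    codes = [max(-levels, min(levels, round(w / step))) for w in weights]
    return codes, step


def dequantize(codes, step):
    """Recover approximate float weights from integer codes."""
    return [c * step for c in codes]


def storage_bytes(num_weights, bits=6):
    """Bytes needed to pack num_weights codes of the given width (rounded up)."""
    return (num_weights * bits + 7) // 8


if __name__ == "__main__":
    w = [0.5, -1.0, 0.25, 0.0]
    codes, step = quantize_weights(w)
    print(codes, dequantize(codes, step))
    # Compare packed 6-bit storage against 32-bit floats for one million weights.
    print(storage_bytes(1_000_000), 1_000_000 * 4)
```

Under this assumed scheme, each weight occupies 6 bits instead of 32, which is the kind of reduction that makes it plausible to hold all weights in on-chip memory.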
Related papers
- E-RNN: Design Optimization For Efficient Recurrent Neural Networks In FPGAs (2018)
- Applying GPGPU To Recurrent Neural Network Language Model Based Fast Network Search In The Real-time LVCSR (2020)
- Accelerating Recurrent Neural Network Language Model Based Online Speech Recognition System (2018)
- A Novel Pyramidal-FSMN Architecture With Lattice-free MMI For Speech Recognition (2018)
- EffCRN: An Efficient Convolutional Recurrent Network For High-performance Speech Enhancement (2023)
- SpeechNet: Weakly Supervised, End-to-end Speech Recognition At Industrial Scale (2022)
- Segmental Recurrent Neural Networks For End-to-end Speech Recognition (2016)
- Long Short-term Memory Based Convolutional Recurrent Neural Networks For Large Vocabulary Speech Recognition (2016)