Fastwave: Accelerating Autoregressive Convolutional Neural Networks On FPGA
2020 Β· Shehzeen Hussain, Mojan Javaheripi, Paarth Neekhara, et al.
Abstract
Autoregressive convolutional neural networks (CNNs) have been widely exploited for sequence generation tasks such as audio synthesis, language modeling and neural machine translation. WaveNet is a deep autoregressive CNN composed of several stacked layers of dilated convolution that is used for sequence generation. While WaveNet produces state-of-the art audio generation results, the naive inference implementation is quite slow; it takes a few minutes to generate just one second of audio on a high-end GPU. In this work, we develop the first accelerator platform~\textit\{FastWave\} for autoregressive convolutional neural networks, and address the associated design challenges. We design the Fast-Wavenet inference model in Vivado HLS and perform a wide range of optimizations including fixed-point implementation, array partitioning and pipelining. Our model uses a fully parameterized parallel architecture for fast matrix-vector multiplication that enables per-layer customized latency fine-
Authors
(none)
Tags
Stats
Related papers
- Parallel Wavegan: A Fast Waveform Generation Model Based On Generative Adversarial Networks With Multi-resolution Spectrogram (2019)0.00
- Parallel Wavenet: Fast High-fidelity Speech Synthesis (2017)0.00
- Flowavenet : A Generative Flow For Raw Audio (2018)0.00
- Framewise Wavegan: High Speed Adversarial Vocoder In Time Domain With Very Low Computational Complexity (2022)7.16
- Efficient Neural Audio Synthesis (2018)0.00
- Featherwave: An Efficient High-fidelity Neural Vocoder With Multi-band Linear Prediction (2020)8.35
- Wavefit: An Iterative And Non-autoregressive Neural Vocoder Based On Fixed-point Iteration (2022)9.41
- Fbwave: Efficient And Scalable Neural Vocoders For Streaming Text-to-speech On The Edge (2020)0.00