Comparative Study Of State-based Neural Networks For Virtual Analog Audio Effects Modeling
2024 Β· Riccardo Simionato, Stefano Fasciani
Abstract
Artificial neural networks are a promising technique for virtual analog modeling, having shown particular success in emulating distortion circuits. Despite their potential, enhancements are needed to enable effect parameters to influence the network's response and to achieve a low-latency output. While hybrid solutions, which incorporate both analytical and black-box techniques, offer certain advantages, black-box approaches, such as neural networks, can be preferable in contexts where rapid deployment, simplicity, or adaptability are required, and where understanding the internal mechanisms of the system is less critical. In this article, we explore the application of recent machine learning advancements for virtual analog modeling. We compare State-Space models and Linear Recurrent Units against the more common LSTM networks, with a variety of audio effects. We evaluate the performance and limitations of these models using multiple metrics, providing insights for future research and
Authors
(none)
Tags
Stats
Related papers
- Hyper Recurrent Neural Network: Condition Mechanisms For Black-box Audio Effect Modeling (2024)0.00
- A Comparison Of Recent Waveform Generation And Acoustic Modeling Methods For Neural-network-based Speech Synthesis (2018)11.76
- Efficient Neural Networks For Real-time Modeling Of Analog Dynamic Range Compression (2021)0.00
- A Comparison Of Adaptation Techniques And Recurrent Neural Network Architectures (2018)3.58
- Neural Speech And Audio Coding: Modern AI Technology Meets Traditional Codecs (2024)7.16
- Learning Robust Heterogeneous Signal Features From Parallel Neural Network For Audio Sentiment Analysis (2018)0.00
- Wasserstein GAN And Waveform Loss-based Acoustic Model Training For Multi-speaker Text-to-speech Synthesis Systems Using A Wavenet Vocoder (2018)12.61
- A Comparative Study On Recent Neural Spoofing Countermeasures For Synthetic Speech Detection (2021)0.00