Neural Network Alternatives To Convolutive Audio Models For Source Separation
2017 · Shrikant Venkataramani, Y. Cem Subakan, Paris Smaragdis
Abstract
The convolutive Non-Negative Matrix Factorization (NMF) model factorizes a given audio spectrogram using frequency templates that have a temporal dimension. In this paper, we present a convolutional auto-encoder model that acts as a neural network alternative to convolutive NMF. Using the modeling flexibility granted by neural networks, we also explore the idea of using a Recurrent Neural Network in the encoder. Experimental results on speech mixtures from the TIMIT dataset indicate that the convolutive architecture provides a significant improvement in separation performance in terms of BSSeval metrics.
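As a rough illustration of what "frequency templates with a temporal dimension" means, the sketch below reconstructs a spectrogram from convolutive NMF factors using the standard formulation, in which the approximation is the sum over t of W[t] times a copy of H shifted right by t frames. The function names, array shapes, and toy values are assumptions made for this example and are not taken from the paper.

```python
import numpy as np

def shift_right(H, t):
    """Shift the columns of H right by t frames, zero-filling on the left."""
    if t == 0:
        return H.copy()
    out = np.zeros_like(H)
    out[:, t:] = H[:, :-t]
    return out

def conv_nmf_reconstruct(W, H):
    """Approximate a spectrogram as sum_t W[t] @ shift_right(H, t).

    W: (T, F, K) non-negative templates; each of the K templates spans T frames.
    H: (K, N) non-negative activations over N time frames.
    Returns an (F, N) non-negative array.
    """
    T = W.shape[0]
    return sum(W[t] @ shift_right(H, t) for t in range(T))

# Toy example (all sizes are arbitrary, hypothetical values).
rng = np.random.default_rng(0)
T, F, K, N = 4, 6, 2, 10
W = rng.random((T, F, K))
H = np.zeros((K, N))
H[0, 3] = 1.0  # a single activation of template 0 at frame 3

V_hat = conv_nmf_reconstruct(W, H)
```

With a single activation at frame 3, the reconstruction places the T-frame template 0 in columns 3 through 6 and leaves every other column at zero, which is the sense in which each template carries its own temporal extent.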
Related papers
- End-to-end Non-negative Autoencoders For Sound Source Separation (2019)
- End-to-end Source Separation With Adaptive Front-ends (2017)
- MMDenseLSTM: An Efficient Combination Of Convolutional And Recurrent Neural Networks For Audio Source Separation (2018)
- Generalized Multichannel Variational Autoencoder For Underdetermined Source Separation (2018)
- Complex NMF Under Phase Constraints Based On Signal Modeling: Application To Audio Source Separation (2016)
- Raw Multi-channel Audio Source Separation Using Multi-resolution Convolutional Auto-encoders (2018)
- End-to-end Networks For Supervised Single-channel Speech Separation (2018)
- Independence-based Joint Dereverberation And Separation With Neural Source Model (2021)