Source Separation And Depthwise Separable Convolutions For Computer Audition
2020 · Gabriel Mersy, Jin Hong Kuan
Abstract
Given recent advances in deep music source separation, we propose a feature representation method that combines source separation with a state-of-the-art representation learning technique that is suitably repurposed for computer audition (i.e. machine listening). We train a depthwise separable convolutional neural network on a challenging electronic dance music (EDM) data set and compare its performance to convolutional neural networks operating on both source separated and standard spectrograms. It is shown that source separation improves classification performance in a limited-data setting compared to the standard single spectrogram approach.
Authors
(none)
Tags
Stats
Related papers
- Depthwise Separable Convolutions Versus Recurrent Neural Networks For Monaural Singing Voice Separation (2020)0.00
- Voice And Accompaniment Separation In Music Using Self-attention Convolutional Neural Network (2020)0.00
- Mmdenselstm: An Efficient Combination Of Convolutional And Recurrent Neural Networks For Audio Source Separation (2018)15.28
- A Recurrent Encoder-decoder Approach With Skip-filtering Connections For Monaural Singing Voice Separation (2017)9.41
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models (2022)10.97
- Raw Multi-channel Audio Source Separation Using Multi-resolution Convolutional Auto-encoders (2018)11.58
- Music Source Separation In The Waveform Domain (2019)0.00
- Singing Voice Separation Using A Deep Convolutional Neural Network Trained By Ideal Binary Mask And Cross Entropy (2018)11.19