Multi-resolution Fully Convolutional Neural Networks For Monaural Audio Source Separation
2017 Β· Emad M. Grais, Hagen Wierstorf, Dominic Ward, et al.
Abstract
In deep neural networks with convolutional layers, each layer typically has fixed-size/single-resolution receptive field (RF). Convolutional layers with a large RF capture global information from the input features, while layers with small RF size capture local details with high resolution from the input features. In this work, we introduce novel deep multi-resolution fully convolutional neural networks (MR-FCNN), where each layer has different RF sizes to extract multi-resolution features that capture the global and local details information from its input features. The proposed MR-FCNN is applied to separate a target audio source from a mixture of many audio sources. Experimental results show that using MR-FCNN improves the performance compared to feedforward deep neural networks (DNNs) and single resolution deep fully convolutional neural networks (FCNNs) on the audio source separation problem.
Authors
(none)
Tags
Stats
Related papers
- Multi-band Multi-resolution Fully Convolutional Neural Networks For Singing Voice Separation (2019)5.84
- Raw Multi-channel Audio Source Separation Using Multi-resolution Convolutional Auto-encoders (2018)11.58
- Mmdenselstm: An Efficient Combination Of Convolutional And Recurrent Neural Networks For Audio Source Separation (2018)15.28
- Audio Source Separation Via Multi-scale Learning With Dilated Dense U-nets (2019)0.00
- Evolving Multi-resolution Pooling CNN For Monaural Singing Voice Separation (2020)9.03
- Speech Separation Using An Asynchronous Fully Recurrent Convolutional Neural Network (2021)0.00
- Deep Residual Echo Suppression And Noise Reduction: A Multi-input FCRN Approach In A Hybrid Speech Enhancement System (2021)8.09
- Interleaved Multitask Learning For Audio Source Separation With Independent Databases (2019)0.00