Audio-based Music Classification With Densenet And Data Augmentation
2019 Β· Wenhao Bian, Jie Wang, Bojin Zhuang, et al.
Abstract
In recent years, deep learning technique has received intense attention owing to its great success in image recognition. A tendency of adaption of deep learning in various information processing fields has formed, including music information retrieval (MIR). In this paper, we conduct a comprehensive study on music audio classification with improved convolutional neural networks (CNNs). To the best of our knowledge, this the first work to apply Densely Connected Convolutional Networks (DenseNet) to music audio tagging, which has been demonstrated to perform better than Residual neural network (ResNet). Additionally, two specific data augmentation approaches of time overlapping and pitch shifting have been proposed to address the deficiency of labelled data in the MIR. Moreover, an ensemble learning of stacking is employed based on SVM. We believe that the proposed combination of strong representation of DenseNet and data augmentation can be adapted to other audio processing tasks.
Authors
(none)
Tags
Stats
Related papers
- Sample Mixed-based Data Augmentation For Domestic Audio Tagging (2018)0.00
- D3net: Densely Connected Multidilated Densenet For Music Source Separation (2020)0.00
- Convolutional Gated Recurrent Neural Network Incorporating Spatial Features For Audio Tagging (2017)13.23
- Multi-level And Multi-scale Feature Aggregation Using Pre-trained Convolutional Neural Networks For Music Auto-tagging (2017)15.43
- Acoustic Scene Classification Using Convolutional Neural Network And Multiple-width Frequency-delta Data Augmentation (2016)0.00
- Mmdenselstm: An Efficient Combination Of Convolutional And Recurrent Neural Networks For Audio Source Separation (2018)15.28
- Sample-level CNN Architectures For Music Auto-tagging Using Raw Waveforms (2017)13.23
- A Deep Neural Network For Audio Classification With A Classifier Attention Mechanism (2020)0.00