Perceptual Musical Features For Interpretable Audio Tagging
2023 Β· Vassilis Lyberatos, Spyridon Kantarelis, Edmund Dervakos, et al.
Abstract
In the age of music streaming platforms, the task of automatically tagging music audio has garnered significant attention, driving researchers to devise methods aimed at enhancing performance metrics on standard datasets. Most recent approaches rely on deep neural networks, which, despite their impressive performance, possess opacity, making it challenging to elucidate their output for a given input. While the issue of interpretability has been emphasized in other fields like medicine, it has not received attention in music-related tasks. In this study, we explored the relevance of interpretability in the context of automatic music tagging. We constructed a workflow that incorporates three different information extraction techniques: a) leveraging symbolic knowledge, b) utilizing auxiliary deep neural networks, and c) employing signal processing to extract perceptual features from audio files. These features were subsequently used to train an interpretable machine-learning model for ta
Authors
(none)
Tags
Stats
Related papers
- Toward Interpretable Music Tagging With Self-attention (2019)0.00
- How Low Can You Go? Reducing Frequency And Time Resolution In Current CNN Architectures For Music Auto-tagging (2019)4.52
- Combining High-level Features Of Raw Audio Waves And Mel-spectrograms For Audio Tagging (2018)0.00
- Sample-level CNN Architectures For Music Auto-tagging Using Raw Waveforms (2017)13.23
- An Empirical Study Of Weakly Supervised Audio Tagging Embeddings For General Audio Representations (2022)0.00
- Multi-level And Multi-scale Feature Aggregation Using Pre-trained Convolutional Neural Networks For Music Auto-tagging (2017)15.43
- Sample-level Deep Convolutional Neural Networks For Music Auto-tagging Using Raw Waveforms (2017)0.00
- Automatic Tagging Using Deep Convolutional Neural Networks (2016)0.00