Exploring Single-song Autoencoding Schemes For Audio-based Music Structure Analysis
2021 · Axel Marmoret, Jérémy E. Cohen, Frédéric Bimbot
Abstract
Deep neural networks are now well established as able to learn complex data relations and representations, but they generally rely on large sets of training data. This work explores a "piece-specific" autoencoding scheme, in which a low-dimensional autoencoder is trained to learn a latent (compressed) representation specific to a given song, which can then be used to infer the song's structure. Such a model relies on neither supervision nor annotations, which are well known to be tedious to collect and often ambiguous in Music Structure Analysis. We report that the proposed unsupervised autoencoding scheme reaches the performance level of supervised state-of-the-art methods at a 3-second tolerance when using a Log Mel spectrogram representation on the RWC-Pop dataset.
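The piece-specific idea can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the authors' implementation: the abstract does not specify the architecture, so a tiny linear autoencoder stands in for it; the "song" is synthetic (20 bars following an A A B B A layout) rather than a Log Mel spectrogram; and the function names, latent dimension, and learning rate are hypothetical. The sketch trains on one song only, then builds a self-similarity matrix of the latent codes, which is one common starting point for structure estimation.

```python
import numpy as np


def train_single_song_autoencoder(X, latent_dim=4, epochs=2000, lr=0.1, seed=0):
    """Fit a small linear autoencoder to the rows of X (one row per bar).

    Hypothetical stand-in for the paper's model: plain gradient descent on
    the mean squared reconstruction error, using this one song as the only
    training data.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W_enc = rng.normal(scale=0.1, size=(d, latent_dim))
    W_dec = rng.normal(scale=0.1, size=(latent_dim, d))
    losses = []
    for _ in range(epochs):
        Z = X @ W_enc                # encode each bar
        err = Z @ W_dec - X          # reconstruction error
        losses.append(float(np.mean(err ** 2)))
        # Gradients of the mean squared error w.r.t. both weight matrices,
        # computed before either matrix is updated.
        grad_dec = (2.0 / (n * d)) * (Z.T @ err)
        grad_enc = (2.0 / (n * d)) * (X.T @ (err @ W_dec.T))
        W_enc -= lr * grad_enc
        W_dec -= lr * grad_dec
    return X @ W_enc, losses


def latent_self_similarity(Z):
    """Cosine self-similarity matrix of the latent codes (bars x bars)."""
    Zn = Z / (np.linalg.norm(Z, axis=1, keepdims=True) + 1e-12)
    return Zn @ Zn.T


# Synthetic "song": five sections of four bars each, laid out as A A B B A.
# Each bar is a 16-dimensional feature vector (an assumed toy feature size).
data_rng = np.random.default_rng(1)
prototypes = {"A": data_rng.random(16), "B": data_rng.random(16)}
layout = [label for label in "AABBA" for _ in range(4)]
X = np.stack([prototypes[s] + 0.05 * data_rng.normal(size=16) for s in layout])

Z, losses = train_single_song_autoencoder(X)
S = latent_self_similarity(Z)
```

In a pipeline of this kind, a segmentation step (for example, a novelty curve or dynamic programming over `S`) would then turn the self-similarity matrix into section boundaries; that step is omitted here because the abstract does not describe it.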
Related papers
- Supervised Metric Learning For Music Structure Features (2021)
- SSM-Net: Feature Learning For Music Structure Analysis Using A Self-similarity-matrix Based Loss (2022)
- Sample-level Deep Convolutional Neural Networks For Music Auto-tagging Using Raw Waveforms (2017)
- Music2latent2: Audio Compression With Summary Embeddings And Autoregressive Decoding (2025)
- An Investigation Of The Reconstruction Capacity Of Stacked Convolutional Autoencoders For Log-mel-spectrograms (2023)
- Sample-level CNN Architectures For Music Auto-tagging Using Raw Waveforms (2017)
- The Effect Of Explicit Structure Encoding Of Deep Neural Networks For Symbolic Music Generation (2018)
- Towards Robust Unsupervised Disentanglement Of Sequential Data -- A Case Study Using Music Audio (2022)