Ssm-net: Feature Learning For Music Structure Analysis Using A Self-similarity-matrix Based Loss
2022 Β· Geoffroy Peeters, Florian Angulo
Abstract
In this paper, we propose a new paradigm to learn audio features for Music Structure Analysis (MSA). We train a deep encoder to learn features such that the Self-Similarity-Matrix (SSM) resulting from those approximates a ground-truth SSM. This is done by minimizing a loss between both SSMs. Since this loss is differentiable w.r.t. its input features we can train the encoder in a straightforward way. We successfully demonstrate the use of this training paradigm using the Area Under the Curve ROC (AUC) on the RWC-Pop dataset.
Authors
(none)
Tags
Stats
Related papers
- Supervised Metric Learning For Music Structure Features (2021)0.00
- Exploring Single-song Autoencoding Schemes For Audio-based Music Structure Analysis (2021)0.00
- Convolutive Block-matching Segmentation Algorithm With Application To Music Structure Analysis (2022)0.00
- SSAMBA: Self-supervised Audio Representation Learning With Mamba State Space Model (2024)0.00
- Songformer: Scaling Music Structure Analysis With Heterogeneous Supervision (2025)4.25
- Audio Mamba: Selective State Spaces For Self-supervised Audio Representations (2024)9.23
- Muq: Self-supervised Music Representation Learning With Mel Residual Vector Quantization (2025)15.66
- Learning Complex Basis Functions For Invariant Representations Of Audio (2019)0.00