Supervised Metric Learning For Music Structure Features
2021 · Ju-Chiang Wang, Jordan B. L. Smith, Wei-Tsung Lu, et al.
Abstract
Music structure analysis (MSA) methods traditionally search for musically meaningful patterns in audio: homogeneity, repetition, novelty, and segment-length regularity. Hand-crafted audio features such as MFCCs or chromagrams are often used to elicit these patterns. However, with more annotations of section labels (e.g., verse, chorus, and bridge) becoming available, one can use supervised feature learning to make these patterns even clearer and improve MSA performance. To this end, we take a supervised metric learning approach: we train a deep neural network to output embeddings that are near each other for two spectrogram inputs if both have the same section type (according to an annotation), and otherwise far apart. We propose a batch sampling scheme to ensure the labels in a training pair are interpreted meaningfully. The trained model extracts features that can be used in existing MSA algorithms. In evaluations with three datasets (HarmonixSet, SALAMI, and RWC), we demonstrate tha
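The core idea in the abstract (embeddings close together for inputs with the same section type, far apart otherwise) can be illustrated with a small pairwise contrastive loss. This is a minimal NumPy sketch, not the authors' implementation: the abstract does not name the loss they use, so the contrastive form, the `margin` parameter, and the function name `pairwise_contrastive_loss` are all assumptions made for illustration. The `track_ids` mask is likewise an assumed reading of the batch sampling scheme, namely that section labels are only compared between excerpts of the same track.

```python
import numpy as np


def pairwise_contrastive_loss(emb, labels, track_ids, margin=1.0):
    """Illustrative contrastive loss over all valid pairs in a batch.

    emb:       (n, d) array of embeddings, one per spectrogram excerpt.
    labels:    length-n section labels (e.g. "verse", "chorus").
    track_ids: length-n track identifiers (hypothetical; assumes labels
               are only comparable within one track).
    margin:    assumed hyperparameter; different-label pairs closer than
               this distance are penalized.
    """
    emb = np.asarray(emb, dtype=float)
    labels = np.asarray(labels)
    track_ids = np.asarray(track_ids)

    # Euclidean distance between every pair of embeddings.
    diff = emb[:, None, :] - emb[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))

    same_track = track_ids[:, None] == track_ids[None, :]
    same_label = labels[:, None] == labels[None, :]

    # Count each unordered pair once and skip self-pairs.
    upper = np.triu(np.ones_like(same_track, dtype=bool), k=1)
    valid = same_track & upper

    pos = valid & same_label    # same section type: pull together
    neg = valid & ~same_label   # different section type: push apart

    pos_loss = dist[pos] ** 2
    neg_loss = np.maximum(0.0, margin - dist[neg]) ** 2

    n_pairs = int(pos.sum() + neg.sum())
    if n_pairs == 0:
        return 0.0
    return float((pos_loss.sum() + neg_loss.sum()) / n_pairs)


if __name__ == "__main__":
    emb = [[0.0, 0.0], [1.0, 0.0], [0.5, 0.0]]
    labels = ["verse", "verse", "chorus"]
    track_ids = [0, 0, 0]
    print(pairwise_contrastive_loss(emb, labels, track_ids))
```

In a real training setup the embeddings would come from the network and the loss would be written in an autodiff framework; the sketch only shows which pairs are pulled together and which are pushed apart.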
Related papers
- SSM-Net: Feature Learning For Music Structure Analysis Using A Self-Similarity-Matrix Based Loss (2022)
- SongFormer: Scaling Music Structure Analysis With Heterogeneous Supervision (2025)
- Convolutive Block-Matching Segmentation Algorithm With Application To Music Structure Analysis (2022)
- Exploring Single-Song Autoencoding Schemes For Audio-Based Music Structure Analysis (2021)
- Supervised And Unsupervised Learning Of Audio Representations For Music Understanding (2022)
- To Catch A Chorus, Verse, Intro, Or Anything Else: Analyzing A Song With Structural Functions (2022)
- MusicTM-Dataset For Joint Representation Learning Among Sheet Music, Lyrics, And Musical Audio (2020)
- Learning Music Audio Representations Via Weak Language Supervision (2021)