Audio-to-score Alignment Using Deep Automatic Music Transcription
2021 Β· Federico Simonetta, Stavros Ntalampiras, Federico Avanzini
Abstract
Audio-to-score alignment (A2SA) is a multimodal task consisting in the alignment of audio signals to music scores. Recent literature confirms the benefits of Automatic Music Transcription (AMT) for A2SA at the frame-level. In this work, we aim to elaborate on the exploitation of AMT Deep Learning (DL) models for achieving alignment at the note-level. We propose a method which benefits from HMM-based score-to-score alignment and AMT, showing a remarkable advancement beyond the state-of-the-art. We design a systematic procedure to take advantage of large datasets which do not offer an aligned score. Finally, we perform a thorough comparison and extensive tests on multiple datasets.
Authors
(none)
Tags
Stats
Related papers
- Audio-to-score Alignment Of Piano Music Using Rnn-based Automatic Music Transcription (2017)0.00
- Annotation-free Automatic Music Transcription With Scalable Synthetic Data And Adversarial Domain Confusion (2023)4.52
- Just Label The Repeats For In-the-wild Audio-to-score Alignment (2024)0.00
- Audio-to-score Alignment Using Transposition-invariant Features (2018)0.00
- End-to-end Real-world Polyphonic Piano Audio-to-score Transcription With Hierarchical Decoding (2024)0.00
- Note-level Singing Melody Transcription For Time-aligned Musical Score Generation (2025)5.24
- Reconvat: A Semi-supervised Automatic Music Transcription Framework For Low-resource Real-world Data (2021)10.85
- Learning Frame Similarity Using Siamese Networks For Audio-to-score Alignment (2020)8.09