Audio-to-score Alignment Of Piano Music Using Rnn-based Automatic Music Transcription
2017 Β· Taegyun Kwon, Dasaem Jeong, Juhan Nam
Abstract
We propose a framework for audio-to-score alignment on piano performance that employs automatic music transcription (AMT) using neural networks. Even though the AMT result may contain some errors, the note prediction output can be regarded as a learned feature representation that is directly comparable to MIDI note or chroma representation. To this end, we employ two recurrent neural networks that work as the AMT-based feature extractors to the alignment algorithm. One predicts the presence of 88 notes or 12 chroma in frame-level and the other detects note onsets in 12 chroma. We combine the two types of learned features for the audio-to-score alignment. For comparability, we apply dynamic time warping as an alignment algorithm without any additional post-processing. We evaluate the proposed framework on the MAPS dataset and compare it to previous work. The result shows that the alignment framework with the learned features significantly improves the accuracy, achieving less than 10 ms
Authors
(none)
Tags
Stats
Related papers
- Audio-to-score Alignment Using Deep Automatic Music Transcription (2021)0.00
- A Holistic Approach To Polyphonic Music Transcription With Neural Networks (2019)0.00
- Learning Frame Similarity Using Siamese Networks For Audio-to-score Alignment (2020)8.09
- Piano Transcription By Hierarchical Language Modeling With Pretrained Roll-based Encoders (2025)4.52
- Audio-to-score Alignment Using Transposition-invariant Features (2018)0.00
- Towards Efficient And Real-time Piano Transcription Using Neural Autoregressive Models (2024)5.84
- Just Label The Repeats For In-the-wild Audio-to-score Alignment (2024)0.00
- Annotation-free Automatic Music Transcription With Scalable Synthetic Data And Adversarial Domain Confusion (2023)4.52