Deep Domain Adaptation For Polyphonic Melody Extraction
2022 Β· Kavya Ranjan Saxena, Vipul Arora
Abstract
Extraction of the predominant pitch from polyphonic audio is one of the fundamental tasks in the field of music information retrieval and computational musicology. To accomplish this task using machine learning, a large amount of labeled audio data is required to train the model that predicts the pitch contour. But a classical model pre-trained on data from one domain (source), e.g, songs of a particular singer or genre, may not perform comparatively well in extracting melody from other domains (target). The performance of such models can be boosted by adapting the model using some annotated data in the target domain. In this work, we study various adaptation techniques applied to machine learning models for polyphonic melody extraction. Experimental results show that meta-learning-based adaptation performs better than simple fine-tuning. In addition to this, we find that this method outperforms the existing state-of-the-art non-adaptive polyphonic melody extraction algorithms.
Authors
(none)
Tags
Stats
Related papers
- Melody Extraction From Polyphonic Music By Deep Learning Approaches: A Review (2022)0.00
- Acoustic Modeling For Automatic Lyrics-to-audio Alignment (2019)8.60
- Towards Improving Harmonic Sensitivity And Prediction Stability For Singing Melody Extraction (2023)0.00
- Annotation-free Automatic Music Transcription With Scalable Synthetic Data And Adversarial Domain Confusion (2023)4.52
- Domain Adaptation For Formant Estimation Using Deep Learning (2016)0.00
- Learning To Adapt: A Meta-learning Approach For Speaker Adaptation (2018)9.76
- Semi-supervised Learning Using Teacher-student Models For Vocal Melody Extraction (2020)0.00
- MAJL: A Model-agnostic Joint Learning Framework For Music Source Separation And Pitch Estimation (2025)4.52