Toward Expressive Singing Voice Correction: On Perceptual Validity Of Evaluation Metrics For Vocal Melody Extraction
2020 · Yin-Jyun Luo, Yuen-Jen Lin, Li Su
Abstract
Singing voice correction (SVC) is an appealing application for amateur singers. Commercial products automate SVC by snapping pitch contours to equal-tempered scales, which could lead to deadpan modifications. Together with the neglect of rhythmic errors, extensive manual corrections are still necessary. In this paper, we present a streamlined system to automate expressive SVC for both pitch and rhythmic errors. Particularly, we extend a previous work by integrating advanced techniques for singing voice separation (SVS) and vocal melody extraction. SVC is achieved by temporally aligning the source-target pair, followed by replacing pitch and rhythm of the source with those of the target. We evaluate the framework by a comparative study for melody extraction which involves both subjective and objective evaluations, whereby we investigate perceptual validity of the standard metrics through the lens of SVC. The results suggest that the high pitch accuracy obtained by the metrics does not s
Related papers
- RobustSVC: HuBERT-based Melody Extractor And Adversarial Learning For Robust Singing Voice Conversion (2024)
- Singing Voice Conversion With Accompaniment Using Self-supervised Representation-based Melody Features (2025)
- SYKI-SVC: Advancing Singing Voice Conversion With Post-processing Innovations And An Open-source Professional Testset (2025)
- Singing Voice Correction Using Canonical Time Warping (2017)
- A Comparative Analysis Of Latent Regressor Losses For Singing Voice Conversion (2023)
- Leveraging Diverse Semantic-based Audio Pretrained Models For Singing Voice Conversion (2023)
- A Data-driven Approach To Smooth Pitch Correction For Singing Voice In Pop Music (2018)
- LHQ-SVC: Lightweight And High Quality Singing Voice Conversion Modeling (2024)