Singing Voice Correction Using Canonical Time Warping
2017 Β· Yin-Jyun Luo, Ming-Tso Chen, Tai-Shih Chi, et al.
Abstract
Expressive singing voice correction is an appealing but challenging problem. A robust time-warping algorithm which synchronizes two singing recordings can provide a promising solution. We thereby propose to address the problem by canonical time warping (CTW) which aligns amateur singing recordings to professional ones. A new pitch contour is generated given the alignment information, and a pitch-corrected singing is synthesized back through the vocoder. The objective evaluation shows that CTW is robust against pitch-shifting and time-stretching effects, and the subjective test demonstrates that CTW prevails the other methods including DTW and the commercial auto-tuning software. Finally, we demonstrate the applicability of the proposed method in a practical, real-world scenario.
Authors
(none)
Tags
Stats
Related papers
- Toward Expressive Singing Voice Correction: On Perceptual Validity Of Evaluation Metrics For Vocal Melody Extraction (2020)0.00
- A Data-driven Approach To Smooth Pitch Correction For Singing Voice In Pop Music (2018)0.00
- Karatuner: Towards End To End Natural Pitch Correction For Singing Voice In Karaoke (2021)5.24
- Stabilizing Training With Soft Dynamic Time Warping: A Case Study For Pitch Class Estimation With Weakly Aligned Targets (2023)0.00
- Singing Voice Conversion With Non-parallel Data (2019)9.59
- Vits-based Singing Voice Conversion Leveraging Whisper And Multi-scale F0 Modeling (2023)0.00
- Improving Adversarial Waveform Generation Based Singing Voice Conversion With Harmonic Signals (2022)7.50
- SYKI-SVC: Advancing Singing Voice Conversion With Post-processing Innovations And An Open-source Professional Testset (2025)4.52