A Data-driven Approach To Smooth Pitch Correction For Singing Voice In Pop Music
2018 Β· Sanna Wager, Lijiang Guo, Aswin Sivaraman, et al.
Abstract
In this paper, we present a machine-learning approach to pitch correction for voice in a karaoke setting, where the vocals and accompaniment are on separate tracks and time-aligned. The network takes as input the time-frequency representation of the two tracks and predicts the amount of pitch-shifting in cents required to make the voice sound in-tune with the accompaniment. It is trained on examples of semi-professional singing. The proposed approach differs from existing real-time pitch correction methods by replacing pitch tracking and mapping to a discrete set of notes---for example, the twelve classes of the equal-tempered scale---with learning a correction that is continuous both in frequency and in time directly from the harmonics of the vocal and accompaniment tracks. A Recurrent Neural Network (RNN) model provides a correction that takes context into account, preserving expressive pitch bending and vibrato. This method can be extended into unsupervised pitch correction of a voc
Authors
(none)
Tags
Stats
Related papers
- Karatuner: Towards End To End Natural Pitch Correction For Singing Voice In Karaoke (2021)5.24
- Pitchnet: Unsupervised Singing Voice Conversion With Pitch Adversarial Network (2019)10.97
- Human Voice Pitch Estimation: A Convolutional Network With Auto-labeled And Synthetic Data (2023)0.00
- Singing Voice Correction Using Canonical Time Warping (2017)5.84
- Toward Expressive Singing Voice Correction: On Perceptual Validity Of Evaluation Metrics For Vocal Melody Extraction (2020)0.00
- Traditional Machine Learning For Pitch Detection (2019)10.85
- Improved Singing Voice Separation With Chromagram-based Pitch-aware Remixing (2022)7.50
- Primadnn': A Characteristics-aware DNN Customization For Singing Technique Detection (2023)0.00