Singing Voice Separation And Vocal F0 Estimation Based On Mutual Combination Of Robust Principal Component Analysis And Subharmonic Summation
2016 Β· Yukara Ikemiya, Katsutoshi Itoyama, Kazuyoshi Yoshii
Abstract
This paper presents a new method of singing voice analysis that performs mutually-dependent singing voice separation and vocal fundamental frequency (F0) estimation. Vocal F0 estimation is considered to become easier if singing voices can be separated from a music audio signal, and vocal F0 contours are useful for singing voice separation. This calls for an approach that improves the performance of each of these tasks by using the results of the other. The proposed method first performs robust principal component analysis (RPCA) for roughly extracting singing voices from a target music audio signal. The F0 contour of the main melody is then estimated from the separated singing voices by finding the optimal temporal path over an F0 saliency spectrogram. Finally, the singing voices are separated again more accurately by combining a conventional time-frequency mask given by RPCA with another mask that passes only the harmonic structures of the estimated F0s. Experimental results showed th
Authors
(none)
Tags
Stats
Related papers
- Investigation Of Singing Voice Separation For Singing Voice Detection In Polyphonic Music (2020)5.84
- Multiple F0 Estimation In Vocal Ensembles Using Convolutional Neural Networks (2020)0.00
- A Vocoder Based Method For Singing Voice Extraction (2019)5.24
- Single-channel Blind Source Separation For Singing Voice Detection: A Comparative Study (2018)0.00
- Nebula: F0 Estimation And Voicing Detection By Modeling The Statistical Properties Of Feature Extractors (2017)3.58
- Noisy Speech Based Temporal Decomposition To Improve Fundamental Frequency Estimation (2021)5.24
- Robustsvc: Hubert-based Melody Extractor And Adversarial Learning For Robust Singing Voice Conversion (2024)3.58
- Medleyvox: An Evaluation Dataset For Multiple Singing Voices Separation (2022)10.63