Medleyvox: An Evaluation Dataset For Multiple Singing Voices Separation
2022 Β· Chang-Bin Jeon, Hyeongi Moon, Keunwoo Choi, et al.
Abstract
Separation of multiple singing voices into each voice is a rarely studied area in music source separation research. The absence of a benchmark dataset has hindered its progress. In this paper, we present an evaluation dataset and provide baseline studies for multiple singing voices separation. First, we introduce MedleyVox, an evaluation dataset for multiple singing voices separation. We specify the problem definition in this dataset by categorizing it into i) unison, ii) duet, iii) main vs. rest, and iv) N-singing separation. Second, to overcome the absence of existing multi-singing datasets for a training purpose, we present a strategy for construction of multiple singing mixtures using various single-singing datasets. Third, we propose the improved super-resolution network (iSRNet), which greatly enhances initial estimates of separation networks. Jointly trained with the Conv-TasNet and the multi-singing mixture construction strategy, the proposed iSRNet achieved comparable performa
Authors
(none)
Tags
Stats
Related papers
- Investigation Of Singing Voice Separation For Singing Voice Detection In Polyphonic Music (2020)5.84
- Jointly Detecting And Separating Singing Voice: A Multi-task Approach (2018)7.81
- Singing Voice Separation: A Study On Training Data (2019)10.07
- Mad Twinnet: Masker-denoiser Architecture With Twin Networks For Monaural Sound Source Separation (2018)0.00
- Jointly Recognizing Speech And Singing Voices Based On Multi-task Audio Source Separation (2024)2.26
- Multi-band Multi-resolution Fully Convolutional Neural Networks For Singing Voice Separation (2019)5.84
- Zero-shot Duet Singing Voices Separation With Diffusion Models (2023)3.01
- Investigating U-nets With Various Intermediate Blocks For Spectrogram-based Singing Voice Separation (2019)0.00