Singmos: An Extensive Open-source Singing Voice Dataset For MOS Prediction
2024 Β· Yuxun Tang, Jiatong Shi, Yuning Wu, et al.
Abstract
In speech generation tasks, human subjective ratings, usually referred to as the opinion score, are considered the "gold standard" for speech quality evaluation, with the mean opinion score (MOS) serving as the primary evaluation metric. Due to the high cost of human annotation, several MOS prediction systems have emerged in the speech domain, demonstrating good performance. These MOS prediction models are trained using annotations from previous speech-related challenges. However, compared to the speech domain, the singing domain faces data scarcity and stricter copyright protections, leading to a lack of high-quality MOS-annotated datasets for singing. To address this, we propose SingMOS, a high-quality and diverse MOS dataset for singing, covering a range of Chinese and Japanese datasets. These synthesized vocals are generated using state-of-the-art models in singing synthesis, conversion, or resynthesis tasks and are rated by professional annotators alongside real vocals. Data analy
Authors
(none)
Tags
Stats
Related papers
- Singmos-pro: An Comprehensive Benchmark For Singing Quality Assessment (2025)0.00
- SOMOS: The Samsung Open MOS Dataset For The Evaluation Of Neural Text-to-speech Synthesis (2022)10.74
- The Voicemos Challenge 2023: Zero-shot Subjective Speech Quality Prediction For Multiple Domains (2023)11.19
- Opencpop: A High-quality Open Source Chinese Popular Song Corpus For Singing Voice Synthesis (2022)13.34
- Singing Voice Data Scaling-up: An Introduction To Ace-opencpop And Ace-kising (2024)15.48
- Pitch-and-spectrum-aware Singing Quality Assessment With Bias Correction And Model Fusion (2024)3.58
- SAMOS: A Neural MOS Prediction Model Leveraging Semantic Representations And Acoustic Features (2024)2.26
- A Comparison Of Deep Learning MOS Predictors For Speech Synthesis Quality (2022)6.34