AISHELL-3: A Multi-speaker Mandarin TTS Corpus And The Baselines
2020 · Yao Shi, Hui Bu, Xin Xu, et al.
Abstract
In this paper, we present AISHELL-3, a large-scale, high-fidelity multi-speaker Mandarin speech corpus that can be used to train multi-speaker Text-to-Speech (TTS) systems. The corpus contains roughly 85 hours of emotion-neutral recordings spoken by 218 native Mandarin Chinese speakers. Auxiliary speaker attributes such as gender, age group, and native accent are explicitly annotated and provided in the corpus. Transcripts at both the Chinese character level and the pinyin level are provided along with the recordings. We present a baseline system that uses AISHELL-3 for multi-speaker Mandarin speech synthesis. The system is an extension of Tacotron-2 in which a speaker verification model and a corresponding voice-similarity loss are incorporated as a feedback constraint. We aim to use the presented corpus to build a robust synthesis model that is able to achieve zero-shot voice cloning. The system trained on this dataset also generalizes well on
Related papers
- WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus For Large Speech Generation Model Benchmark (2024)
- Cross-lingual Multi-speaker Text-to-speech Synthesis For Voice Cloning Without Using Parallel Corpus For Unseen Speakers (2019)
- TALCS: An Open-source Mandarin-English Code-switching Corpus And A Speech Recognition Baseline (2022)
- MSceneSpeech: A Multi-scene Speech Dataset For Expressive Speech Synthesis (2024)
- Towards Natural Bilingual And Code-switched Speech Synthesis Based On Mix Of Monolingual Recordings And Cross-lingual Voice Conversion (2020)
- The THU-HCSI Multi-speaker Multi-lingual Few-shot Voice Cloning System For LIMMITS'24 Challenge (2024)
- Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation (2024)
- MnTTS2: An Open-source Multi-speaker Mongolian Text-to-speech Synthesis Dataset (2022)