AISHELL-6-Whisper

Emerging

2papers using it

2025first seen

🗣️ AISHELL6-Whisper AISHELL6-Whisper is a large-scale open-source Chinese Mandarin audio-visual whisper speech dataset,containing 30 hours each of whisper and parallel normal speech, with synchronized frontal RGB facial videos. 📘 Dataset Summary Property Description Language Chinese (Mandarin, ZH) License CC BY-NC-SA

🔎 Find this dataset

Papers using AISHELL-6-Whisper (2)

AISHELL6-whisper: A Chinese Mandarin Audio-visual Whisper Speech Dataset with Speech Recognition Baselines2025 · 6 cites

WhisperVC: Decoupled Cross-Domain Alignment and Speech Generation for Low-Resource Whisper-to-Normal Conversion2025