← all datasets

AISHELL-6-Whisper

Emerging
2papers using it
2025first seen

πŸ—£οΈ AISHELL6-Whisper AISHELL6-Whisper is a large-scale open-source Chinese Mandarin audio-visual whisper speech dataset,containing 30 hours each of whisper and parallel normal speech, with synchronized frontal RGB facial videos. πŸ“˜ Dataset Summary Property Description Language Chinese (Mandarin, ZH) License CC BY-NC-SA

Papers using AISHELL-6-Whisper (2)

AISHELL-6-Whisper β€” datasets β€” speech-audio