Realistic Multi-microphone Data Simulation For Distant Speech Recognition
2017 · Mirco Ravanelli, Piergiorgio Svaizer, Maurizio Omologo
Abstract
The availability of realistic simulated corpora is of key importance for the future progress of distant speech recognition technology. The reliability, flexibility and low computational cost of a data simulation process may ultimately allow researchers to train, tune and test different techniques in a variety of acoustic scenarios, avoiding the laborious effort of directly recording real data from the targeted environment. In the last decade, several simulated corpora have been released to the research community, including the datasets distributed in the context of projects and international challenges, such as CHiME and REVERB. These efforts were extremely useful to derive baselines and common evaluation frameworks for comparison purposes. At the same time, in many cases they highlighted the need for better coherence between real and simulated conditions. In this paper, we examine this issue and we describe our approach to the generation of realistic corpora in a domestic context.
Related papers
- Property-aware Multi-speaker Data Simulation: A Probabilistic Modelling Technique For Synthetic Data Generation (2023) · 6.34
- The DIRHA-English Corpus And Related Tasks For Distant-speech Recognition In Domestic Environments (2017) · 10.35
- Deep Learning For Distant Speech Recognition (2017) · 0.00
- Multi-speaker And Wide-band Simulated Conversations As Training Data For End-to-end Neural Diarization (2022) · 8.60
- LibriheavyMix: A 20,000-hour Dataset For Single-channel Reverberant Multi-talker Speech Separation, ASR And Speaker Diarization (2024) · 5.24
- 3D-Speaker: A Large-scale Multi-device, Multi-distance, And Multi-dialect Corpus For Speech Representation Disentanglement (2023) · 0.00
- Frequency Domain Multi-channel Acoustic Modeling For Distant Speech Recognition (2019) · 9.92
- A Network Of Deep Neural Networks For Distant Speech Recognition (2017) · 10.35