WHAM
Emerging16papers using it
2022first seen
The 'WHAM!' dataset is a benchmark that contains noisy speech mixtures used to evaluate the performance of speech separation and denoising systems.
Papers using WHAM (16)
- Exploring Self-attention Mechanisms For Speech SeparationNoise-aware Speech Separation With Contrastive LearningRing Mixing with Auxiliary Signal-to-Consistency-Error Ratio Loss for Unsupervised Denoising in Speech SeparationA Study of the Scale Invariant Signal to Distortion Ratio in Speech Separation with Noisy ReferencesDynamic Slimmable Networks for Efficient Speech SeparationAttractor-Based Speech Separation of Multiple Utterances by Unknown Number of SpeakersListen to Extract: Onset-Prompted Target Speaker ExtractionAudio-visual Speech Separation In Noisy Environments With A Lightweight Iterative ModelMulti-dimensional And Multi-scale Modeling For Speech Separation Optimized By Discriminative LearningResource-Efficient Separation TransformerSPMamba: State-space model is all you need in speech separationMulti-Dimensional and Multi-Scale Modeling for Speech Separation
Optimized by Discriminative LearningNoise-Aware Speech Separation with Contrastive LearningMossFormer2: Combining Transformer and RNN-Free Recurrent Network for
Enhanced Time-Domain Monaural Speech SeparationUSEF-TSE: Universal Speaker Embedding Free Target Speaker ExtractionAudio-Visual Speech Separation in Noisy Environments with a Lightweight
Iterative Model