WHAM

Emerging

15 papers using it

First seen: 2021

The WHAM! dataset is a benchmark of speech mixtures with added background noise, used to evaluate noisy speech separation systems.

Papers using WHAM (15)

Attractor-Based Speech Separation of Multiple Utterances by Unknown Number of Speakers · 2025 · 1 cite

Ring Mixing with Auxiliary Signal-to-Consistency-Error Ratio Loss for Unsupervised Denoising in Speech Separation · 2026

A Study of the Scale Invariant Signal to Distortion Ratio in Speech Separation with Noisy References · 2025

Dynamic Slimmable Networks for Efficient Speech Separation · 2025

Listen to Extract: Onset-Prompted Target Speaker Extraction · 2025

MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions · 2023 · 64 cites

Resource-Efficient Separation Transformer · 2022 · 11 cites

SPMamba: State-space model is all you need in speech separation · 2024 · 8 cites

Exploring Self-Attention Mechanisms for Speech Separation · 2022 · 2 cites

Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning · 2023 · 1 cite

Noise-Aware Speech Separation with Contrastive Learning · 2023 · 1 cite

MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation · 2023 · 1 cite

USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction · 2024 · 1 cite

Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model · 2023

Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain · 2021