← all datasets

Libri-2Mix

Emerging

32papers using it

2022first seen

Libri-2Mix is a dataset used to evaluate Target Speaker Extraction (TSE) performance in mixed speech scenarios.

🔎 Find this dataset

Papers using Libri-2Mix (32)

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline2025 · 12 cites

Unifying Diarization, Separation, and ASR with Multi-Speaker Encoder2025 · 6 cites

SEF-PNet: Speaker Encoder-Free Personalized Speech Enhancement with Local and Global Contexts Aggregation2025 · 1 cites

Towards Streaming Target Speaker Extraction via Chunk-wise Interleaved Splicing of Autoregressive Language Model2026

AlphaFlowTSE: One-Step Generative Target Speaker Extraction via Conditional AlphaFlow2026

TripleC Learning and Lightweight Speech Enhancement for Multi-Condition Target Speech Extraction2025

MeanFlow-TSE: One-Step Generative Target Speaker Extraction with Mean Flow2025

GenTSE: Enhancing Target Speaker Extraction via a Coarse-to-Fine Generative Language Model2025

A Study of the Scale Invariant Signal to Distortion Ratio in Speech Separation with Noisy References2025

Lightweight speech enhancement guided target speech extraction in noisy multi-speaker scenarios2025

Elevating Robust Multi-Talker ASR by Decoupling Speaker Separation and Speech Recognition2025

SPMamba: State-space model is all you need in speech separation2024 · 8 cites

On Data Sampling Strategies for Training Neural Network Speech Separation Models2023 · 6 cites

Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction2022 · 3 cites

Adapting self-supervised models to multi-talker speech recognition using speaker embeddings2022 · 2 cites

Scaling strategies for on-device low-complexity source separation with Conv-Tasnet2023 · 2 cites

SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR2024 · 2 cites

Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation2023 · 1 cites

Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition2023 · 1 cites

MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation2023 · 1 cites

MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation2023 · 1 cites

Investigating self-supervised learning for speech enhancement and separation2022

MVNet: Memory Assistance and Vocal Reinforcement Network for Speech Enhancement2022

AudioSlots: A slot-centric generative model for audio separation2023

Target Speech Extraction with Conditional Diffusion Model2023

SPGM: Prioritizing Local Features for enhanced speech separation performance2023

Probing Self-supervised Learning Models with Target Speech Extraction2024

Noise-robust Speech Separation with Fast Generative Correction2024

On the effectiveness of enrollment speech augmentation for Target Speaker Extraction2024

Wanna hear your voice? A sample is all we need!2024

Multi-Level Speaker Representation for Target Speaker Extraction2024

U-Mamba-Net: A highly efficient Mamba-based U-net style network for noisy and reverberant speech separation2024

Libri-2Mix dataset — papers, benchmarks & downloads · Speech Audio