← all datasets

Callhome

Emerging

29papers using it

2021first seen

Dataset Card for the Callhome dataset for speaker diarization The CALLHOME Corpus is a collection of unscripted telephone conversations between native speakers in Chinese, English, German, Japanese and Spanish. This is a processed version of the original Callhome dataset from the TalkBank corpora taken from here. It co

🔎 Find this dataset

Papers using Callhome (29)

MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition2023 · 5 cites

LibriConvo: Simulating Conversations from Read Literature for ASR and Diarization2025 · 1 cites

Whisper Speaker Identification: Leveraging Pre-Trained Multilingual Transformers for Robust Speaker Embeddings2025 · 1 cites

Speaker-Aware Simulation Improves Conversational Speech Recognition2026

O-EENC-SD: Efficient Online End-to-End Neural Clustering for Speaker Diarization2025

LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models Using in-the-wild Data2025

Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection2025

SEAL: Speaker Error Correction using Acoustic-conditioned Large Language Models2025

Tight integration of neural- and clustering-based diarization through deep unfolding of infinite Gaussian mixture model2022 · 13 cites

Improving Transformer-based End-to-End Speaker Diarization by Assigning Auxiliary Losses to Attention Heads2023 · 8 cites

Multi-scale Speaker Diarization with Dynamic Scale Weighting2022 · 4 cites

Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors2021 · 1 cites

Low-Latency Speech Separation Guided Diarization for Telephone Conversations2022 · 1 cites

Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization2022 · 1 cites

Neural Diarization with Non-autoregressive Intermediate Attractors2023 · 1 cites

Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator2023 · 1 cites

Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer2023 · 1 cites

Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation2023 · 1 cites

DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors2023 · 1 cites

Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models2022

Generation of Speaker Representations Using Heterogeneous Training Batch Assembly2022

Improving the Training Recipe for a Robust Conformer-based Hybrid Model2022

Utterance-by-utterance overlap-aware neural diarization with Graph-PIT2022

USED: Universal Speaker Extraction and Diarization2023

Semi-Autoregressive Streaming ASR With Label Context2023

Automatic Speech Recognition System-Independent Word Error Rate Estimation2024

Leveraging Speaker Embeddings in End-to-End Neural Diarization for Two-Speaker Scenarios2024

Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders2024

LS-EEND: Long-Form Streaming End-to-End Neural Diarization with Online Attractor Extraction2024

Callhome dataset — papers, benchmarks & downloads · Speech Audio