CallHome
Emerging28papers using it
2022first seen
Papers using CallHome (27)
- Unified Modeling Of Multi-talker Overlapped Speech Recognition And Diarization With A Sidecar SeparatorImproving Transformer-based End-to-end Speaker Diarization By Assigning Auxiliary Losses To Attention HeadsMixRep: Hidden Representation Mixup for Low-Resource Speech RecognitionLibriConvo: Simulating Conversations from Read Literature for ASR and DiarizationWhisper Speaker Identification: Leveraging Pre-Trained Multilingual
Transformers for Robust Speaker EmbeddingsTowards Unsupervised Speaker Diarization System For Multilingual Telephone Calls Using Pre-trained Whisper Model And Mixture Of Sparse AutoencodersSpeaker-Aware Simulation Improves Conversational Speech RecognitionO-EENC-SD: Efficient Online End-to-End Neural Clustering for Speaker DiarizationLESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models Using in-the-wild DataWhisper Speaker Identification: Leveraging Pre-trained Multilingual Transformers For Robust Speaker EmbeddingsSEAL: Speaker Error Correction Using Acoustic-conditioned Large Language ModelsUniversal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity DetectionSEAL: Speaker Error Correction using Acoustic-conditioned Large Language
ModelsLeveraging Speaker Embeddings In End-to-end Neural Diarization For Two-speaker ScenariosGeneration Of Speaker Representations Using Heterogeneous Training Batch AssemblyTarget Speaker Voice Activity Detection with Transformers and Its
Integration with End-to-End Neural DiarizationNeural Diarization with Non-autoregressive Intermediate AttractorsUnified Modeling of Multi-Talker Overlapped Speech Recognition and
Diarization with a Sidecar SeparatorDecoder-only Architecture for Speech Recognition with CTC Prompts and
Text Data AugmentationDiaPer: End-to-End Neural Diarization with Perceiver-Based AttractorsImproving the Training Recipe for a Robust Conformer-based Hybrid ModelImproving Transformer-based End-to-End Speaker Diarization by Assigning
Auxiliary Losses to Attention HeadsUSED: Universal Speaker Extraction and DiarizationAutomatic Speech Recognition System-Independent Word Error Rate
EstimationLeveraging Speaker Embeddings in End-to-End Neural Diarization for
Two-Speaker ScenariosTowards Unsupervised Speaker Diarization System for Multilingual
Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse
AutoencodersLS-EEND: Long-Form Streaming End-to-End Neural Diarization with Online Attractor Extraction