← all datasets

English

Emerging

33papers using it

2021first seen

🔎 Find this dataset

Papers using English (33)

LLM-based Generative Error Correction for Rare Words with Synthetic Data and Phonetic Context2025 · 3 cites

Breaking the Script Barrier: Enabling Automatic Alignment for PoS-based ASR Error Analysis in Non-Latin Scripts2026

Accent-Invariant Automatic Speech Recognition via Saliency-Driven Spectrogram Masking2025 · 2 cites

Word stress in self-supervised speech models: A cross-linguistic comparison2025 · 2 cites

Exploring Cross-Lingual Voice Conversion Methods for Anonymizing Low-Resource Text-to-Speech2026

Rethinking Entropy Allocation in LLM-based ASR: Understanding the Dynamics between Speech Encoders and LLMs2026

An Ultra-Low Latency, End-to-End Streaming Speech Synthesis Architecture via Block-Wise Generation and Depth-Wise Codec Decoding2026

Utterance-Level Methods for Identifying Reliable ASR-Output for Child Speech2026

findsylls: A Language-Agnostic Toolkit for Syllable-Level Speech Tokenization and Embedding2026

Unsupervised Cross-Lingual Part-of-Speech Tagging with Monolingual Corpora Only2026

E2E-VGuard: Adversarial Prevention for Production LLM-based End-To-End Speech Synthesis2025

Unsupervised lexicon learning from speech is limited by representations rather than clustering2025

Parallel GPT: Harmonizing the Independence and Interdependence of Acoustic and Semantic Information for Zero-Shot Text-to-Speech2025

SpeechDialogueFactory: Generating High-Quality Speech Dialogue Data to Accelerate Your Speech-LLM Development2025

Evaluating Standard and Dialectal Frisian ASR: Multilingual Fine-tuning and Language Identification for Improved Low-resource Performance2025

Leveraging supplementary text data to kick-start automatic speech recognition system development with limited transcriptions2023 · 2 cites

EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion2021 · 1 cites

Analysis of Voice Conversion and Code-Switching Synthesis Using VQ-VAE2022 · 1 cites

Analyzing Acoustic Word Embeddings from Pre-trained Self-supervised Speech Models2022 · 1 cites

TTS-Guided Training for Accent Conversion Without Parallel Data2022 · 1 cites

Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model2023 · 1 cites

Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition2024 · 1 cites

Advocating Character Error Rate for Multilingual ASR Evaluation2024 · 1 cites

STTATTS: Unified Speech-To-Text And Text-To-Speech Model2024 · 1 cites

Intent Classification Using Pre-trained Language Agnostic Embeddings For Low Resource Languages2021

Visually Grounded Keyword Detection and Localisation for Low-Resource Languages2023

Learning Multilingual Expressive Speech Representation for Prosody Prediction without Parallel Data2023

Zero Resource Cross-Lingual Part Of Speech Tagging2024

Contextualized Automatic Speech Recognition with Dynamic Vocabulary2024

VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech2024

Enhancing Polyglot Voices by Leveraging Cross-Lingual Fine-Tuning in Any-to-One Voice Conversion2024

Textless NLP -- Zero Resource Challenge with Low Resource Compute2024

LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech2023

English dataset — papers, benchmarks & downloads · Speech Audio