Google Speech Commands
Canonical34papers using it
2022first seen
The 'Google Speech Commands' dataset contains a collection of spoken commands used to evaluate keyword spotting in audio recognition systems.
Papers using Google Speech Commands (34)
- Few-shot Open-set Learning For On-device Customization Of Keyword Spotting SystemsImproving Vision-inspired Keyword Spotting Using Dynamic Module Skipping In Streaming Conformer EncoderDifferential Evolution Algorithm Based Hyper-parameters Selection Of Convolutional Neural Network For Speech Command RecognitionVIC-KD: Variance-invariance-covariance Knowledge Distillation To Make Keyword Spotting More Robust Against Adversarial AttacksPractical Bayesian Inference for Speech SNNs: Uncertainty and Loss-Landscape SmoothingWhisper-AuT: Domain-Adapted Audio Encoder for Efficient Audio-LLM TrainingWaveSSM: Multiscale State-Space Models for Non-stationary Signal AttentionSW-ASR: A Context-Aware Hybrid ASR Pipeline for Robust Single Word Speech RecognitionSpeech Command Recognition Using LogNNet Reservoir Computing for Embedded SystemsKeyword Mamba: Spoken Keyword Spotting with State Space ModelsLLM-Synth4KWS: Scalable Automatic Generation and Synthesis of Confusable Data for Custom Keyword SpottingText-Aware Adapter for Few-Shot Keyword SpottingSelf-supervised Speech Representation Learning For Keyword-spotting With Light-weight TransformersBoosting Keyword Spotting Through On-device Learnable User Speech CharacteristicsReparameterized Multi-resolution Convolutions For Long Sequence ModellingConvolutional Variational Autoencoders For Spectrogram Compression In Automatic Speech RecognitionEnhancing Synthetic Training Data For Speech Commands: From Asr-based Filtering To Domain Adaptation In SSL Latent SpaceFilterbank Learning for Noise-Robust Small-Footprint Keyword SpottingSelf-supervised speech representation learning for keyword-spotting with
light-weight transformersContrastive Speech Mixup for Low-resource Keyword SpottingBoosting keyword spotting through on-device learnable user speech
characteristicsDamage Control During Domain Adaptation for Transducer Based Automatic
Speech RecognitionGAN You Hear Me? Reclaiming Unconditional Speech Synthesis from
Diffusion ModelsMAST: Multiscale Audio Spectrogram TransformersSpot keywords from very noisy and mixed speechFew-Shot Open-Set Learning for On-Device Customization of KeyWord
Spotting SystemsImproving vision-inspired keyword spotting using dynamic module skipping
in streaming conformer encoderVIC-KD: Variance-Invariance-Covariance Knowledge Distillation to Make
Keyword Spotting More Robust Against Adversarial AttacksDifferential Evolution Algorithm based Hyper-Parameters Selection of
Convolutional Neural Network for Speech Command RecognitionED-sKWS: Early-Decision Spiking Neural Networks for Rapid,and
Energy-Efficient Keyword SpottingFew-Shot Keyword Spotting from Mixed SpeechReparameterized Multi-Resolution Convolutions for Long Sequence
ModellingEnhancing Synthetic Training Data for Speech Commands: From ASR-Based
Filtering to Domain Adaptation in SSL Latent SpaceConvolutional Variational Autoencoders for Spectrogram Compression in
Automatic Speech Recognition