SpeechCommands V-2
Emerging, 5 papers using it
First seen: 2024
SpeechCommands V2 is a benchmark dataset of spoken commands, used to evaluate audio classification models.
Papers using SpeechCommands V-2 (5)
- On The Transferability Of Large-scale Self-supervision To Few-shot Audio Classification
- AMAuT: A Flexible and Efficient Multiview Audio Transformer Framework Trained from Scratch
- Cascaded Cross-Modal Transformer for Audio-Textual Classification
- On the Transferability of Large-Scale Self-Supervision to Few-Shot Audio Classification
- Episodic fine-tuning prototypical networks for optimization-based few-shot learning: Application to audio classification