SpeechCommands V-2

Emerging

6 papers using it

First seen: 2022

SpeechCommandsV2-C (SC2-C) serves as a benchmark for DHAuDS in the context of test-time adaptation for audio classification. A subset of SpeechCommands V2, SC2-C uses only test-set samples to address domain shift. Each sample is 1 second long, is sampled at 16 kHz, and belongs to one of 35 classes.
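The sample format described above (1 second at 16 kHz, 35 classes) implies 16,000 values per clip. The following is a minimal sketch of how a loader might enforce that shape; the helper names `fit_to_clip` and `is_valid_label` are hypothetical, and the choice to zero-pad short clips is an assumption rather than something the dataset description specifies.

```python
# Constants taken from the dataset description.
SAMPLE_RATE_HZ = 16_000
CLIP_SECONDS = 1
NUM_CLASSES = 35

# Number of audio samples expected in one clip: 16,000.
EXPECTED_LEN = SAMPLE_RATE_HZ * CLIP_SECONDS


def fit_to_clip(waveform):
    """Hypothetical helper: truncate or zero-pad a mono waveform to one clip.

    Assumption: clips shorter than 1 second are right-padded with zeros.
    """
    samples = list(waveform)
    if len(samples) >= EXPECTED_LEN:
        return samples[:EXPECTED_LEN]
    return samples + [0.0] * (EXPECTED_LEN - len(samples))


def is_valid_label(label_index):
    """Hypothetical helper: check that a class index is in [0, NUM_CLASSES)."""
    return 0 <= label_index < NUM_CLASSES


if __name__ == "__main__":
    short_clip = [0.1] * 12_000  # 0.75 s of audio at 16 kHz
    print(len(fit_to_clip(short_clip)))
    print(is_valid_label(34), is_valid_label(35))
```

Fixing the clip length up front lets batches be stacked without per-batch padding logic.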

Papers using SpeechCommands V-2 (6)

AMAuT: A Flexible and Efficient Multiview Audio Transformer Framework Trained from Scratch · 2025 · 1 citation

From Physics to Representation: Audio Learning with Synthetic Pre-training via Procedural Generation · 2026

Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model · 2022 · 11 citations

Cascaded Cross-Modal Transformer for Audio-Textual Classification · 2024 · 2 citations

On the Transferability of Large-Scale Self-Supervision to Few-Shot Audio Classification · 2024

Episodic fine-tuning prototypical networks for optimization-based few-shot learning: Application to audio classification · 2024