Towards Speaker Identification With Minimal Dataset And Constrained Resources Using 1d-convolution Neural Network
2024 Β· Irfan Nafiz Shahan, Pulok Ahmed Auvi
Abstract
Voice recognition and speaker identification are vital for applications in security and personal assistants. This paper presents a lightweight 1D-Convolutional Neural Network (1D-CNN) designed to perform speaker identification on minimal datasets. Our approach achieves a validation accuracy of 97.87%, leveraging data augmentation techniques to handle background noise and limited training samples. Future improvements include testing on larger datasets and integrating transfer learning methods to enhance generalizability. We provide all code, the custom dataset, and the trained models to facilitate reproducibility. These resources are available on our GitHub repository: https://github.com/IrfanNafiz/RecMe.
Authors
(none)
Tags
Stats
Code
Related papers
- Voxceleb2: Deep Speaker Recognition (2018)23.96
- Speakernet: 1D Depth-wise Separable Convolutional Network For Text-independent Speaker Recognition And Verification (2020)0.00
- Training Speaker Recognition Systems With Limited Data (2022)8.13
- Unified Hypersphere Embedding For Speaker Recognition (2018)0.00
- Neural Network Based Speaker Classification And Verification Systems With Enhanced Features (2017)8.60
- Few-shot Speaker Identification Using Depthwise Separable Convolutional Network With Channel Attention (2022)5.24
- Voxceleb: A Large-scale Speaker Identification Dataset (2017)23.55
- Speaker Verification Using Convolutional Neural Networks (2018)0.00