Towards Lightweight Speaker Verification Via Adaptive Neural Network Quantization
2024 Β· Bei Liu, Haoyu Wang, Yanmin Qian
Abstract
Modern speaker verification (SV) systems typically demand expensive storage and computing resources, thereby hindering their deployment on mobile devices. In this paper, we explore adaptive neural network quantization for lightweight speaker verification. Firstly, we propose a novel adaptive uniform precision quantization method which enables the dynamic generation of quantization centroids customized for each network layer based on k-means clustering. By applying it to the pre-trained SV systems, we obtain a series of quantized variants with different bit widths. To enhance the performance of low-bit quantized models, a mixed precision quantization algorithm along with a multi-stage fine-tuning (MSFT) strategy is further introduced. Unlike uniform precision quantization, mixed precision approach allows for the assignment of varying bit widths to different network layers. When bit combination is determined, MSFT is employed to progressively quantize and fine-tune network in a specific
Authors
(none)
Tags
Stats
Related papers
- Optimization Of Dnn-based Speaker Verification Model Through Efficient Quantization Technique (2024)0.00
- Model Compression For Dnn-based Speaker Verification Using Weight Quantization (2022)3.58
- A Model For Every User And Budget: Label-free And Personalized Mixed-precision Quantization (2023)0.00
- Small Footprint Text-independent Speaker Verification For Embedded Systems (2020)7.16
- Neuralmultiling: A Novel Neural Architecture Search For Smartphone Based Multilingual Speaker Verification (2024)0.00
- Memory-efficient Training For Deep Speaker Embedding Learning In Speaker Verification (2024)2.26
- Rsknet-mtsp: Effective And Portable Deep Architecture For Speaker Verification (2021)9.03
- Neural Network Based Speaker Classification And Verification Systems With Enhanced Features (2017)8.60