Optimization Of Dnn-based Speaker Verification Model Through Efficient Quantization Technique
2024 Β· Yeona Hong, Woo-Jin Chung, Hong-Goo Kang
Abstract
As Deep Neural Networks (DNNs) rapidly advance in various fields, including speech verification, they typically involve high computational costs and substantial memory consumption, which can be challenging to manage on mobile systems. Quantization of deep models offers a means to reduce both computational and memory expenses. Our research proposes an optimization framework for the quantization of the speaker verification model. By analyzing performance changes and model size reductions in each layer of a pre-trained speaker verification model, we have effectively minimized performance degradation while significantly reducing the model size. Our quantization algorithm is the first attempt to maintain the performance of the state-of-the-art pre-trained speaker verification model, ECAPATDNN, while significantly compressing its model size. Overall, our quantization approach resulted in reducing the model size by half, with an increase in EER limited to 0.07%.
Authors
(none)
Tags
Stats
Related papers
- Model Compression For Dnn-based Speaker Verification Using Weight Quantization (2022)3.58
- Towards Lightweight Speaker Verification Via Adaptive Neural Network Quantization (2024)5.84
- Dynamic Kernels And Channel Attention For Low Resource Speaker Verification (2022)0.00
- Small Footprint Text-independent Speaker Verification For Embedded Systems (2020)7.16
- Empirical Evaluation Of Deep Learning Model Compression Techniques On The Wavenet Vocoder (2020)0.00
- Mixed Precision Of Quantization Of Transformer Language Models For Speech Recognition (2021)8.09
- Quantization Of Acoustic Model Parameters In Automatic Speech Recognition Framework (2020)0.00
- DNN Based Speaker Recognition On Short Utterances (2016)0.00