Rawnet: Advanced End-to-end Deep Neural Network Using Raw Waveforms For Text-independent Speaker Verification
2019 Β· Jee-Weon Jung, Hee-Soo Heo, Ju-Ho Kim, et al.
Abstract
Recently, direct modeling of raw waveforms using deep neural networks has been widely studied for a number of tasks in audio domains. In speaker verification, however, utilization of raw waveforms is in its preliminary phase, requiring further investigation. In this study, we explore end-to-end deep neural networks that input raw waveforms to improve various aspects: front-end speaker embedding extraction including model architecture, pre-training scheme, additional objective functions, and back-end classification. Adjustment of model architecture using a pre-training scheme can extract speaker embeddings, giving a significant improvement in performance. Additional objective functions simplify the process of extracting speaker embeddings by merging conventional two-phase processes: extracting utterance-level features such as i-vectors or x-vectors and the feature enhancement phase, e.g., linear discriminant analysis. Effective back-end classification models that suit the proposed speak
Authors
(none)
Tags
Stats
Related papers
- Improved Rawnet With Feature Map Scaling For Text-independent Speaker Verification Using Raw Waveforms (2020)14.15
- Mr-rawnet: Speaker Verification System With Multiple Temporal Resolutions For Variable Duration Utterances Using Raw Waveforms (2024)2.26
- Rawnext: Speaker Verification System For Variable-duration Utterances With Deep Layer Aggregation And Extended Dynamic Scaling Policies (2021)12.24
- Rawnet: Fast End-to-end Neural Vocoder (2019)0.00
- FDN: Finite Difference Network With Hierarchical Convolutional Features For Text-independent Speaker Verification (2021)0.00
- Complementing Handcrafted Features With Raw Waveform Using A Light-weight Auxiliary Model (2021)0.00
- Speech And Speaker Recognition From Raw Waveform With Sincnet (2018)0.00
- Wavenet: A Generative Model For Raw Audio (2016)0.00