Complex-valued Restricted Boltzmann Machine For Direct Speech Parameterization From Complex Spectra
2018 · Toru Nakashika, Shinji Takaki, Junichi Yamagishi
Abstract
This paper describes a novel energy-based probabilistic distribution that represents complex-valued data and explains how to apply it to direct feature extraction from complex-valued spectra. The proposed model, the complex-valued restricted Boltzmann machine (CRBM), is designed to deal with complex-valued visible units as an extension of the well-known restricted Boltzmann machine (RBM). Like the RBM, the CRBM learns the relationships between visible and hidden units without having connections between units in the same layer, which dramatically improves training efficiency by using Gibbs sampling or contrastive divergence (CD). Another important characteristic is that the CRBM also has connections between real and imaginary parts of each of the complex-valued visible units that help represent the data distribution in the complex domain. In speech signal processing, classification and generation features are often based on amplitude spectra (e.g., MFCC, cepstra, and mel-cepstra) even i
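The contrastive divergence (CD) training that the abstract mentions can be illustrated on the ordinary binary RBM that the CRBM extends. The following is a minimal sketch, not the paper's CRBM: it assumes binary visible and hidden units, and the names `cd1_step`, `W`, `b`, `c`, and `lr` are hypothetical choices made for illustration. It omits the complex-valued visible units and the connections between real and imaginary parts.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def cd1_step(v0, W, b, c, rng, lr=0.1):
    """One CD-1 update for a standard binary RBM (assumption: not the CRBM).

    v0 : (batch, n_visible) binary data
    W  : (n_visible, n_hidden) weights
    b  : (n_visible,) visible biases
    c  : (n_hidden,) hidden biases
    """
    # Positive phase: hidden units are conditionally independent given v0,
    # because there are no connections within a layer.
    ph0 = sigmoid(v0 @ W + c)
    h0 = (rng.random(ph0.shape) < ph0).astype(v0.dtype)

    # Negative phase: a single Gibbs step back to the visible layer and up again.
    pv1 = sigmoid(h0 @ W.T + b)
    v1 = (rng.random(pv1.shape) < pv1).astype(v0.dtype)
    ph1 = sigmoid(v1 @ W + c)

    # Approximate gradient: data statistics minus reconstruction statistics.
    n = v0.shape[0]
    W_new = W + lr * (v0.T @ ph0 - v1.T @ ph1) / n
    b_new = b + lr * (v0 - v1).mean(axis=0)
    c_new = c + lr * (ph0 - ph1).mean(axis=0)
    return W_new, b_new, c_new
```

Because each layer factorizes given the other, both sampling steps are single matrix products, which is the efficiency property the abstract attributes to the absence of intra-layer connections.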
Related papers
- Complex Frequency Domain Linear Prediction: A Tool To Compute Modulation Spectrum Of Speech (2022) 3.58
- Enhanced Factored Three-way Restricted Boltzmann Machines For Speech Detection (2016) 0.00
- Complex Recurrent Variational Autoencoder With Application To Speech Enhancement (2022) 0.00
- Single-channel Speech Enhancement With Deep Complex U-networks And Probabilistic Latent Space Models (2023) 5.24
- Complex Spectral Mapping With Attention Based Convolution Recurrent Neural Network For Speech Enhancement (2021) 0.00
- DCCRN: Deep Complex Convolution Recurrent Network For Phase-aware Speech Enhancement (2020) 20.78
- Phase Aware Speech Enhancement Using Realisation Of Complex-valued LSTM (2020) 0.00
- A Deep Representation Learning-based Speech Enhancement Method Using Complex Convolution Recurrent Variational Autoencoder (2023) 7.16