Similarity-and-independence-aware Beamformer: Method For Target Source Extraction Using Magnitude Spectrogram As Reference
2020 Β· Atsuo Hiroe
Abstract
This study presents a novel method for source extraction, referred to as the similarity-and-independence-aware beamformer (SIBF). The SIBF extracts the target signal using a rough magnitude spectrogram as the reference signal. The advantage of the SIBF is that it can obtain an accurate target signal, compared to the spectrogram generated by target-enhancing methods such as the speech enhancement based on deep neural networks (DNNs). For the extraction, we extend the framework of the deflationary independent component analysis, by considering the similarity between the reference and extracted target, as well as the mutual independence of all potential sources. To solve the extraction problem by maximum-likelihood estimation, we introduce two source model types that can reflect the similarity. The experimental results from the CHiME3 dataset show that the target signal extracted by the SIBF is more accurate than the reference signal generated by the DNN. Index Terms: semiblind source s
Authors
(none)
Tags
Stats
Related papers
- Similarity-and-independence-aware Beamformer With Iterative Casting And Boost Start For Target Source Extraction Using Reference (2021)5.84
- Optimization Of Speaker Extraction Neural Network With Magnitude And Temporal Spectrum Approximation Loss (2019)11.29
- Improving Speaker Discrimination Of Target Speech Extraction With Time-domain Speakerbeam (2020)14.76
- Dual-path Transformer Based Neural Beamformer For Target Speech Extraction (2023)0.00
- Locate And Beamform: Two-dimensional Locating All-neural Beamformer For Multi-channel Speech Separation (2023)3.58
- Enhanced Neural Beamformer With Spatial Information For Target Speech Extraction (2023)2.26
- Neural Network-based Time-frequency-bin-wise Linear Combination Of Beamformers For Underdetermined Target Source Extraction (2026)0.00
- Statistical Beamformer Exploiting Non-stationarity And Sparsity With Spatially Constrained ICA For Robust Speech Recognition (2023)0.00