Embedding And Beamforming: All-neural Causal Beamformer For Multichannel Speech Enhancement
2021 Β· Andong Li, Wenzhe Liu, Chengshi Zheng, et al.
Abstract
The spatial covariance matrix has been considered to be significant for beamformers. Standing upon the intersection of traditional beamformers and deep neural networks, we propose a causal neural beamformer paradigm called Embedding and Beamforming, and two core modules are designed accordingly, namely EM and BM. For EM, instead of estimating spatial covariance matrix explicitly, the 3-D embedding tensor is learned with the network, where both spectral and spatial discriminative information can be represented. For BM, a network is directly leveraged to derive the beamforming weights so as to implement filter-and-sum operation. To further improve the speech quality, a post-processing module is introduced to further suppress the residual noise. Based on the DNS-Challenge dataset, we conduct the experiments for multichannel speech enhancement and the results show that the proposed system outperforms previous advanced baselines by a large margin in multiple evaluation metrics.
Authors
(none)
Tags
Stats
Related papers
- Locate And Beamform: Two-dimensional Locating All-neural Beamformer For Multi-channel Speech Separation (2023)3.58
- Sequential Multi-frame Neural Beamforming For Speech Separation And Enhancement (2019)0.00
- Multi-channel End-to-end Neural Network For Speech Enhancement, Source Localization, And Voice Activity Detection (2022)0.00
- Dual-path Transformer Based Neural Beamformer For Target Speech Extraction (2023)0.00
- Attention-based Neural Beamforming Layers For Multi-channel Speech Recognition (2021)0.00
- Multichannel Loss Function For Supervised Speech Source Separation By Mask-based Beamforming (2019)7.50
- Taylorbeamformer: Learning All-neural Beamformer For Multi-channel Speech Enhancement From Taylor's Approximation Theory (2022)9.41
- Deep Ad-hoc Beamforming (2018)9.59