Taylorbeamixer: Learning Taylor-inspired All-neural Multi-channel Speech Enhancement From Beam-space Dictionary Perspective
2022 Β· Andong Li, Guochen Yu, Wenzhe Liu, et al.
Abstract
Despite the promising performance of existing frame-wise all-neural beamformers in the speech enhancement field, it remains unclear what the underlying mechanism exists. In this paper, we revisit the beamforming behavior from the beam-space dictionary perspective and formulate it into the learning and mixing of different beam-space components. Based on that, we propose an all-neural beamformer called TaylorBM to simulate Taylor's series expansion operation in which the 0th-order term serves as a spatial filter to conduct the beam mixing, and several high-order terms are tasked with residual noise cancellation for post-processing. The whole system is devised to work in an end-to-end manner. Experiments are conducted on the spatialized LibriSpeech corpus and results show that the proposed approach outperforms existing advanced baselines in terms of evaluation metrics.
Authors
(none)
Tags
Stats
Related papers
- Taylorbeamformer: Learning All-neural Beamformer For Multi-channel Speech Enhancement From Taylor's Approximation Theory (2022)9.41
- Embedding And Beamforming: All-neural Causal Beamformer For Multichannel Speech Enhancement (2021)13.05
- Sequential Multi-frame Neural Beamforming For Speech Separation And Enhancement (2019)0.00
- A Unified Multichannel Far-field Speech Recognition System: Combining Neural Beamforming With Attention Based End-to-end Model (2024)0.00
- Locate And Beamform: Two-dimensional Locating All-neural Beamformer For Multi-channel Speech Separation (2023)3.58
- Deep Long Short-term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition (2017)13.23
- Dual-path Transformer Based Neural Beamformer For Target Speech Extraction (2023)0.00
- Attention-based Neural Beamforming Layers For Multi-channel Speech Recognition (2021)0.00