TPARN: Triple-path Attentive Recurrent Network For Time-domain Multichannel Speech Enhancement
2021 Β· Ashutosh Pandey, Buye Xu, Anurag Kumar, et al.
Abstract
In this work, we propose a new model called triple-path attentive recurrent network (TPARN) for multichannel speech enhancement in the time domain. TPARN extends a single-channel dual-path network to a multichannel network by adding a third path along the spatial dimension. First, TPARN processes speech signals from all channels independently using a dual-path attentive recurrent network (ARN), which is a recurrent neural network (RNN) augmented with self-attention. Next, an ARN is introduced along the spatial dimension for spatial context aggregation. TPARN is designed as a multiple-input and multiple-output architecture to enhance all input channels simultaneously. Experimental results demonstrate the superiority of TPARN over existing state-of-the-art approaches.
Authors
(none)
Tags
Stats
Related papers
- Dual-path Self-attention RNN For Real-time Speech Enhancement (2020)0.00
- Multichannel Speech Enhancement Without Beamforming (2021)9.41
- PDPCRN: Parallel Dual-path CRN With Bi-directional Inter-branch Interactions For Multi-channel Speech Enhancement (2023)0.00
- Monaural Speech Enhancement Using A Multi-branch Temporal Convolutional Network (2019)3.58
- DPCRN: Dual-path Convolution Recurrent Network For Single Channel Speech Enhancement (2021)14.35
- Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks (2020)5.84
- Multi-loss Convolutional Network With Time-frequency Attention For Speech Enhancement (2023)0.00
- Time-domain Speech Enhancement For Robust Automatic Speech Recognition (2022)7.16