Multichannel Speech Enhancement By Raw Waveform-mapping Using Fully Convolutional Networks
2019 Β· Chang-Le Liu, Sze-Wei Fu, You-Jin Li, et al.
Abstract
In recent years, waveform-mapping-based speech enhancement (SE) methods have garnered significant attention. These methods generally use a deep learning model to directly process and reconstruct speech waveforms. Because both the input and output are in waveform format, the waveform-mapping-based SE methods can overcome the distortion caused by imperfect phase estimation, which may be encountered in spectral-mapping-based SE systems. So far, most waveform-mapping-based SE methods have focused on single-channel tasks. In this paper, we propose a novel fully convolutional network (FCN) with Sinc and dilated convolutional layers (termed SDFCN) for multichannel SE that operates in the time domain. We also propose an extended version of SDFCN, called the residual SDFCN (termed rSDFCN). The proposed methods are evaluated on two multichannel SE tasks, namely the dual-channel inner-ear microphones SE task and the distributed microphones SE task. The experimental results confirm the outstanding
Authors
(none)
Tags
Stats
Related papers
- Raw Waveform-based Speech Enhancement By Fully Convolutional Networks (2017)16.63
- Efficient Encoder-decoder And Dual-path Conformer For Comprehensive Feature Learning In Speech Enhancement (2023)7.16
- End-to-end Waveform Utterance Enhancement For Direct Evaluation Metrics Optimization By Fully Convolutional Neural Networks (2017)18.00
- Multichannel Speech Enhancement Without Beamforming (2021)9.41
- FB-MSTCN: A Full-band Single-channel Speech Enhancement Method Based On Multi-scale Temporal Convolutional Network (2022)6.77
- Wavecrn: An Efficient Convolutional Recurrent Neural Network For End-to-end Speech Enhancement (2020)14.02
- A Two-stage Full-band Speech Enhancement Model With Effective Spectral Compression Mapping (2022)0.00
- Distortionless Multi-channel Target Speech Enhancement For Overlapped Speech Recognition (2020)0.00