Cracking The Cocktail Party Problem By Multi-beam Deep Attractor Network
2018 Β· Zhuo Chen, Jinyu Li, Xiong Xiao, et al.
Abstract
While recent progresses in neural network approaches to single-channel speech separation, or more generally the cocktail party problem, achieved significant improvement, their performance for complex mixtures is still not satisfactory. In this work, we propose a novel multi-channel framework for multi-talker separation. In the proposed model, an input multi-channel mixture signal is firstly converted to a set of beamformed signals using fixed beam patterns. For this beamforming, we propose to use differential beamformers as they are more suitable for speech separation. Then each beamformed signal is fed into a single-channel anchored deep attractor network to generate separated signals. And the final separation is acquired by post selecting the separating output for each beams. To evaluate the proposed system, we create a challenging dataset comprising mixtures of 2, 3 or 4 speakers. Our results show that the proposed system largely improves the state of the art in speech separation, a
Authors
(none)
Tags
Stats
Related papers
- Speaker-independent Speech Separation With Deep Attractor Network (2017)16.84
- Deep Attractor Network For Single-microphone Speaker Separation (2016)17.88
- Locate And Beamform: Two-dimensional Locating All-neural Beamformer For Multi-channel Speech Separation (2023)3.58
- Sequential Multi-frame Neural Beamforming For Speech Separation And Enhancement (2019)0.00
- Multi-channel Narrow-band Deep Speech Separation With Full-band Permutation Invariant Training (2021)9.41
- Single-channel Multi-speaker Separation Using Deep Clustering (2016)0.00
- Mimo-dbnet: Multi-channel Input And Multiple Outputs Doa-aware Beamforming Network For Speech Separation (2022)0.00
- Deep Ad-hoc Beamforming Based On Speaker Extraction For Target-dependent Speech Separation (2020)7.50