Supervised Speech Separation Based On Deep Learning: An Overview
2017 Β· Deliang Wang, Jitong Chen
Abstract
Speech separation is the task of separating target speech from background interference. Traditionally, speech separation is studied as a signal processing problem. A more recent approach formulates speech separation as a supervised learning problem, where the discriminative patterns of speech, speakers, and background noise are learned from training data. Over the past decade, many supervised separation algorithms have been put forward. In particular, the recent introduction of deep learning to supervised speech separation has dramatically accelerated progress and boosted separation performance. This article provides a comprehensive overview of the research on deep learning based supervised speech separation in the last several years. We first introduce the background of speech separation and the formulation of supervised separation. Then we discuss three main components of supervised separation: learning machines, training targets, and acoustic features. Much of the overview is on sep
Authors
(none)
Tags
Stats
Related papers
- An Overview Of Deep-learning-based Audio-visual Speech Enhancement And Separation (2020)18.31
- Speaker Recognition Based On Deep Learning: An Overview (2020)18.86
- Investigating Self-supervised Learning For Speech Enhancement And Separation (2022)13.44
- SADDEL: Joint Speech Separation And Denoising Model Based On Multitask Learning (2020)0.00
- Deep Ad-hoc Beamforming Based On Speaker Extraction For Target-dependent Speech Separation (2020)7.50
- Single-channel Multi-speaker Separation Using Deep Clustering (2016)0.00
- Jointly Detecting And Separating Singing Voice: A Multi-task Approach (2018)7.81
- Deep Neural Network Techniques For Monaural Speech Enhancement: State Of The Art Analysis (2022)0.00