Multi-speaker Localization Using Convolutional Neural Network Trained With Noise
2017 · Soumitro Chakrabarty, Emanuël A. P. Habets
Abstract
The problem of multi-speaker localization is formulated as a multi-class multi-label classification problem, which is solved using a convolutional neural network (CNN) based source localization method. Utilizing the common assumption of disjoint speaker activities, we propose a novel method to train the CNN using synthesized noise signals. The proposed localization method is evaluated for two speakers and compared to a well-known steered response power method.
Authors
(none)
Tags
Stats
Related papers
- Multi-speaker DOA Estimation Using Deep Convolutional Networks Trained With Noise Signals (2018)18.46
- Saladnet: Self-attentive Multisource Localization In The Ambisonics Domain (2021)7.50
- Leveraging Visual Supervision For Array-based Active Speaker Detection And Localization (2023)6.77
- Multi-channel End-to-end Neural Network For Speech Enhancement, Source Localization, And Voice Activity Detection (2022)0.00
- Deep Learning Based Multi-source Localization With Source Splitting And Its Effectiveness In Multi-talker Speech Recognition (2021)14.23
- Non-local Convolutional Neural Networks (nlcnn) For Speaker Recognition (2020)0.00
- Multiple-speaker Localization Based On Direct-path Features And Likelihood Maximization With Spatial Sparsity Regularization (2016)11.85
- CNN-LSTM Models For Multi-speaker Source Separation Using Bayesian Hyper Parameter Optimization (2019)6.34