The Importance Of Spatial And Spectral Information In Multiple Speaker Tracking
2024 Β· Hanan Beit-On, Vladimir Tourbabin, Boaz Rafaely
Abstract
Multi-speaker localization and tracking using microphone array recording is of importance in a wide range of applications. One of the challenges with multi-speaker tracking is to associate direction estimates with the correct speaker. Most existing association approaches rely on spatial or spectral information alone, leading to performance degradation when one of these information channels is partially known or missing. This paper studies a joint probability data association (JPDA)-based method that facilitates association based on joint spatial-spectral information. This is achieved by integrating speaker time-frequency (TF) masks, estimated based on spectral information, in the association probabilities calculation. An experimental study that tested the proposed method on recordings from the LOCATA challenge demonstrates the enhanced performance obtained by using joint spatial-spectral information in the association.
Authors
(none)
Tags
Stats
Related papers
- A Cascaded Multiple-speaker Localization And Tracking System (2018)0.00
- Multiple-speaker Localization Based On Direct-path Features And Likelihood Maximization With Spatial Sparsity Regularization (2016)11.85
- Jointly Tracking And Separating Speech Sources Using Multiple Features And The Generalized Labeled Multi-bernoulli Framework (2017)0.00
- Mask-weighted Spatial Likelihood Coding For Speaker-independent Joint Localization And Mask Estimation (2024)0.00
- End-to-end Multi-microphone Speaker Extraction Using Relative Transfer Functions (2025)0.00
- Audio Inputs For Active Speaker Detection And Localization Via Microphone Array (2023)0.00
- Multi-geometry Spatial Acoustic Modeling For Distant Speech Recognition (2019)6.34
- Multi-channel Speaker Verification For Single And Multi-talker Speech (2020)0.00