Speaker Analysis
50 papers tagged Speaker Analysis (ordered by heat_score)
Papers
- Speaker Recognition Based On Deep Learning: An Overview (2020)Zhongxin Bai, Xiao-Lei Zhang18.86
- Multi-speaker DOA Estimation Using Deep Convolutional Networks Trained With Noise Signals (2018)Soumitro Chakrabarty, Emanuël A. P. Habets18.46
- Deep Attractor Network For Single-microphone Speaker Separation (2016)Zhuo Chen, Yi Luo, Nima Mesgarani17.88
- Exploring The Encoding Layer And Loss Function In End-to-end Speaker And Language Recognition System (2018)Weicheng Cai, Jinkun Chen, Ming Li17.07
- Multichannel Long-term Streaming Neural Speech Enhancement For Static And Moving Speakers (2024)Changsheng Quan, Xiaofei Li16.05
- Rawnet: Advanced End-to-end Deep Neural Network Using Raw Waveforms For Text-independent Speaker Verification (2019)Jee-Weon Jung, Hee-Soo Heo, Ju-Ho Kim, et al.15.34
- Spot The Conversation: Speaker Diarisation In The Wild (2020)Joon Son Chung, Jaesung Huh, Arsha Nagrani, et al.15.31
- DNN-HMM Based Speaker Adaptive Emotion Recognition Using Proposed Epoch And MFCC Features (2018)Md. Shah Fahad, Jainath Yadav, Gyadhar Pradhan, et al.14.11
- LSTM Based Similarity Measurement With Spectral Clustering For Speaker Diarization (2019)Qingjian Lin, Ruiqing Yin, Ming Li, et al.13.79
- Deep Learning Based Phase Reconstruction For Speaker Separation: A Trigonometric Perspective (2018)Zhong-Qiu Wang, Ke Tan, Deliang Wang13.34
- Bertphone: Phonetically-aware Encoder Representations For Utterance-level Speaker And Language Recognition (2019)Shaoshi Ling, Julian Salazar, Yuzong Liu, et al.13.27
- Integration Of Speech Separation, Diarization, And Recognition For Multi-speaker Meetings: System Description, Comparison, And Analysis (2020)Desh Raj, Pavel Denisov, Zhuo Chen, et al.13.23
- All-neural Online Source Separation, Counting, And Diarization For Meeting Analysis (2019)Thilo von Neumann, Keisuke Kinoshita, Marc Delcroix, et al.13.05
- Recursive Speech Separation For Unknown Number Of Speakers (2019)Naoya Takahashi, Sudarsanam Parthasaarathy, Nabarun Goswami, et al.12.93
- Meta-tts: Meta-learning For Few-shot Speaker Adaptive Text-to-speech (2021)Sung-Feng Huang, Chyi-Jiunn Lin, da-Rong Liu, et al.12.74
- Joint Speaker Counting, Speech Recognition, And Speaker Identification For Overlapped Speech Of Any Number Of Speakers (2020)Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, et al.12.54
- Frame-level Speaker Embeddings For Text-independent Speaker Recognition And Analysis Of End-to-end Model (2018)Suwon Shon, Hao Tang, James Glass12.17
- Multimodal Emotion Recognition Using Transfer Learning From Speaker Recognition And Bert-based Models (2022)Sarala Padi, Seyed Omid Sadjadi, Dinesh Manocha, et al.12.10
- Speaker Adaptation Using Spectro-temporal Deep Features For Dysarthric And Elderly Speech Recognition (2022)Mengzhe Geng, Xurong Xie, Zi Ye, et al.12.02
- Optimization Of Data-driven Filterbank For Automatic Speaker Verification (2020)Susanta Sarangi, Md Sahidullah, Goutam Saha11.93
- Multiple-speaker Localization Based On Direct-path Features And Likelihood Maximization With Spatial Sparsity Regularization (2016)Xiaofei Li, Laurent Girin, Sharon Gannot, et al.11.85
- Speaker Embedding Extraction With Phonetic Information (2018)Yi Liu, Liang He, Jia Liu, et al.11.85
- Video-based Cross-modal Auxiliary Network For Multimodal Sentiment Analysis (2022)Rongfei Chen, Wenju Zhou, Yang Li, et al.11.76
- Investigating On Incorporating Pretrained And Learnable Speaker Representations For Multi-speaker Multi-style Text-to-speech (2021)Chung-Ming Chien, Jheng-Hao Lin, Chien-Yu Huang, et al.11.67
- Analysis Of DNN Speech Signal Enhancement For Robust Speaker Recognition (2018)Ondrej Novotny, Oldrich Plchot, Ondrej Glembek, et al.11.39
- Multi-speaker ASR Combining Non-autoregressive Conformer CTC And Conditional Speaker Chain (2021)Pengcheng Guo, Xuankai Chang, Shinji Watanabe, et al.11.31
- Machine Speech Chain With One-shot Speaker Adaptation (2018)Andros Tjandra, Sakriani Sakti, Satoshi Nakamura11.29
- Optimization Of Speaker Extraction Neural Network With Magnitude And Temporal Spectrum Approximation Loss (2019)Chenglin Xu, Wei Rao, Eng Siong Chng, et al.11.29
- Temporal Dynamic Convolutional Neural Network For Text-independent Speaker Verification And Phonemetic Analysis (2021)Seong-Hu Kim, Hyeonuk Nam, Yong-Hwa Park11.19
- Angular Softmax Loss For End-to-end Speaker Verification (2018)Yutian Li, Feng Gao, Zhijian Ou, et al.11.19
- Anonymizing Speech With Generative Adversarial Networks To Preserve Speaker Privacy (2022)Sarina Meyer, Pascal Tilli, Pavel Denisov, et al.11.19
- Speaker Verification Using End-to-end Adversarial Language Adaptation (2018)Johan Rohdin, Themos Stafylakis, Anna Silnova, et al.11.19
- S-vectors And TESA: Speaker Embeddings And A Speaker Authenticator Based On Transformer Encoder (2020)N J Metilda Sagaya Mary, S Umesh, Sandesh V Katta11.08
- Speaker Anonymization Using Neural Audio Codec Language Models (2023)Michele Panariello, Francesco Nespoli, Massimiliano Todisco, et al.10.97
- Speaker Identification In The Shouted Environment Using Suprasegmental Hidden Markov Models (2017)Ismail Shahin10.85
- Improving Speaker De-identification With Functional Data Analysis Of F0 Trajectories (2022)Lauri Tavi, Tomi Kinnunen, Rosa González Hautamäki10.85
- Singing Voice Separation And Vocal F0 Estimation Based On Mutual Combination Of Robust Principal Component Analysis And Subharmonic Summation (2016)Yukara Ikemiya, Katsutoshi Itoyama, Kazuyoshi Yoshii10.74
- Overview Of Speaker Modeling And Its Applications: From The Lens Of Deep Speaker Representation Learning (2024)Shuai Wang, Zhengyang Chen, Kong Aik Lee, et al.10.74
- Auxiliary Function-based Algorithm For Blind Extraction Of A Moving Speaker (2020)Jakub Janský, Zbyněk Koldovský, Jiří Málek, et al.10.61
- Combining Automatic Speaker Verification And Prosody Analysis For Synthetic Speech Detection (2022)Luigi Attorresi, Davide Salvi, Clara Borrelli, et al.10.48
- Overlap-aware Low-latency Online Speaker Diarization Based On End-to-end Local Segmentation (2021)Juan M. Coria, Hervé Bredin, Sahar Ghannay, et al.10.35
- Conversational Emotion Analysis Via Attention Mechanisms (2019)Zheng Lian, Jianhua Tao, Bin Liu, et al.10.35
- EEND-SS: Joint End-to-end Neural Speaker Diarization And Speech Separation For Flexible Number Of Speakers (2022)Soumi Maiti, Yushi Ueda, Shinji Watanabe, et al.10.35
- Accent And Speaker Disentanglement In Many-to-many Voice Conversion (2020)Zhichao Wang, Wenshuo Ge, Xiong Wang, et al.10.35
- Adversarial Speaker Adaptation (2019)Zhong Meng, Jinyu Li, Yifan Gong10.21
- Attention Back-end For Automatic Speaker Verification With Multiple Enrollment Utterances (2021)Chang Zeng, Xin Wang, Erica Cooper, et al.10.21
- Discriminative Neural Clustering For Speaker Diarisation (2019)Qiujia Li, Florian L. Kreyssig, Chao Zhang, et al.10.07
- Foolhd: Fooling Speaker Identification By Highly Imperceptible Adversarial Disturbances (2020)Ali Shahin Shamsabadi, Francisco Sepúlveda Teixeira, Alberto Abad, et al.10.07
- Streaming Multi-speaker ASR With RNN-T (2020)Ilya Sklyar, Anna Piunova, Yulan Liu10.07
- Content-dependent Fine-grained Speaker Embedding For Zero-shot Speaker Adaptation In Text-to-speech Synthesis (2022)Yixuan Zhou, Changhe Song, Xiang Li, et al.10.07