IMPROVED SPEAKER RECOGNITION WHEN USING I-VECTORS FROM MULTIPLE SPEECH SOURCES

被引:0
|
作者
McLaren, Mitchell [1 ]
van Leeuwen, David [1 ]
机构
[1] Radboud Univ Nijmegen, Ctr Language & Speech Technol, Nijmegen, Netherlands
来源
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011年
关键词
speaker recognition; i-vector; total variability; source conditions; linear discriminant analysis;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The concept of speaker recognition using i-vectors was recently introduced offering state-of-the-art performance. An i-vector is a compact representation of a speaker's utterance after projection into a low-dimensional, total variability subspace trained using factor analysis. A secondary process involving linear discriminant analysis (LDA) is then used to improve the discrimination of i-vectors from different speakers. The newness of this technology invokes the question as to the best way to train the total variability subspace and LDA matrix when using speech collected from distinctly different sources. This paper presents a comparative study of a number of subspace training techniques and a novel source-normalised-and-weighted LDA algorithm for the purpose of improving i-vector-based speaker recognition under mis-matched evaluation conditions. Results from the NIST 2010 speaker recognition evaluation (SRE) suggest that accounting for source conditions in the LDA matrix as opposed to the total variability subspace training regime provides improved robustness to mis-matched evaluation conditions.
引用
收藏
页码:5460 / 5463
页数:4
相关论文
共 50 条
  • [41] Emotional speaker recognition in real life conditions using multiple descriptors and i-vector speaker modeling technique
    Asma Mansour
    Farah Chenchah
    Zied Lachiri
    Multimedia Tools and Applications, 2019, 78 : 6441 - 6458
  • [42] Speaker Recognition from Coded Speech Using Support Vector Machines
    Janicki, Artur
    Staroszczyk, Tomasz
    TEXT, SPEECH AND DIALOGUE, TSD 2011, 2011, 6836 : 291 - 298
  • [43] SPEAKER RECOGNITION FOR MULTI-SPEAKER CONVERSATIONS USING X-VECTORS
    Snyder, David
    Garcia-Romero, Daniel
    Sell, Gregory
    McCree, Alan
    Povey, Daniel
    Khudanpur, Sanjeev
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5796 - 5800
  • [44] VOICE VERIFICATION USING I-VECTORS AND NEURAL NETWORKS WITH LIMITED TRAINING DATA
    Mamyrbayev, O. Zh.
    Othman, M.
    Akhmediyarova, A. T.
    Kydyrbekova, A. S.
    Mekebayev, N. O.
    BULLETIN OF THE NATIONAL ACADEMY OF SCIENCES OF THE REPUBLIC OF KAZAKHSTAN, 2019, (03): : 36 - 43
  • [45] Robust Speaker Recognition from Distant Speech under Real Reverberant Environments Using Speaker Embeddings
    Nandwana, Mahesh Kumar
    van Hout, Julien
    McLaren, Mitchell
    Stauffer, Allen
    Richey, Colleen
    Lawson, Aaron
    Graciarena, Martin
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1106 - 1110
  • [46] Multimedia document retrieval using speech and speaker recognition
    Viswanathan M.
    Beigi H.S.M.
    Dharanipragada S.
    Maali F.
    Tritschler A.
    International Journal on Document Analysis and Recognition, 2000, 2 (04) : 147 - 162
  • [47] I-Vector Extraction Using Speaker Relevancy for Short Duration Speaker Recognition
    Kang, Woo Hyun
    Cho, Won Ik
    Jang, Se Young
    Lee, Hyeon Seung
    Kim, Nam Soo
    IT CONVERGENCE AND SECURITY 2017, VOL 1, 2018, 449 : 79 - 87
  • [48] Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition
    Pelecanos, Jason
    Wang, Quan
    Moreno, Ignacio Lopez
    INTERSPEECH 2021, 2021, : 4603 - 4607
  • [49] INVESTIGATION ON NEURAL BANDWIDTH EXTENSION OF TELEPHONE SPEECH FOR IMPROVED SPEAKER RECOGNITION
    Nidadavolu, Phani Sankar
    Iglesias, Vicente
    Villalba, Jesus
    Dehak, Najim
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6111 - 6115
  • [50] Distributed speaker recognition using the ETSI distributed speech recognition standard
    Broun, CC
    Campbell, WM
    Pearce, D
    Kelleher, H
    IC-AI'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS I-III, 2001, : 244 - 248