IMPROVED SPEAKER RECOGNITION WHEN USING I-VECTORS FROM MULTIPLE SPEECH SOURCES

Cited by: 0

Authors:

McLaren, Mitchell [1]

van Leeuwen, David [1]

Affiliations:

[1] Radboud Univ Nijmegen, Ctr Language & Speech Technol, Nijmegen, Netherlands

Source:

2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011

Keywords:

speaker recognition; i-vector; total variability; source conditions; linear discriminant analysis

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

The concept of speaker recognition using i-vectors was recently introduced, offering state-of-the-art performance. An i-vector is a compact representation of a speaker's utterance after projection into a low-dimensional total variability subspace trained using factor analysis. A secondary process involving linear discriminant analysis (LDA) is then used to improve the discrimination of i-vectors from different speakers. The newness of this technology raises the question of how best to train the total variability subspace and LDA matrix when using speech collected from distinctly different sources. This paper presents a comparative study of a number of subspace training techniques and a novel source-normalised-and-weighted LDA algorithm for the purpose of improving i-vector-based speaker recognition under mismatched evaluation conditions. Results from the NIST 2010 speaker recognition evaluation (SRE) suggest that accounting for source conditions in the LDA matrix, as opposed to the total variability subspace training regime, provides improved robustness to mismatched evaluation conditions.
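
For orientation, the two modelling steps named in the abstract are commonly written as shown here. This is a sketch of the textbook formulation only, not the paper's source-normalised-and-weighted LDA, whose definition is not given in this record. All symbols (M, m, T, w, S_b, S_w, A, n_s, N) are notation assumed for illustration, and the weighting of the scatter matrices varies between papers.

```latex
% Total variability model (assumed notation):
% M = utterance supervector, m = speaker- and channel-independent mean supervector,
% T = low-rank total variability matrix, w = i-vector
M = m + T w

% Standard LDA over i-vectors from S speakers; speaker s has n_s i-vectors w_i^s,
% and N = \sum_s n_s is the total number of i-vectors
\bar{w}_s = \frac{1}{n_s} \sum_{i=1}^{n_s} w_i^s ,
\qquad
\bar{w} = \frac{1}{N} \sum_{s=1}^{S} \sum_{i=1}^{n_s} w_i^s

% Between-speaker and within-speaker scatter matrices
S_b = \sum_{s=1}^{S} n_s \, (\bar{w}_s - \bar{w})(\bar{w}_s - \bar{w})^{\top}
\qquad
S_w = \sum_{s=1}^{S} \sum_{i=1}^{n_s} (w_i^s - \bar{w}_s)(w_i^s - \bar{w}_s)^{\top}

% The columns of the LDA matrix A are the leading eigenvectors v of
S_b \, v = \lambda \, S_w \, v
% and an i-vector is then projected as A^{\top} w
```

Under this formulation, the training question raised in the abstract amounts to deciding which speech sources contribute to estimating T and which contribute to S_b and S_w.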


Pages: 5460 - 5463

Page count: 4

50 items in total

[41] Emotional speaker recognition in real life conditions using multiple descriptors and i-vector speaker modeling technique
Asma Mansour
Farah Chenchah
Zied Lachiri
Multimedia Tools and Applications, 2019, 78 : 6441 - 6458
[42] Speaker Recognition from Coded Speech Using Support Vector Machines
Janicki, Artur
Staroszczyk, Tomasz
TEXT, SPEECH AND DIALOGUE, TSD 2011, 2011, 6836 : 291 - 298
[43] SPEAKER RECOGNITION FOR MULTI-SPEAKER CONVERSATIONS USING X-VECTORS
Snyder, David
Garcia-Romero, Daniel
Sell, Gregory
McCree, Alan
Povey, Daniel
Khudanpur, Sanjeev
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5796 - 5800
[44] VOICE VERIFICATION USING I-VECTORS AND NEURAL NETWORKS WITH LIMITED TRAINING DATA
Mamyrbayev, O. Zh.
Othman, M.
Akhmediyarova, A. T.
Kydyrbekova, A. S.
Mekebayev, N. O.
BULLETIN OF THE NATIONAL ACADEMY OF SCIENCES OF THE REPUBLIC OF KAZAKHSTAN, 2019, (03) : 36 - 43
[45] Robust Speaker Recognition from Distant Speech under Real Reverberant Environments Using Speaker Embeddings
Nandwana, Mahesh Kumar
van Hout, Julien
McLaren, Mitchell
Stauffer, Allen
Richey, Colleen
Lawson, Aaron
Graciarena, Martin
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1106 - 1110
[46] Multimedia document retrieval using speech and speaker recognition
Viswanathan M.
Beigi H.S.M.
Dharanipragada S.
Maali F.
Tritschler A.
International Journal on Document Analysis and Recognition, 2000, 2 (04) : 147 - 162
[47] I-Vector Extraction Using Speaker Relevancy for Short Duration Speaker Recognition
Kang, Woo Hyun
Cho, Won Ik
Jang, Se Young
Lee, Hyeon Seung
Kim, Nam Soo
IT CONVERGENCE AND SECURITY 2017, VOL 1, 2018, 449 : 79 - 87
[48] Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition
Pelecanos, Jason
Wang, Quan
Moreno, Ignacio Lopez
INTERSPEECH 2021, 2021, : 4603 - 4607
[49] INVESTIGATION ON NEURAL BANDWIDTH EXTENSION OF TELEPHONE SPEECH FOR IMPROVED SPEAKER RECOGNITION
Nidadavolu, Phani Sankar
Iglesias, Vicente
Villalba, Jesus
Dehak, Najim
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6111 - 6115
[50] Distributed speaker recognition using the ETSI distributed speech recognition standard
Broun, CC
Campbell, WM
Pearce, D
Kelleher, H
IC-AI'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS I-III, 2001, : 244 - 248
