IMPROVED SPEAKER RECOGNITION WHEN USING I-VECTORS FROM MULTIPLE SPEECH SOURCES

被引：0

作者：

McLaren, Mitchell ^{[1
]}

van Leeuwen, David ^{[1
]}

机构：

[1] Radboud Univ Nijmegen, Ctr Language & Speech Technol, Nijmegen, Netherlands

来源：

2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011年

关键词：

speaker recognition; i-vector; total variability; source conditions; linear discriminant analysis;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The concept of speaker recognition using i-vectors was recently introduced offering state-of-the-art performance. An i-vector is a compact representation of a speaker's utterance after projection into a low-dimensional, total variability subspace trained using factor analysis. A secondary process involving linear discriminant analysis (LDA) is then used to improve the discrimination of i-vectors from different speakers. The newness of this technology invokes the question as to the best way to train the total variability subspace and LDA matrix when using speech collected from distinctly different sources. This paper presents a comparative study of a number of subspace training techniques and a novel source-normalised-and-weighted LDA algorithm for the purpose of improving i-vector-based speaker recognition under mis-matched evaluation conditions. Results from the NIST 2010 speaker recognition evaluation (SRE) suggest that accounting for source conditions in the LDA matrix as opposed to the total variability subspace training regime provides improved robustness to mis-matched evaluation conditions.

引用

页码：5460 / 5463

页数：4

共 50 条

[31] Single Image Camera Identification Using I-Vectors
Rashidi, Arash
Razzazi, Farbod
PROCEEDINGS OF THE 2017 7TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2017, : 406 - 410
[32] IDENTIFICATION OF VOICE QUALITY VARIATION USING I-VECTORS
Feng, Chuyao
van Leer, Eva
Anderson, David V.
2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 105 - 109
[33] APPLYING COMPENSATION TECHNIQUES ON I-VECTORS EXTRACTED FROM SHORT-TEST UTTERANCES FOR SPEAKER VERIFICATION USING DEEP NEURAL NETWORK
Yang, Il-Ho
Heo, Hee-Soo
Yoon, Sung-Hyun
Yu, Ha-Jin
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5490 - 5494
[34] DNN Senone MAP Multinomial i-vectors for Phonotactic Language Recognition
McCree, Alan
Garcia-Romero, Daniel
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 394 - 397
[35] Speaker Recognition Using e-Vectors
Cumani, Sandro
Laface, Pietro
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) : 736 - 748
[36] Client-wise cohort set selection by combining speaker- and phoneme-specific I-vectors for speaker verification
Ahmad, Waquar
Karnick, Harish
Hegde, Rajesh M.
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (07) : 8273 - 8294
[37] VARIABILITY COMPENSATION IN SMALL DATA: OVERSAMPLED EXTRACTION OF I-VECTORS FOR THE CLASSIFICATION OF DEPRESSED SPEECH
Cummins, Nicholas
Epps, Julien
Sethu, Vidhyasaharan
Krajewski, Jarek
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[38] Client-wise cohort set selection by combining speaker- and phoneme-specific I-vectors for speaker verification
Waquar Ahmad
Harish Karnick
Rajesh M. Hegde
Multimedia Tools and Applications, 2018, 77 : 8273 - 8294
[39] Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models
Zeinali, Hossein
Sameti, Hossein
Burget, Lukas
Cernocky, Jan Honza
COMPUTER SPEECH AND LANGUAGE, 2017, 46 : 53 - 71
[40] Emotional speaker recognition in real life conditions using multiple descriptors and i-vector speaker modeling technique
Mansour, Asma
Chenchah, Farah
Lachiri, Zied
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (06) : 6441 - 6458

← 1 2 3 4 5 →