IMPROVED SPEAKER RECOGNITION WHEN USING I-VECTORS FROM MULTIPLE SPEECH SOURCES

Cited by: 0
Authors
McLaren, Mitchell [1]
van Leeuwen, David [1]
Affiliations
[1] Radboud Univ Nijmegen, Ctr Language & Speech Technol, Nijmegen, Netherlands
Source
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011
Keywords
speaker recognition; i-vector; total variability; source conditions; linear discriminant analysis;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
The concept of speaker recognition using i-vectors was recently introduced, offering state-of-the-art performance. An i-vector is a compact representation of a speaker's utterance after projection into a low-dimensional total variability subspace trained using factor analysis. A secondary process involving linear discriminant analysis (LDA) is then used to improve the discrimination of i-vectors from different speakers. The newness of this technology raises the question of how best to train the total variability subspace and LDA matrix when using speech collected from distinctly different sources. This paper presents a comparative study of a number of subspace training techniques and a novel source-normalised-and-weighted LDA algorithm for the purpose of improving i-vector-based speaker recognition under mismatched evaluation conditions. Results from the NIST 2010 speaker recognition evaluation (SRE) suggest that accounting for source conditions in the LDA matrix, as opposed to the total variability subspace training regime, provides improved robustness to mismatched evaluation conditions.
Pages: 5460 - 5463
Page count: 4
Related papers
50 in total
  • [31] Single Image Camera Identification Using I-Vectors
    Rashidi, Arash
    Razzazi, Farbod
    PROCEEDINGS OF THE 2017 7TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2017, : 406 - 410
  • [32] IDENTIFICATION OF VOICE QUALITY VARIATION USING I-VECTORS
    Feng, Chuyao
    van Leer, Eva
    Anderson, David V.
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 105 - 109
  • [33] APPLYING COMPENSATION TECHNIQUES ON I-VECTORS EXTRACTED FROM SHORT-TEST UTTERANCES FOR SPEAKER VERIFICATION USING DEEP NEURAL NETWORK
    Yang, Il-Ho
    Heo, Hee-Soo
    Yoon, Sung-Hyun
    Yu, Ha-Jin
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5490 - 5494
  • [34] DNN Senone MAP Multinomial i-vectors for Phonotactic Language Recognition
    McCree, Alan
    Garcia-Romero, Daniel
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 394 - 397
  • [35] Speaker Recognition Using e-Vectors
    Cumani, Sandro
    Laface, Pietro
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) : 736 - 748
  • [36] Client-wise cohort set selection by combining speaker- and phoneme-specific I-vectors for speaker verification
    Ahmad, Waquar
    Karnick, Harish
    Hegde, Rajesh M.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (07) : 8273 - 8294
  • [37] VARIABILITY COMPENSATION IN SMALL DATA: OVERSAMPLED EXTRACTION OF I-VECTORS FOR THE CLASSIFICATION OF DEPRESSED SPEECH
    Cummins, Nicholas
    Epps, Julien
    Sethu, Vidhyasaharan
    Krajewski, Jarek
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [38] Client-wise cohort set selection by combining speaker- and phoneme-specific I-vectors for speaker verification
    Waquar Ahmad
    Harish Karnick
    Rajesh M. Hegde
    Multimedia Tools and Applications, 2018, 77 : 8273 - 8294
  • [39] Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models
    Zeinali, Hossein
    Sameti, Hossein
    Burget, Lukas
    Cernocky, Jan Honza
    COMPUTER SPEECH AND LANGUAGE, 2017, 46 : 53 - 71
  • [40] Emotional speaker recognition in real life conditions using multiple descriptors and i-vector speaker modeling technique
    Mansour, Asma
    Chenchah, Farah
    Lachiri, Zied
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (06) : 6441 - 6458