Compensation of Intrinsic Variability with Factor Analysis Modeling for Robust Speaker Verification

被引:0
|
作者
Chen, Sheng [1 ]
Xu, Mingxing [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Tsinghua Natl Lab Informat Sci & Technol, Key Lab Pervas Comp,Minist Educ, Beijing 100084, Peoples R China
关键词
speaker verification; intrinsic variability; joint factor analysis; i-vector; LDA; WCCN; NAP;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Performances of speaker verification systems are adversely affected by intrinsic variability in the real world applications. In this paper, factor analysis approaches of Joint Factor Analysis (JFA) and i-vector modeling are used to address the effects of intrinsic variations for robust speaker verification. The speaker variability and intrinsic variability are modeled with the speaker and session factors respectively in the JFA approach. In the i-vector framework, a low-dimensional space is defined to model the total variability and intrinsic variations are compensated with a variety of techniques including Linear Discriminant Analysis (LDA), Within-Class Co-variance Normalization (WCCN) and Nuisance Attribute Projection (NAP). Experiments in the intrinsic variation corpus show that factor analysis approaches of JFA and i-vector framework perform much better than the GMM-UBM paradigm in modeling the intrinsic variability. Relative reductions in Error Equal Rate (EER) of around 39.85% and 36.76% are obtained respectively for JFA and i-Vector+LDA+WCCN speaker verification systems, compared to the GMM-UBM baseline system.
引用
收藏
页码:1574 / 1577
页数:4
相关论文
共 50 条
  • [21] Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions
    Prieto, Santi
    Ortega, Alfonso
    Lopez-Espejo, Ivan
    Lleida, Eduardo
    INTERSPEECH 2020, 2020, : 1511 - 1515
  • [22] Simplified factor analysis in speaker verification
    Guo, Wu
    Li, Yijie
    Dai, Lirong
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1316 - 1319
  • [23] On comparing and combining intra-speaker variability compensation and unsupervised model adaptation in speaker verification
    Garreton, Claudio
    Yoma, Nestor Becerra
    Huenupan, Fernando
    Molina, Carlos
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 913 - 916
  • [24] REVERBERATION COMPENSATION FOR SPEAKER VERIFICATION
    Peer, Itai
    Rafaely, Boaz
    Zigel, Yaniv
    2008 IEEE 25TH CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, VOLS 1 AND 2, 2008, : 333 - +
  • [25] Eigenageing Compensation for Speaker Verification
    Kelly, Finnian
    Brummer, Niko
    Harte, Naomi
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1623 - 1627
  • [26] Acoustic Factor Analysis based Universal Background Model for Robust Speaker Verification in Noise
    Hasan, Taufiq
    Hansen, John H. L.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3126 - 3130
  • [27] Continuous Prosodic Features and Formant Modeling with Joint Factor Analysis for Speaker Verification
    Dehak, Najim
    Kenny, Patrick
    Dumouchel, Pierre
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 853 - 856
  • [28] SNR-Invariant PLDA Modeling for Robust Speaker Verification
    Li, Na
    Mak, Man-Wai
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2317 - 2321
  • [29] Psychoacoustic Model Compensation with Robust Feature Set for Speaker Verification in Additive Noise
    Panda, Ashish
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 629 - 632
  • [30] Score-level Compensation of Extreme Speech Duration Variability in Speaker Verification
    Perez-Gomez, Sergio
    Ramos, Daniel
    Gonzalez-Dominguez, Javier
    Gonzalez-Rodriguez, Joaquin
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 374 - 377