Compensation of Intrinsic Variability with Factor Analysis Modeling for Robust Speaker Verification

被引：0

作者：

Chen, Sheng ^{[1
]}

Xu, Mingxing ^{[1
]}

机构：

[1] Tsinghua Univ, Dept Comp Sci & Technol, Tsinghua Natl Lab Informat Sci & Technol, Key Lab Pervas Comp,Minist Educ, Beijing 100084, Peoples R China

来源：

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年

关键词：

speaker verification; intrinsic variability; joint factor analysis; i-vector; LDA; WCCN; NAP;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Performances of speaker verification systems are adversely affected by intrinsic variability in the real world applications. In this paper, factor analysis approaches of Joint Factor Analysis (JFA) and i-vector modeling are used to address the effects of intrinsic variations for robust speaker verification. The speaker variability and intrinsic variability are modeled with the speaker and session factors respectively in the JFA approach. In the i-vector framework, a low-dimensional space is defined to model the total variability and intrinsic variations are compensated with a variety of techniques including Linear Discriminant Analysis (LDA), Within-Class Co-variance Normalization (WCCN) and Nuisance Attribute Projection (NAP). Experiments in the intrinsic variation corpus show that factor analysis approaches of JFA and i-vector framework perform much better than the GMM-UBM paradigm in modeling the intrinsic variability. Relative reductions in Error Equal Rate (EER) of around 39.85% and 36.76% are obtained respectively for JFA and i-Vector+LDA+WCCN speaker verification systems, compared to the GMM-UBM baseline system.

引用

页码：1574 / 1577

页数：4

共 50 条

[21] Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions
Prieto, Santi
Ortega, Alfonso
Lopez-Espejo, Ivan
Lleida, Eduardo
INTERSPEECH 2020, 2020, : 1511 - 1515
[22] Simplified factor analysis in speaker verification
Guo, Wu
Li, Yijie
Dai, Lirong
2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1316 - 1319
[23] On comparing and combining intra-speaker variability compensation and unsupervised model adaptation in speaker verification
Garreton, Claudio
Yoma, Nestor Becerra
Huenupan, Fernando
Molina, Carlos
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 913 - 916
[24] REVERBERATION COMPENSATION FOR SPEAKER VERIFICATION
Peer, Itai
Rafaely, Boaz
Zigel, Yaniv
2008 IEEE 25TH CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, VOLS 1 AND 2, 2008, : 333 - +
[25] Eigenageing Compensation for Speaker Verification
Kelly, Finnian
Brummer, Niko
Harte, Naomi
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1623 - 1627
[26] Acoustic Factor Analysis based Universal Background Model for Robust Speaker Verification in Noise
Hasan, Taufiq
Hansen, John H. L.
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3126 - 3130
[27] Continuous Prosodic Features and Formant Modeling with Joint Factor Analysis for Speaker Verification
Dehak, Najim
Kenny, Patrick
Dumouchel, Pierre
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 853 - 856
[28] SNR-Invariant PLDA Modeling for Robust Speaker Verification
Li, Na
Mak, Man-Wai
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2317 - 2321
[29] Psychoacoustic Model Compensation with Robust Feature Set for Speaker Verification in Additive Noise
Panda, Ashish
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 629 - 632
[30] Score-level Compensation of Extreme Speech Duration Variability in Speaker Verification
Perez-Gomez, Sergio
Ramos, Daniel
Gonzalez-Dominguez, Javier
Gonzalez-Rodriguez, Joaquin
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 374 - 377

← 1 2 3 4 5 →