On Behaviour of PLDA Models in the Task of Speaker Recognition

被引：0

作者：

Machlica, Lukas ^{[1
]}

Radova, Vlasta ^{[1
]}

机构：

[1] Univ West Bohemia Pilsen, Fac Sci Appl, Dept Cybernet, Plzen 30614, Czech Republic

来源：

TEXT, SPEECH, AND DIALOGUE, TSD 2013 | 2013年 / 8082卷

关键词：

PDLA; i-vectors; robustness; speaker recognition;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Nowadays, Factor analysis based techniques become part of state-of-the-art Speaker Recognition (SR) systems. These are the Joint Factor Analysis, its modified version called the concept of i-vectors, and the Probabilistic Linear Discriminant Analysis (PLDA). PLDA, as a generative statistical model, is usually used as the back end of a SR system, e. g. once i-vectors have been extracted, a PLDA model is used in the i-vector space to provide a verification score of two given i-vectors. In order to train the system huge amount of development data are utilized. In this paper the behaviour of the PLDA model is investigated. It is shown how does the amount of development data influence the system's performance. PLDA has several parameters to be tuned, i. e. dimensions of latent variables/subspaces, which represent the speaker and the channel variabilities. These will be examined too.

引用

页码：352 / 359

页数：8

共 50 条

[1] Mixture of PLDA Models in I-Vector Space for Gender-Independent Speaker Recognition
Senoussaoui, Mohammed
Kenny, Patrick
Bruemmer, Niko
de Villiers, Edward
Dumouchel, Pierre
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 32 - +
[2] A PLDA approach for language and text independent speaker recognition
Khosravani, Abbas
Homayounpour, Mohammad M.
COMPUTER SPEECH AND LANGUAGE, 2017, 45 : 457 - 474
[3] Nonlinear I-Vector Transformations for PLDA-Based Speaker Recognition
Cumani, Sandro
Laface, Pietro
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (04) : 908 - 919
[4] Blind score normalization method for PLDA based speaker recognition
Doroshin, Danila
Lubimov, Nikolay
Nastasenko, Marina
Kotov, Mikhail
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 210 - 213
[5] Extended Variability Modeling and Unsupervised Adaptation for PLDA Speaker Recognition
McCree, Alan
Sell, Gregory
Garcia-Romero, Daniel
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1552 - 1556
[6] PLDA FOR SPEAKER VERIFICATION WITH UTTERANCES OF ARBITRARY DURATION
Kenny, Patrick
Stafylakis, Themos
Ouellet, Pierre
Alam, Md Jahangir
Dumouchel, Pierre
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7649 - 7653
[7] TOWARDS PLDA-RBM BASED SPEAKER RECOGNITION IN MOBILE ENVIRONMENT: DESIGNING STACKED/DEEP PLDA-RBM SYSTEMS
Nautsch, Andreas
Hao, Hong
Stafylakis, Themos
Rathgeb, Christian
Busch, Christoph
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5055 - 5059
[8] Fusion of SNR-Dependent PLDA Models for Noise Robust Speaker Verification
Pang, Xiaomin
Mak, Man-Wai
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 619 - 623
[9] Scoring Heterogeneous Speaker Vectors Using Nonlinear Transformations and Tied PLDA Models
Cumani, Sandro
Laface, Pietro
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (05) : 995 - 1009
[10] Identity Vector Extraction Using Shared Mixture of PLDA for Short-Time Speaker Recognition
WANG Wenchao
XU Ji
YAN Yonghong
Chinese Journal of Electronics, 2019, 28 (02) : 357 - 363

← 1 2 3 4 5 →