On Behaviour of PLDA Models in the Task of Speaker Recognition

被引：0

作者：

Machlica, Lukas ^{[1
]}

Radova, Vlasta ^{[1
]}

机构：

[1] Univ West Bohemia Pilsen, Fac Sci Appl, Dept Cybernet, Plzen 30614, Czech Republic

来源：

TEXT, SPEECH, AND DIALOGUE, TSD 2013 | 2013年 / 8082卷

关键词：

PDLA; i-vectors; robustness; speaker recognition;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Nowadays, Factor analysis based techniques become part of state-of-the-art Speaker Recognition (SR) systems. These are the Joint Factor Analysis, its modified version called the concept of i-vectors, and the Probabilistic Linear Discriminant Analysis (PLDA). PLDA, as a generative statistical model, is usually used as the back end of a SR system, e. g. once i-vectors have been extracted, a PLDA model is used in the i-vector space to provide a verification score of two given i-vectors. In order to train the system huge amount of development data are utilized. In this paper the behaviour of the PLDA model is investigated. It is shown how does the amount of development data influence the system's performance. PLDA has several parameters to be tuned, i. e. dimensions of latent variables/subspaces, which represent the speaker and the channel variabilities. These will be examined too.

引用

页码：352 / 359

页数：8

共 50 条

[31] Towards multi-task learning of speech and speaker recognition
Vaessen, Nik
van Leeuwen, David A.
INTERSPEECH 2023, 2023, : 4898 - 4902
[32] DNN-Driven Mixture of PLDA for Robust Speaker Verification
Li, Na
Mak, Man-Wai
Chien, Jen-Tzung
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1371 - 1383
[33] UNIVERSAL ADVERSARIAL ATTACK AGAINST SPEAKER RECOGNITION MODELS
Hanina, Shoham
Zolfi, Alon
Elovici, Yuval
Shabtai, Asaf
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4860 - 4864
[34] A method of multi-models fusion for speaker recognition
Wu H.
Luo L.
Peng H.
Wen W.
International Journal of Speech Technology, 2022, 25 (2) : 493 - 498
[35] A Generalization of PLDA for Joint Modeling of Speaker Identity and Multiple Nuisance Conditions
Ferrer, Luciana
McLaren, Mitchell
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 82 - 86
[36] LARGE-SCALE SPEAKER SEARCH USING PLDA ON MISMATCHED CONDITIONS
Ma, Jeff
Silovsky, Jan
Siu, Man-hung
Kimball, Owen
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1846 - 1850
[37] Sparse kernel machines with empirical kernel maps for PLDA speaker verification
Rao, Wei
Mak, Man-Wai
COMPUTER SPEECH AND LANGUAGE, 2016, 38 : 104 - 121
[38] PLDA using Gaussian Restricted Boltzmann Machines with application to Speaker Verification
Stafylakis, Themos
Kenny, Patrick
Senoussaoui, Mohammed
Dumouchel, Pierre
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1690 - 1693
[39] Short Utterance Variance Modelling and Utterance Partitioning for PLDA Speaker Verification
Kanagasundaram, Ahilan
Dean, David
Sridharan, Sridha
Fookes, Clinton
Himawan, Ivan
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1835 - 1838
[40] HANDLING I-VECTORS FROM DIFFERENT RECORDING CONDITIONS USING MULTI-CHANNEL SIMPLIFIED PLDA IN SPEAKER RECOGNITION
Villalba, Jesus
Lleida, Eduardo
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6763 - 6767

← 1 2 3 4 5 →