On Behaviour of PLDA Models in the Task of Speaker Recognition

被引:0
|
作者
Machlica, Lukas [1 ]
Radova, Vlasta [1 ]
机构
[1] Univ West Bohemia Pilsen, Fac Sci Appl, Dept Cybernet, Plzen 30614, Czech Republic
来源
TEXT, SPEECH, AND DIALOGUE, TSD 2013 | 2013年 / 8082卷
关键词
PDLA; i-vectors; robustness; speaker recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, Factor analysis based techniques become part of state-of-the-art Speaker Recognition (SR) systems. These are the Joint Factor Analysis, its modified version called the concept of i-vectors, and the Probabilistic Linear Discriminant Analysis (PLDA). PLDA, as a generative statistical model, is usually used as the back end of a SR system, e. g. once i-vectors have been extracted, a PLDA model is used in the i-vector space to provide a verification score of two given i-vectors. In order to train the system huge amount of development data are utilized. In this paper the behaviour of the PLDA model is investigated. It is shown how does the amount of development data influence the system's performance. PLDA has several parameters to be tuned, i. e. dimensions of latent variables/subspaces, which represent the speaker and the channel variabilities. These will be examined too.
引用
收藏
页码:352 / 359
页数:8
相关论文
共 50 条
  • [31] Towards multi-task learning of speech and speaker recognition
    Vaessen, Nik
    van Leeuwen, David A.
    INTERSPEECH 2023, 2023, : 4898 - 4902
  • [32] DNN-Driven Mixture of PLDA for Robust Speaker Verification
    Li, Na
    Mak, Man-Wai
    Chien, Jen-Tzung
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1371 - 1383
  • [33] UNIVERSAL ADVERSARIAL ATTACK AGAINST SPEAKER RECOGNITION MODELS
    Hanina, Shoham
    Zolfi, Alon
    Elovici, Yuval
    Shabtai, Asaf
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4860 - 4864
  • [34] A method of multi-models fusion for speaker recognition
    Wu H.
    Luo L.
    Peng H.
    Wen W.
    International Journal of Speech Technology, 2022, 25 (2) : 493 - 498
  • [35] A Generalization of PLDA for Joint Modeling of Speaker Identity and Multiple Nuisance Conditions
    Ferrer, Luciana
    McLaren, Mitchell
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 82 - 86
  • [36] LARGE-SCALE SPEAKER SEARCH USING PLDA ON MISMATCHED CONDITIONS
    Ma, Jeff
    Silovsky, Jan
    Siu, Man-hung
    Kimball, Owen
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1846 - 1850
  • [37] Sparse kernel machines with empirical kernel maps for PLDA speaker verification
    Rao, Wei
    Mak, Man-Wai
    COMPUTER SPEECH AND LANGUAGE, 2016, 38 : 104 - 121
  • [38] PLDA using Gaussian Restricted Boltzmann Machines with application to Speaker Verification
    Stafylakis, Themos
    Kenny, Patrick
    Senoussaoui, Mohammed
    Dumouchel, Pierre
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1690 - 1693
  • [39] Short Utterance Variance Modelling and Utterance Partitioning for PLDA Speaker Verification
    Kanagasundaram, Ahilan
    Dean, David
    Sridharan, Sridha
    Fookes, Clinton
    Himawan, Ivan
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1835 - 1838
  • [40] HANDLING I-VECTORS FROM DIFFERENT RECORDING CONDITIONS USING MULTI-CHANNEL SIMPLIFIED PLDA IN SPEAKER RECOGNITION
    Villalba, Jesus
    Lleida, Eduardo
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6763 - 6767