On Behaviour of PLDA Models in the Task of Speaker Recognition

Cited by: 0
Authors
Machlica, Lukas [1]
Radova, Vlasta [1]
Affiliations
[1] Univ West Bohemia Pilsen, Fac Sci Appl, Dept Cybernet, Plzen 30614, Czech Republic
Source
TEXT, SPEECH, AND DIALOGUE, TSD 2013 | 2013, Vol. 8082
Keywords
PLDA; i-vectors; robustness; speaker recognition
DOI
Not available
CLC number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Factor analysis based techniques have become part of state-of-the-art Speaker Recognition (SR) systems. These are Joint Factor Analysis, its modified version known as the concept of i-vectors, and Probabilistic Linear Discriminant Analysis (PLDA). PLDA, as a generative statistical model, is usually used as the back end of an SR system: once i-vectors have been extracted, a PLDA model is used in the i-vector space to provide a verification score for two given i-vectors. A huge amount of development data is utilized to train the system. In this paper, the behaviour of the PLDA model is investigated. It is shown how the amount of development data influences the system's performance. PLDA has several parameters to be tuned, namely the dimensions of the latent variables/subspaces that represent the speaker and channel variabilities. These are examined too.
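
The verification score mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from the paper. It assumes a common Gaussian PLDA formulation in which a centered i-vector is the sum of a speaker term (loading matrix `V`), a channel term (loading matrix `U`), and diagonal residual noise (`sigma_diag`). The score is the log-likelihood ratio between the "same speaker" and "different speakers" hypotheses. All names here are hypothetical, and the column counts of `V` and `U` play the role of the speaker and channel subspace dimensions that the abstract describes as tunable.

```python
import numpy as np


def gaussian_logpdf_zero_mean(z, cov):
    """Log-density of a zero-mean multivariate Gaussian evaluated at z."""
    d = z.shape[0]
    _, logdet = np.linalg.slogdet(cov)
    maha = z @ np.linalg.solve(cov, z)
    return -0.5 * (d * np.log(2.0 * np.pi) + logdet + maha)


def plda_llr(x1, x2, V, U, sigma_diag):
    """Verification score for two centered i-vectors (illustrative only).

    Assumed model: x = V y + U c + e, with y, c ~ N(0, I) and
    e ~ N(0, diag(sigma_diag)). V, U and sigma_diag are hypothetical
    parameter names, not notation from the paper.
    """
    between = V @ V.T                          # speaker (between-class) covariance
    within = U @ U.T + np.diag(sigma_diag)     # channel + residual covariance
    total = between + within
    zeros = np.zeros_like(total)

    z = np.concatenate([x1, x2])
    # Same speaker: both vectors share y, so they are correlated via `between`.
    cov_same = np.block([[total, between], [between, total]])
    # Different speakers: the two vectors are independent.
    cov_diff = np.block([[total, zeros], [zeros, total]])
    return gaussian_logpdf_zero_mean(z, cov_same) - gaussian_logpdf_zero_mean(z, cov_diff)
```

A positive score favours the same-speaker hypothesis under this assumed model. Building the stacked covariance matrices directly keeps the sketch short, and practical systems usually use an algebraically simplified form instead.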
Pages: 352-359
Number of pages: 8