On Behaviour of PLDA Models in the Task of Speaker Recognition

被引:0
作者
Machlica, Lukas [1 ]
Radova, Vlasta [1 ]
机构
[1] Univ West Bohemia Pilsen, Fac Sci Appl, Dept Cybernet, Plzen 30614, Czech Republic
来源
TEXT, SPEECH, AND DIALOGUE, TSD 2013 | 2013年 / 8082卷
关键词
PDLA; i-vectors; robustness; speaker recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, Factor analysis based techniques become part of state-of-the-art Speaker Recognition (SR) systems. These are the Joint Factor Analysis, its modified version called the concept of i-vectors, and the Probabilistic Linear Discriminant Analysis (PLDA). PLDA, as a generative statistical model, is usually used as the back end of a SR system, e. g. once i-vectors have been extracted, a PLDA model is used in the i-vector space to provide a verification score of two given i-vectors. In order to train the system huge amount of development data are utilized. In this paper the behaviour of the PLDA model is investigated. It is shown how does the amount of development data influence the system's performance. PLDA has several parameters to be tuned, i. e. dimensions of latent variables/subspaces, which represent the speaker and the channel variabilities. These will be examined too.
引用
收藏
页码:352 / 359
页数:8
相关论文
共 50 条
  • [41] A multi-task network for speaker and command recognition in industrial environments
    Bini, Stefano
    Percannella, Gennaro
    Saggese, Alessia
    Vento, Mario
    PATTERN RECOGNITION LETTERS, 2023, 176 : 62 - 68
  • [42] A Multi-task Framework of Speaker Recognition with TTS Data Augmentation
    Xie, Xingjia
    Zhi, Yiming
    Ouyang, Beibei
    Hong, Qingyang
    Li, Lin
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 210 - 215
  • [43] Source and System Features for Text Independent Speaker Recognition Using GMM Speaker Models
    Revathi, A.
    Venkataramani, Y.
    RECENT TRENDS IN NETWORKS AND COMMUNICATIONS, 2010, 90 : 21 - +
  • [44] FULL-COVARIANCE UBM AND HEAVY-TAILED PLDA IN I-VECTOR SPEAKER VERIFICATION
    Matejka, Pavel
    Glembek, Ondrej
    Castaldo, Fabio
    Alam, M. J.
    Plchot, Oldrich
    Kenny, Patrick
    Burget, Lukas
    Cernocky, Jan 'Honza'
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4828 - 4831
  • [45] Nonparametrically trained PLDA for short duration i-vector speaker verification
    Khosravani, Abbas
    Homayounpour, Mohammad M.
    COMPUTER SPEECH AND LANGUAGE, 2018, 52 : 105 - 122
  • [46] A Pseudo-task Design in Multi-task Learning Deep Neural Network for Speaker Recognition
    Lu, Xugang
    Shen, Peng
    Tsao, Yu
    Kawai, Hisashi
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [47] TRAINING SPEAKER RECOGNITION MODELS WITH RECORDING-LEVEL LABELS
    Alumae, Tanel
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 1066 - 1072
  • [48] The DKU System for the Speaker Recognition Task of the 2019 VOiCES from a Distance Challenge
    Cai, Danwei
    Qin, Xiaoyi
    Cai, Weicheng
    Li, Ming
    INTERSPEECH 2019, 2019, : 2493 - 2497
  • [49] Speaker Recognition
    Tripathi, Supriya
    Bhatnagar, Smriti
    2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGY (ICCCT), 2012, : 283 - 287
  • [50] NORMALIZATION OF TOTAL VARIABILITY MATRIX FOR I-VECTOR/PLDA SPEAKER VERIFICATION
    Rao, Wei
    Mak, Man-Wai
    Lee, Kong-Aik
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4180 - 4184