On Behaviour of PLDA Models in the Task of Speaker Recognition

Cited by: 0

Authors:

Machlica, Lukas [1]

Radova, Vlasta [1]

Affiliation:

[1] Univ West Bohemia Pilsen, Fac Sci Appl, Dept Cybernet, Plzen 30614, Czech Republic

Source:

TEXT, SPEECH, AND DIALOGUE, TSD 2013 | 2013 / Vol. 8082

Keywords:

PLDA; i-vectors; robustness; speaker recognition;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Factor-analysis-based techniques have become part of state-of-the-art Speaker Recognition (SR) systems. These include Joint Factor Analysis, its modified version known as the i-vector approach, and Probabilistic Linear Discriminant Analysis (PLDA). PLDA, a generative statistical model, is usually used as the back end of an SR system, e.g., once i-vectors have been extracted, a PLDA model is applied in the i-vector space to provide a verification score for two given i-vectors. A huge amount of development data is used to train the system. This paper investigates the behaviour of the PLDA model. It shows how the amount of development data influences the system's performance. PLDA also has several parameters to be tuned, i.e., the dimensions of the latent variables/subspaces that represent the speaker and channel variabilities. These are examined as well.
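The verification score mentioned in the abstract can be illustrated with a minimal sketch. It assumes the commonly used subspace form of PLDA, x = mu + F h + G w + eps, where F spans the speaker subspace, G the channel subspace, and eps has diagonal covariance; the abstract itself does not spell out the model, so this form and all names below (`plda_llr`, `F`, `G`, `sigma_diag`) are illustrative assumptions, not the authors' implementation. The score is the log-likelihood ratio between the hypotheses "same speaker" and "different speakers" for a pair of i-vectors.

```python
import numpy as np

def gaussian_logpdf(x, mean, cov):
    """Log-density of a multivariate Gaussian via slogdet and solve."""
    d = x - mean
    _, logdet = np.linalg.slogdet(cov)
    maha = d @ np.linalg.solve(cov, d)
    return -0.5 * (x.size * np.log(2.0 * np.pi) + logdet + maha)

def plda_llr(x1, x2, mu, F, G, sigma_diag):
    """Log-likelihood ratio: same speaker vs. different speakers.

    Assumed model (hypothetical parametrisation):
      x = mu + F h + G w + eps, with h, w ~ N(0, I), eps ~ N(0, diag(sigma_diag)).
    The column counts of F and G are the speaker and channel subspace
    dimensions, i.e. the tunable parameters the abstract refers to.
    """
    B = F @ F.T                        # between-speaker covariance
    W = G @ G.T + np.diag(sigma_diag)  # within-speaker (channel + noise) covariance
    T = B + W                          # total covariance of a single i-vector
    x = np.concatenate([x1, x2])
    m = np.concatenate([mu, mu])
    zero = np.zeros_like(B)
    cov_same = np.block([[T, B], [B, T]])        # shared speaker factor h
    cov_diff = np.block([[T, zero], [zero, T]])  # independent speaker factors
    return gaussian_logpdf(x, m, cov_same) - gaussian_logpdf(x, m, cov_diff)

# Toy 2-D example with a 1-D speaker subspace and no channel subspace.
mu = np.zeros(2)
F = np.array([[3.0], [0.0]])
G = np.zeros((2, 1))
sigma_diag = np.array([0.1, 0.1])
score_close = plda_llr(np.array([2.0, 0.0]), np.array([2.0, 0.0]), mu, F, G, sigma_diag)
score_far = plda_llr(np.array([2.0, 0.0]), np.array([-2.0, 0.0]), mu, F, G, sigma_diag)
```

In this toy setting a positive score favours the same-speaker hypothesis and a negative score the different-speaker hypothesis. Building the full joint covariance is only practical for small dimensions; it is used here for clarity rather than efficiency.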

Pages: 352-359

Page count: 8

