Study of the Effect of I-vector Modeling on Short and Mismatch Utterance Duration for Speaker Verification

被引：0

作者：

Sarkar, A. K. ^{[1
]}

Matrouf, D. ^{[1
]}

Bousquet, P. M. ^{[1
]}

Bonastre, J. F. ^{[1
]}

机构：

[1] Univ Avignon, LIA, Avignon, France

来源：

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年

关键词：

Short segment; i-vector; Length Normalization; PLDA; Speaker Verification;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

It is well known that state-of-the-art speaker verification systems using the i-vector concept perform well when target speakers training and test utterances have the same condition: long-long as per NIST evaluation. In practice, real-life applications impose strong constraints on the amount of data that can be used in training target and test speaker models. Since speaker verification systems based on the i-vector approach need to estimate some statistical parameters, the aim of this paper is to explore methods to train statistical parameters of the classical i-vector system when target speakers are trained and tested on mismatched data durations. Experimental results are shown on NIST 2008 SRE for various durations of target training and test speech segments ranging from long to very short, such as full (average 2.5 minutes), 5 seconds and 10 seconds.

引用

页码：2661 / 2664

页数：4

共 50 条

[1] An Adaptive i-Vector Extraction for Speaker Verification with Short Utterance
Poddar, Arnab
Sahidullah, Md
Saha, Goutam
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 326 - 332
[2] Minimax i-vector extractor for short duration speaker verification
Hautamaki, Ville
Cheng, You-Chi
Rajan, Padmanabhan
Lee, Chin-Hui
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3675 - 3679
[3] Nonparametrically trained PLDA for short duration i-vector speaker verification
Khosravani, Abbas
Homayounpour, Mohammad M.
COMPUTER SPEECH AND LANGUAGE, 2018, 52 : 105 - 122
[4] Improving short utterance i-vector speaker verification using utterance variance modelling and compensation techniques
Kanagasundaram, A.
Dean, D.
Sridharan, S.
Gonzalez-Dominguez, J.
Gonzalez-Rodriguez, J.
Ramos, D.
SPEECH COMMUNICATION, 2014, 59 : 69 - 82
[5] I-vector Transformation Using Conditional Generative Adversarial Networks for Short Utterance Speaker Verification
Zhang, Jiacen
Inoue, Nakamasa
Shinoda, Koichi
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3613 - 3617
[6] Improving Short Utterance based I-vector Speaker Recognition using Source and Utterance-Duration Normalization Techniques
Kanagasundaram, A.
Dean, D.
Gonzalez-Dominguez, J.
Sridharan, S.
Ramos, D.
Gonzalez-Rodriguez, J.
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2464 - 2468
[7] Boosting the Performance of I-Vector Based Speaker Verification via Utterance Partitioning
Rao, Wei
Mak, Man-Wai
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (05): : 1012 - 1022
[8] DURATION MISMATCH COMPENSATION FOR I-VECTOR BASED SPEAKER RECOGNITION SYSTEMS
Hasan, Taufiq
Saeidi, Rahim
Hansen, John H. L.
van Leeuwen, David A.
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7663 - 7667
[9] An I-Vector Backend for Speaker Verification
Kenny, Patrick
Stafylakis, Themos
Alam, Jahangir
Kockmann, Marcel
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2307 - 2311
[10] I-Vector Extraction Using Speaker Relevancy for Short Duration Speaker Recognition
Kang, Woo Hyun
Cho, Won Ik
Jang, Se Young
Lee, Hyeon Seung
Kim, Nam Soo
IT CONVERGENCE AND SECURITY 2017, VOL 1, 2018, 449 : 79 - 87

← 1 2 3 4 5 →