Study of the Effect of I-vector Modeling on Short and Mismatch Utterance Duration for Speaker Verification

被引:0
|
作者
Sarkar, A. K. [1 ]
Matrouf, D. [1 ]
Bousquet, P. M. [1 ]
Bonastre, J. F. [1 ]
机构
[1] Univ Avignon, LIA, Avignon, France
来源
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年
关键词
Short segment; i-vector; Length Normalization; PLDA; Speaker Verification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is well known that state-of-the-art speaker verification systems using the i-vector concept perform well when target speakers training and test utterances have the same condition: long-long as per NIST evaluation. In practice, real-life applications impose strong constraints on the amount of data that can be used in training target and test speaker models. Since speaker verification systems based on the i-vector approach need to estimate some statistical parameters, the aim of this paper is to explore methods to train statistical parameters of the classical i-vector system when target speakers are trained and tested on mismatched data durations. Experimental results are shown on NIST 2008 SRE for various durations of target training and test speech segments ranging from long to very short, such as full (average 2.5 minutes), 5 seconds and 10 seconds.
引用
收藏
页码:2661 / 2664
页数:4
相关论文
共 50 条
  • [1] An Adaptive i-Vector Extraction for Speaker Verification with Short Utterance
    Poddar, Arnab
    Sahidullah, Md
    Saha, Goutam
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 326 - 332
  • [2] Minimax i-vector extractor for short duration speaker verification
    Hautamaki, Ville
    Cheng, You-Chi
    Rajan, Padmanabhan
    Lee, Chin-Hui
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3675 - 3679
  • [3] Nonparametrically trained PLDA for short duration i-vector speaker verification
    Khosravani, Abbas
    Homayounpour, Mohammad M.
    COMPUTER SPEECH AND LANGUAGE, 2018, 52 : 105 - 122
  • [4] Improving short utterance i-vector speaker verification using utterance variance modelling and compensation techniques
    Kanagasundaram, A.
    Dean, D.
    Sridharan, S.
    Gonzalez-Dominguez, J.
    Gonzalez-Rodriguez, J.
    Ramos, D.
    SPEECH COMMUNICATION, 2014, 59 : 69 - 82
  • [5] I-vector Transformation Using Conditional Generative Adversarial Networks for Short Utterance Speaker Verification
    Zhang, Jiacen
    Inoue, Nakamasa
    Shinoda, Koichi
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3613 - 3617
  • [6] Improving Short Utterance based I-vector Speaker Recognition using Source and Utterance-Duration Normalization Techniques
    Kanagasundaram, A.
    Dean, D.
    Gonzalez-Dominguez, J.
    Sridharan, S.
    Ramos, D.
    Gonzalez-Rodriguez, J.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2464 - 2468
  • [7] Boosting the Performance of I-Vector Based Speaker Verification via Utterance Partitioning
    Rao, Wei
    Mak, Man-Wai
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (05): : 1012 - 1022
  • [8] DURATION MISMATCH COMPENSATION FOR I-VECTOR BASED SPEAKER RECOGNITION SYSTEMS
    Hasan, Taufiq
    Saeidi, Rahim
    Hansen, John H. L.
    van Leeuwen, David A.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7663 - 7667
  • [9] An I-Vector Backend for Speaker Verification
    Kenny, Patrick
    Stafylakis, Themos
    Alam, Jahangir
    Kockmann, Marcel
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2307 - 2311
  • [10] I-Vector Extraction Using Speaker Relevancy for Short Duration Speaker Recognition
    Kang, Woo Hyun
    Cho, Won Ik
    Jang, Se Young
    Lee, Hyeon Seung
    Kim, Nam Soo
    IT CONVERGENCE AND SECURITY 2017, VOL 1, 2018, 449 : 79 - 87