Utterance partitioning with acoustic vector resampling for GMM-SVM speaker verification

被引:27
|
作者
Mak, Man-Wai [1 ]
Rao, Wei [1 ]
机构
[1] Hong Kong Polytech Univ, Elect & Informat Engn Dept, Ctr Signal Proc, Hong Kong, Hong Kong, Peoples R China
关键词
Speaker verification; GMM-supervectors (GSV); Utterance partitioning; GMM-SVM; Support vector machine; Random resampling; Data imbalance; MACHINES; ENSEMBLE;
D O I
10.1016/j.specom.2010.06.011
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recent research has demonstrated the merit of combining Gaussian mixture models and support vector machine (SVM) for text-independent speaker verification. However, one unaddressed issue in this GMM-SVM approach is the imbalance between the numbers of speaker-class utterances and impostor-class utterances available for training a speaker-dependent SVM. This paper proposes a resampling technique - namely utterance partitioning with acoustic vector resampling (UP-AVR) - to mitigate the data imbalance problem. Briefly, the sequence order of acoustic vectors in an enrollment utterance is first randomized, which is followed by partitioning the randomized sequence into a number of segments. Each of these segments is then used to produce a GM M supervector via MAP adaptation and mean vector concatenation. The randomization and partitioning processes are repeated several times to produce a sufficient number of speaker-class supervectors for training an SVM. Experimental evaluations based on the NIST 2002 and 2004 SRE suggest that UP-AVR can reduce the error rate of GMM-SVM systems. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:119 / 130
页数:12
相关论文
共 50 条
  • [41] Multi-feature Fusion using Multi-GMM Supervector for SVM Speaker Verification
    Liu, Minghui
    Huang, Zhongwei
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4332 - 4335
  • [42] Identity authentication by sensed acoustic voices from a speaking person using an efficient GMM-SVM dual modeling framework
    Ing-Jr Ding
    Zih-Jheng Lin
    Microsystem Technologies, 2018, 24 : 3 - 8
  • [43] Robust regression fusion of GMM-UBM and GMM-SVM normalized scores using G729 bit-stream for speaker recognition over IP
    Yessad, Dalila
    Amrouche, Abderrahmane
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (01) : 43 - 51
  • [44] GMM and i-vector based speaker verification using speaker-specific-text for short utterances
    Bharathi, B.
    Nagarajan, T.
    2013 IEEE INTERNATIONAL CONFERENCE OF IEEE REGION 10 (TENCON), 2013,
  • [45] Identity authentication by sensed acoustic voices from a speaking person using an efficient GMM-SVM dual modeling framework
    Ding, Ing-Jr
    Lin, Zih-Jheng
    MICROSYSTEM TECHNOLOGIES-MICRO-AND NANOSYSTEMS-INFORMATION STORAGE AND PROCESSING SYSTEMS, 2018, 24 (01): : 3 - 8
  • [46] Evaluation of I-vector and GMM Based Speaker Verification Systems for Forensic Application
    Gumus, Fatma
    Yankayis, Mustafa
    Karabiber, Fethullah
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 617 - 620
  • [47] On the use of PCA in GMM and AR-vector models for text independent speaker verification
    de Lima, CB
    Alcaim, A
    Apolinario, JA
    DSP 2002: 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, 2002, : 595 - 598
  • [48] Study of the Effect of I-vector Modeling on Short and Mismatch Utterance Duration for Speaker Verification
    Sarkar, A. K.
    Matrouf, D.
    Bousquet, P. M.
    Bonastre, J. F.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2661 - 2664
  • [49] Improved GMM-based Speaker Verification Using SVM-Driven Impostor Dataset Selection
    McLaren, Mitchell
    Vogt, Robbie
    Baker, Brendan
    Sridharan, Sridha
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1271 - 1274
  • [50] Combination of clean and contaminated GMM/SVM for far-field text-independent speaker verification
    Zieger, Christian
    Omologo, Maurizio
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1949 - 1952