Utterance partitioning with acoustic vector resampling for GMM-SVM speaker verification

被引:27
|
作者
Mak, Man-Wai [1 ]
Rao, Wei [1 ]
机构
[1] Hong Kong Polytech Univ, Elect & Informat Engn Dept, Ctr Signal Proc, Hong Kong, Hong Kong, Peoples R China
关键词
Speaker verification; GMM-supervectors (GSV); Utterance partitioning; GMM-SVM; Support vector machine; Random resampling; Data imbalance; MACHINES; ENSEMBLE;
D O I
10.1016/j.specom.2010.06.011
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recent research has demonstrated the merit of combining Gaussian mixture models and support vector machine (SVM) for text-independent speaker verification. However, one unaddressed issue in this GMM-SVM approach is the imbalance between the numbers of speaker-class utterances and impostor-class utterances available for training a speaker-dependent SVM. This paper proposes a resampling technique - namely utterance partitioning with acoustic vector resampling (UP-AVR) - to mitigate the data imbalance problem. Briefly, the sequence order of acoustic vectors in an enrollment utterance is first randomized, which is followed by partitioning the randomized sequence into a number of segments. Each of these segments is then used to produce a GM M supervector via MAP adaptation and mean vector concatenation. The randomization and partitioning processes are repeated several times to produce a sufficient number of speaker-class supervectors for training an SVM. Experimental evaluations based on the NIST 2002 and 2004 SRE suggest that UP-AVR can reduce the error rate of GMM-SVM systems. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:119 / 130
页数:12
相关论文
共 50 条
  • [21] Improved GMM-UBM/SVM for speaker verification
    Liu, Minghui
    Dai, Beiqian
    Xie, Yanlu
    Yao, Zhiqiang
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 925 - 928
  • [22] Effect of Relevance Factor of Maximum a posteriori Adaptation for GMM-SVM in Speaker and Language Recognition
    You, Chang Huai
    Li, Haizhou
    Ma, Bin
    Lee, Kong Aik
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2063 - 2066
  • [23] Evaluation of GMM-based Features for SVM Speaker Verification
    Liu, Minghui
    Huang, Zhongwei
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 5027 - 5030
  • [24] Vector quantization techniques for GMM based speaker verification
    Singh, G
    Panda, A
    Bhattacharyya, S
    Srikanthan, T
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 65 - 68
  • [25] Robust PCA-GMM-SVM System for Speaker Verification Task
    Zergat, Kawthar Yasmine
    Amrouche, Abderrahmane
    Asbai, Nassim
    Debyeche, Mohamed
    8TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS 2012), 2012, : 214 - 217
  • [26] Research of the speaker verification based on the SVM-GMM mixture model
    Xuan, Cui
    Bo, Deng
    Wen, Zhuang
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (ISKE 2007), 2007,
  • [27] SVM speaker verification using session variability modelling and GMM supervectors
    McLaren, M.
    Vogt, R.
    Sridharan, S.
    ADVANCES IN BIOMETRICS, PROCEEDINGS, 2007, 4642 : 1077 - +
  • [28] Support vector machines using GMM supervectors for speaker verification
    Campbell, WM
    Sturim, DE
    Reynolds, DA
    IEEE SIGNAL PROCESSING LETTERS, 2006, 13 (05) : 308 - 311
  • [29] Investigating the Effect of Data Partitioning for GMM Supervector Based Speaker Verification
    Dikici, Erinc
    Saraclar, Murat
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 464 - 469
  • [30] An Adaptive i-Vector Extraction for Speaker Verification with Short Utterance
    Poddar, Arnab
    Sahidullah, Md
    Saha, Goutam
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 326 - 332