Utterance partitioning with acoustic vector resampling for GMM-SVM speaker verification

被引:27
|
作者
Mak, Man-Wai [1 ]
Rao, Wei [1 ]
机构
[1] Hong Kong Polytech Univ, Elect & Informat Engn Dept, Ctr Signal Proc, Hong Kong, Hong Kong, Peoples R China
关键词
Speaker verification; GMM-supervectors (GSV); Utterance partitioning; GMM-SVM; Support vector machine; Random resampling; Data imbalance; MACHINES; ENSEMBLE;
D O I
10.1016/j.specom.2010.06.011
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recent research has demonstrated the merit of combining Gaussian mixture models and support vector machine (SVM) for text-independent speaker verification. However, one unaddressed issue in this GMM-SVM approach is the imbalance between the numbers of speaker-class utterances and impostor-class utterances available for training a speaker-dependent SVM. This paper proposes a resampling technique - namely utterance partitioning with acoustic vector resampling (UP-AVR) - to mitigate the data imbalance problem. Briefly, the sequence order of acoustic vectors in an enrollment utterance is first randomized, which is followed by partitioning the randomized sequence into a number of segments. Each of these segments is then used to produce a GM M supervector via MAP adaptation and mean vector concatenation. The randomization and partitioning processes are repeated several times to produce a sufficient number of speaker-class supervectors for training an SVM. Experimental evaluations based on the NIST 2002 and 2004 SRE suggest that UP-AVR can reduce the error rate of GMM-SVM systems. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:119 / 130
页数:12
相关论文
共 50 条
  • [1] SIGNIFICANCE OF UTTERANCE PARTITIONING IN GMM-SVM BASED SPEAKER VERIFICATION IN VARYING BACKGROUND ENVIRONMENT
    Sarkar, Sourjya
    Rao, K. Sreenivasa
    2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [2] MiniVectors: an Improved GMM-SVM Approach for Speaker Verification
    Anguera, Xavier
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2323 - 2326
  • [3] Combining Deep Speaker Specific Representations with GMM-SVM for Speaker Verification
    Price, Ryan
    Biswas, Sangeeta
    Shinoda, Koichi
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2787 - 2791
  • [4] Client dependent GMM-SVM models for speaker verification
    Le, Q
    Bengio, S
    ARTIFICIAL NEURAL NETWORKS AND NEURAL INFORMATION PROCESSING - ICAN/ICONIP 2003, 2003, 2714 : 443 - 451
  • [5] Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework
    Nirmalya Sen
    Md Sahidullah
    Hemant A. Patil
    Shyamal Kumar Das Mandal
    Krothapalli Sreenivasa Rao
    Tapan Kumar Basu
    International Journal of Speech Technology, 2021, 24 : 1067 - 1088
  • [6] Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework
    Sen, Nirmalya
    Sahidullah, Md
    Patil, Hemant A.
    Das Mandal, Shyamal Kumar
    Rao, Krothapalli Sreenivasa
    Basu, Tapan Kumar
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (04) : 1067 - 1088
  • [7] A hybrid GMM-SVM speaker identification system
    Mashao, DJ
    2004 IEEE AFRICON: 7TH AFRICON CONFERENCE IN AFRICA, VOLS 1 AND 2: TECHNOLOGY INNOVATION, 2004, : 319 - 322
  • [8] A hybrid system based on GMM-SVM for Speaker Identification
    Chakroun, Rania
    Zouari, Leila Beltaifa
    Frikha, Mondher
    Ben Hamida, Ahmed
    2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 654 - 658
  • [9] GMM-SVM Fingerprint Verification Based on Minutiae Only
    Topcu, Berkay
    Isik, Yusuf Ziya
    Erdogan, Hakan
    PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 223 - 228
  • [10] Acoustic Vector Resampling for GMMSVM-Based Speaker Verification
    Mak, Man-Wai
    Rao, Wei
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1449 - 1452