Utterance partitioning with acoustic vector resampling for GMM-SVM speaker verification

Cited by: 27

Authors:

Mak, Man-Wai [1]

Rao, Wei [1]

Affiliations:

[1] Hong Kong Polytech Univ, Elect & Informat Engn Dept, Ctr Signal Proc, Hong Kong, Hong Kong, Peoples R China

Source:

SPEECH COMMUNICATION | 2011 / Vol. 53 / Issue 01

Keywords:

Speaker verification; GMM-supervectors (GSV); Utterance partitioning; GMM-SVM; Support vector machine; Random resampling; Data imbalance; MACHINES; ENSEMBLE

DOI:

10.1016/j.specom.2010.06.011

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Subject classification codes:

070206; 082403

Abstract:

Recent research has demonstrated the merit of combining Gaussian mixture models (GMM) and support vector machines (SVM) for text-independent speaker verification. However, one unaddressed issue in this GMM-SVM approach is the imbalance between the numbers of speaker-class utterances and impostor-class utterances available for training a speaker-dependent SVM. This paper proposes a resampling technique, namely utterance partitioning with acoustic vector resampling (UP-AVR), to mitigate the data imbalance problem. Briefly, the sequence order of acoustic vectors in an enrollment utterance is first randomized, and the randomized sequence is then partitioned into a number of segments. Each of these segments is then used to produce a GMM supervector via MAP adaptation and mean vector concatenation. The randomization and partitioning processes are repeated several times to produce a sufficient number of speaker-class supervectors for training an SVM. Experimental evaluations based on the NIST 2002 and 2004 SRE suggest that UP-AVR can reduce the error rate of GMM-SVM systems. (C) 2010 Elsevier B.V. All rights reserved.
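The randomize-then-partition loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `up_avr_segments`, its parameters, the use of NumPy, and the toy frame matrix are all assumptions made for illustration, and the step that turns each segment into a GMM supervector (MAP adaptation and mean concatenation) is deliberately left out.

```python
import numpy as np


def up_avr_segments(frames, num_partitions, num_resamplings, seed=0):
    """Shuffle the frame order and split it into segments, several times.

    frames: array of shape (num_frames, feature_dim), one acoustic
            vector per row (hypothetical layout).
    Returns a list of num_partitions * num_resamplings arrays.
    """
    rng = np.random.default_rng(seed)
    segments = []
    for _ in range(num_resamplings):
        # Randomize the sequence order of the acoustic vectors.
        order = rng.permutation(len(frames))
        shuffled = frames[order]
        # Partition the randomized sequence into roughly equal segments.
        segments.extend(np.array_split(shuffled, num_partitions))
    return segments


# Toy stand-in for an enrollment utterance: 20 frames of 3-dim features.
frames = np.arange(20 * 3, dtype=float).reshape(20, 3)
segments = up_avr_segments(frames, num_partitions=4, num_resamplings=5)
print(len(segments))  # 4 partitions x 5 resamplings = 20 segments
```

In a full system, each returned segment would presumably be passed to the supervector extraction step, so that one enrollment utterance yields many speaker-class training vectors for the SVM instead of one.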

Pages: 119-130

Page count: 12
