A NOVEL I-VECTOR FRAMEWORK USING MULTIPLE FEATURES AND PCA FOR SPEAKER RECOGNITION IN SHORT SPEECH CONDITION

被引：0

作者：

Zhang, Chi ^{[1
]}

Li, Xiaoqiang ^{[1
]}

Li, Wei ^{[2
,3
]}

Lu, Peizhong ^{[2
]}

Zhang, Wenqiang ^{[2
,3
]}

机构：

[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China

[2] Fudan Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China

[3] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Shanghai, Peoples R China

来源：

PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP) | 2016年

关键词：

speaker recognition; short speech condition; PCA; i-vector; JOINT FACTOR-ANALYSIS;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Speaker recognition in short speech condition is a difficult topic because the length of training and test speech is very short. One of the main disadvantage of the existing methods for speaker recognition is that they need very sufficient data and it's usually impossible in reality applications. In our experiments, the conventional methods with single feature don't make good performance in short speech. We propose a novel i-vector framework using multiple features and Principal Component Analysis (PCA) in short speech condition to overcome this difficulty, as multiple features combination can represent more aspects of a speaker. PCA is used to map the multiple features to an uncorrelated and orthogonal basis set to meet the requirements of Gaussian Mixture Model (GMM) with diagonal covariance matrices and i-vector. Improvement from the proposed approach compared to a state-of-the-art system are of roughly 50% relative at equal error rate when evaluated on the telephone conditions from the 2010 NIST speaker recognition evaluation (SRE).

引用

页码：499 / 503

页数：5

共 50 条

[1] I-Vector Extraction Using Speaker Relevancy for Short Duration Speaker Recognition
Kang, Woo Hyun
Cho, Won Ik
Jang, Se Young
Lee, Hyeon Seung
Kim, Nam Soo
IT CONVERGENCE AND SECURITY 2017, VOL 1, 2018, 449 : 79 - 87
[2] i-vector Based Speaker Recognition on Short Utterances
Kanagasundaram, Ahilan
Vogt, Robbie
Dean, David
Sridharan, Sridha
Mason, Michael
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2352 - +
[3] Emotional speaker recognition in real life conditions using multiple descriptors and i-vector speaker modeling technique
Mansour, Asma
Chenchah, Farah
Lachiri, Zied
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (06) : 6441 - 6458
[4] Emotional speaker recognition in real life conditions using multiple descriptors and i-vector speaker modeling technique
Asma Mansour
Farah Chenchah
Zied Lachiri
Multimedia Tools and Applications, 2019, 78 : 6441 - 6458
[5] Maximum Likelihood i-vector Space Using PCA for Speaker Verification
Lei, Zhenchun
Yang, Yingchun
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2736 - 2739
[6] A NOISE ROBUST I-VECTOR EXTRACTOR USING VECTOR TAYLOR SERIES FOR SPEAKER RECOGNITION
Lei, Yun
Burget, Lukas
Scheffer, Nicolas
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6788 - 6791
[7] Speaker Adaptation Using the I-Vector Technique for Bottleneck Features
Cardinal, Patrick
Dehak, Najim
Zhang, Yu
Glass, James
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2867 - 2871
[8] I-vector Based Speaker Gender Recognition
Wang, Minghe
Chen, Ying
Tang, Zhenmin
Zhang, Erhua
2015 IEEE ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2015, : 729 - 732
[9] Performance Comparison Of Speaker Recognition Systems Using GMM and i-Vector Methods with PNCC and RASTA PLP Features
Nayana, P. K.
Mathew, Dominic
Thomas, Abraham
2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, INSTRUMENTATION AND CONTROL TECHNOLOGIES (ICICICT), 2017, : 438 - 443
[10] GENDER INDEPENDENT DISCRIMINATIVE SPEAKER RECOGNITION IN I-VECTOR SPACE
Cumani, Sandro
Glembek, Ondrej
Bruemmer, Niko
de Villiers, Edward
Laface, Pietro
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4361 - 4364

← 1 2 3 4 5 →