A NOVEL I-VECTOR FRAMEWORK USING MULTIPLE FEATURES AND PCA FOR SPEAKER RECOGNITION IN SHORT SPEECH CONDITION

Cited by: 0

Authors:

Zhang, Chi ^{[1]}

Li, Xiaoqiang ^{[1]}

Li, Wei ^{[2,3]}

Lu, Peizhong ^{[2]}

Zhang, Wenqiang ^{[2,3]}

Affiliations:

[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China

[2] Fudan Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China

[3] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Shanghai, Peoples R China

Source:

PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP) | 2016

Keywords:

speaker recognition; short speech condition; PCA; i-vector; JOINT FACTOR-ANALYSIS;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202 ;

Abstract:

Speaker recognition under short speech conditions is difficult because both the training and the test utterances are very short. One of the main disadvantages of existing speaker recognition methods is that they need a large amount of data, which is usually unavailable in real-world applications. In our experiments, conventional methods with a single feature do not perform well on short speech. To overcome this difficulty, we propose a novel i-vector framework using multiple features and Principal Component Analysis (PCA) for short speech conditions, since a combination of multiple features can represent more aspects of a speaker. PCA is used to map the multiple features onto an uncorrelated, orthogonal basis set, meeting the requirements of the Gaussian Mixture Model (GMM) with diagonal covariance matrices and of the i-vector. The proposed approach improves on a state-of-the-art system by roughly 50% relative in equal error rate when evaluated on the telephone conditions of the 2010 NIST Speaker Recognition Evaluation (SRE).
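
The PCA step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which features are combined, so `feat_a` and `feat_b` below are hypothetical, randomly generated per-frame feature streams, and the function names `fit_pca` and `apply_pca` are assumptions made for illustration. The sketch only shows that projecting concatenated, correlated features onto the eigenvectors of their covariance matrix gives decorrelated coordinates, which is the property a diagonal-covariance GMM relies on.

```python
import numpy as np


def fit_pca(frames):
    """Estimate a PCA basis from frames of shape (n_frames, dim).

    Returns the feature mean and an orthonormal basis whose columns are
    the covariance eigenvectors, sorted by decreasing variance.
    """
    mean = frames.mean(axis=0)
    cov = np.cov(frames - mean, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigh: cov is symmetric
    order = np.argsort(eigvals)[::-1]       # largest variance first
    return mean, eigvecs[:, order]


def apply_pca(frames, mean, basis, n_components=None):
    """Project frames onto the first n_components basis vectors (all if None)."""
    return (frames - mean) @ basis[:, :n_components]


# Hypothetical data: two correlated per-frame feature streams.
rng = np.random.default_rng(0)
feat_a = rng.normal(size=(2000, 4))
mix = rng.normal(size=(4, 3))
feat_b = feat_a @ mix + 0.1 * rng.normal(size=(2000, 3))

# Concatenate the streams frame by frame, then decorrelate with PCA.
stacked = np.hstack([feat_a, feat_b])
mean, basis = fit_pca(stacked)
projected = apply_pca(stacked, mean, basis)

# After projection the covariance is (numerically) diagonal.
proj_cov = np.cov(projected, rowvar=False)
off_diag = proj_cov - np.diag(np.diag(proj_cov))
print("largest off-diagonal covariance:", np.abs(off_diag).max())
```

In a real system the basis would presumably be estimated on development data and then applied unchanged to enrollment and test utterances; passing a smaller `n_components` would additionally reduce the dimensionality.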


Pages: 499-503

Page count: 5

50 items in total

[21] PERFORMANCE OF I-VECTOR SPEAKER VERIFICATION AND THE DETECTION OF SYNTHETIC SPEECH
McClanahan, Richard D.
Stewart, Bryan
De Leon, Phillip L.
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014
[22] ONLINE SPEAKER DIARIZATION USING ADAPTED I-VECTOR TRANSFORMS
Zhu, Weizhong
Pelecanos, Jason
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016: 5045 - 5049
[23] I-vector Transformation Using a Novel Discriminative Denoising Autoencoder for Noise-robust Speaker Recognition
Mahto, Shivangi
Yamamoto, Hitoshi
Koshinaka, Takafumi
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017: 3722 - 3726
[24] I-vector based speaker recognition using advanced channel compensation techniques
Kanagasundaram, Ahilan
Dean, David
Sridharan, Sridha
McLaren, Mitchell
Vogt, Robbie
COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01): 121 - 140
[25] A Comparison of Covariance Matrix and i-vector Based Speaker Recognition
Jakovljevic, Niksa
Jokic, Ivan
Josic, Slobodan
Delic, Vlado
SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 37 - 45
[26] ADDITIVE NOISE COMPENSATION IN THE I-VECTOR SPACE FOR SPEAKER RECOGNITION
Ben Kheder, Waad
Matrouf, Driss
Bonastre, Jean-Francois
Ajili, Moez
Bousquet, Pierre-Michel
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015: 4190 - 4194
[27] Classification of Cognitive Load from Speech using an i-vector Framework
Van Segbroeck, Maarten
Travadi, Ruchir
Vaz, Colin
Kim, Jangwon
Black, Matthew P.
Potamianos, Alexandros
Narayanan, Shrikanth S.
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014: 751 - 755
[28] Analysis of I-vector Length Normalization in Speaker Recognition Systems
Garcia-Romero, Daniel
Espy-Wilson, Carol Y.
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011: 256 - 259
[29] An Adaptive i-Vector Extraction for Speaker Verification with Short Utterance
Poddar, Arnab
Sahidullah, Md
Saha, Goutam
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 326 - 332
[30] Tied Variational Autoencoder Backends for i-Vector Speaker Recognition
Villalba, Jesus
Brummer, Niko
Dehak, Najim
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017: 1004 - 1008
