Variant Time-Frequency Cepstral Features for Speaker Recognition

被引：0

作者：

Zhang, Wei-Qiang ^{[1
]}

Deng, Yan ^{[1
]}

He, Liang ^{[1
]}

Liu, Jia ^{[1
]}

机构：

[1] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China

来源：

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010年

关键词：

Speaker recognition (SRE); time-frequency cepstrum (TFC); IDENTIFICATION; MODELS;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In speaker recognition (SRE), the commonly used feature vector is basic ceptral coefficients concatenating with their delta and double delta cepstal features. This configuration is borrowed from speech recognition and may be not optimal for SRE. In this paper, we propose a variant time-frequency cepstral (TFC) features, which is based on our previous work for language recognition. The feature vector is obtained by performing a temporal discrete cosine transform (DCT) on the cepstrum matrix and selecting the transformed elements in a specific area with large variances. Different shapes and parameters are tested and the optimal configuration is obtained. Experimental results on the 2008 NIST speaker recognition evaluation short2 telephone-short3 telephone test set show that the proposed variant TFC is more effective than the conventional feature vectors.

引用

页码：2122 / 2125

页数：4

共 50 条

[1] Time-Frequency Cepstral Features and Combining Discriminative Training for Phonotactic Language Recognition
Deng, Yan
Zhang, Wei-Qiang
Qian, Yan-Min
Liu, Jia
JOURNAL OF COMPUTERS, 2011, 6 (02) : 178 - 183
[2] Time-Frequency Cepstral Features and Heteroscedastic Linear Discriminant Analysis for Language Recognition
Zhang, Wei-Qiang
He, Liang
Deng, Yan
Liu, Jia
Johnson, Michael T.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02): : 266 - 276
[3] Application of new qualitative voicing time-frequency features for speaker recognition
Ben Aloui, Nidhal
Glotin, Herve
Hebrard, Patrick
ADVANCES IN BIOMETRICS, PROCEEDINGS, 2007, 4642 : 1154 - +
[4] Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition
Jokic, Ivan D.
Jokic, Stevan D.
Delic, Vlado D.
Peric, Zoran H.
2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 419 - 424
[5] Time-frequency representation based cepstral processing for speech recognition
Fineberg, AB
Yu, KC
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 25 - 28
[6] Time frequency features for automatic speaker recognition
Shahrood University of Technology, Faculty of Electrical Engineering and Robotics, Shahrood, Iran
WSEAS Trans. Commun., 2006, 12 (2148-2154):
[7] Reducing the environmental sensitivity of cepstral features for speaker recognition
Openshaw, JP
Mason, JS
ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1996, : 721 - 724
[8] Filter bank Based Cepstral Features for Speaker Recognition
Chougule, Sharada V.
Chavan, Mahesh S.
Gaikwad, M. S.
2014 IEEE GLOBAL CONFERENCE ON WIRELESS COMPUTING AND NETWORKING (GCWCN), 2014, : 102 - 106
[9] Robust speaker recognition using binary time-frequency masks
Shao, Yang
Wang, DeLiang
2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 645 - 648
[10] On local time-frequency features of speech and their employment in speaker verification
Nickel, RM
Williams, WJ
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2000, 337 (04): : 469 - 481

← 1 2 3 4 5 →