Variant Time-Frequency Cepstral Features for Speaker Recognition

Cited by: 0

Authors:

Zhang, Wei-Qiang [1]

Deng, Yan [1]

He, Liang [1]

Liu, Jia [1]

Affiliations:

[1] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China

Source:

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010

Keywords:

Speaker recognition (SRE); time-frequency cepstrum (TFC); IDENTIFICATION; MODELS;

DOI:

Not available

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

In speaker recognition (SRE), the commonly used feature vector consists of basic cepstral coefficients concatenated with their delta and double-delta cepstral features. This configuration is borrowed from speech recognition and may not be optimal for SRE. In this paper, we propose variant time-frequency cepstral (TFC) features, which are based on our previous work on language recognition. The feature vector is obtained by performing a temporal discrete cosine transform (DCT) on the cepstrum matrix and selecting the transformed elements in a specific area with large variances. Different shapes and parameters are tested and the optimal configuration is obtained. Experimental results on the 2008 NIST speaker recognition evaluation short2 telephone-short3 telephone test set show that the proposed variant TFC is more effective than the conventional feature vectors.
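The extraction step described in the abstract (a temporal DCT over a cepstrum matrix, followed by selection of a region of the transformed elements) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names, the context width of 8 frames, the 4 retained DCT coefficients, and the triangular selection region. The abstract states only that several shapes and parameters were tested.

```python
import numpy as np


def dct_matrix(n_points, n_coeffs):
    """Orthonormal DCT-II basis of shape (n_coeffs, n_points)."""
    k = np.arange(n_coeffs)[:, None]
    n = np.arange(n_points)[None, :]
    basis = np.cos(np.pi * (n + 0.5) * k / n_points)
    basis[0] *= np.sqrt(1.0 / n_points)
    basis[1:] *= np.sqrt(2.0 / n_points)
    return basis


def triangular_mask(n_ceps, n_dct):
    """Hypothetical selection region: keep element (i, j) of the
    time-frequency matrix when it lies strictly below the line joining
    (n_ceps, 0) and (0, n_dct). Integer arithmetic avoids rounding issues."""
    i = np.arange(n_ceps)[:, None]
    j = np.arange(n_dct)[None, :]
    return i * n_dct + j * n_ceps < n_ceps * n_dct


def tfc_features(cepstra, context=8, n_dct=4, mask=None):
    """Sketch of TFC-style features.

    cepstra: array of shape (n_frames, n_ceps), one cepstral vector per frame.
    Returns an array of shape (n_frames - context + 1, number of selected
    elements), one feature vector per full context window.
    """
    n_frames, n_ceps = cepstra.shape
    if mask is None:
        mask = triangular_mask(n_ceps, n_dct)
    basis = dct_matrix(context, n_dct)  # (n_dct, context)
    n_windows = n_frames - context + 1
    # Each window is a cepstrum matrix of shape (n_ceps, context).
    windows = np.stack([cepstra[t:t + context].T for t in range(n_windows)])
    # DCT along the time axis: (n_windows, n_ceps, n_dct).
    transformed = windows @ basis.T
    # Keep only the elements inside the selected region.
    return transformed[:, mask]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    demo = rng.standard_normal((20, 4))  # synthetic stand-in for real cepstra
    print(tfc_features(demo).shape)
```

In practice the region would be chosen from the variances of the transformed elements measured on training data, as the abstract indicates; the fixed triangle above only stands in for that selection step.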


Pages: 2122-2125

Number of pages: 4
