Variant Time-Frequency Cepstral Features for Speaker Recognition

被引:0
|
作者
Zhang, Wei-Qiang [1 ]
Deng, Yan [1 ]
He, Liang [1 ]
Liu, Jia [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
关键词
Speaker recognition (SRE); time-frequency cepstrum (TFC); IDENTIFICATION; MODELS;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In speaker recognition (SRE), the commonly used feature vector is basic ceptral coefficients concatenating with their delta and double delta cepstal features. This configuration is borrowed from speech recognition and may be not optimal for SRE. In this paper, we propose a variant time-frequency cepstral (TFC) features, which is based on our previous work for language recognition. The feature vector is obtained by performing a temporal discrete cosine transform (DCT) on the cepstrum matrix and selecting the transformed elements in a specific area with large variances. Different shapes and parameters are tested and the optimal configuration is obtained. Experimental results on the 2008 NIST speaker recognition evaluation short2 telephone-short3 telephone test set show that the proposed variant TFC is more effective than the conventional feature vectors.
引用
收藏
页码:2122 / 2125
页数:4
相关论文
共 50 条
  • [1] Time-Frequency Cepstral Features and Combining Discriminative Training for Phonotactic Language Recognition
    Deng, Yan
    Zhang, Wei-Qiang
    Qian, Yan-Min
    Liu, Jia
    JOURNAL OF COMPUTERS, 2011, 6 (02) : 178 - 183
  • [2] Time-Frequency Cepstral Features and Heteroscedastic Linear Discriminant Analysis for Language Recognition
    Zhang, Wei-Qiang
    He, Liang
    Deng, Yan
    Liu, Jia
    Johnson, Michael T.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02): : 266 - 276
  • [3] Application of new qualitative voicing time-frequency features for speaker recognition
    Ben Aloui, Nidhal
    Glotin, Herve
    Hebrard, Patrick
    ADVANCES IN BIOMETRICS, PROCEEDINGS, 2007, 4642 : 1154 - +
  • [4] Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition
    Jokic, Ivan D.
    Jokic, Stevan D.
    Delic, Vlado D.
    Peric, Zoran H.
    2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 419 - 424
  • [5] Time-frequency representation based cepstral processing for speech recognition
    Fineberg, AB
    Yu, KC
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 25 - 28
  • [6] Time frequency features for automatic speaker recognition
    Shahrood University of Technology, Faculty of Electrical Engineering and Robotics, Shahrood, Iran
    WSEAS Trans. Commun., 2006, 12 (2148-2154):
  • [7] Reducing the environmental sensitivity of cepstral features for speaker recognition
    Openshaw, JP
    Mason, JS
    ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1996, : 721 - 724
  • [8] Filter bank Based Cepstral Features for Speaker Recognition
    Chougule, Sharada V.
    Chavan, Mahesh S.
    Gaikwad, M. S.
    2014 IEEE GLOBAL CONFERENCE ON WIRELESS COMPUTING AND NETWORKING (GCWCN), 2014, : 102 - 106
  • [9] Robust speaker recognition using binary time-frequency masks
    Shao, Yang
    Wang, DeLiang
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 645 - 648
  • [10] On local time-frequency features of speech and their employment in speaker verification
    Nickel, RM
    Williams, WJ
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2000, 337 (04): : 469 - 481