Variant Time-Frequency Cepstral Features for Speaker Recognition

被引:0
|
作者
Zhang, Wei-Qiang [1 ]
Deng, Yan [1 ]
He, Liang [1 ]
Liu, Jia [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
关键词
Speaker recognition (SRE); time-frequency cepstrum (TFC); IDENTIFICATION; MODELS;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In speaker recognition (SRE), the commonly used feature vector is basic ceptral coefficients concatenating with their delta and double delta cepstal features. This configuration is borrowed from speech recognition and may be not optimal for SRE. In this paper, we propose a variant time-frequency cepstral (TFC) features, which is based on our previous work for language recognition. The feature vector is obtained by performing a temporal discrete cosine transform (DCT) on the cepstrum matrix and selecting the transformed elements in a specific area with large variances. Different shapes and parameters are tested and the optimal configuration is obtained. Experimental results on the 2008 NIST speaker recognition evaluation short2 telephone-short3 telephone test set show that the proposed variant TFC is more effective than the conventional feature vectors.
引用
收藏
页码:2122 / 2125
页数:4
相关论文
共 50 条
  • [31] Contribution of Prosodic and Cepstral Features in Improvment of a Synthesized Arabic Speaker Recognition Task performance
    Zergat, Kawthar Yasmine
    Amrouche, Abderrahmane
    Taher, Montadar Abas
    Zainal, Nasharuddin
    2013 IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED 2013), 2013, : 70 - 73
  • [32] Wavelet packet cepstral analysis for speaker recognition
    Kinney, A
    Stevens, J
    THIRTY-SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS - CONFERENCE RECORD, VOLS 1 AND 2, CONFERENCE RECORD, 2002, : 206 - 209
  • [33] Speaker recognition using Gaussian Mixtures Model with LPC-CEPSTRAL and pitch features
    Acevedo, ES
    Nagai, T
    Kurematsu, A
    Nakano-Miyatake, M
    Perez-Meana, H
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING I, 2002, : 162 - 165
  • [34] Application of time-frequency analysis methods to speaker verification
    Williams, W. J.
    Bossemeyer, R. W.
    ADVANCED SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, AND IMPLEMENTATIONS XVI, 2006, 6313
  • [35] PARABOLIC FILTER MEL FREQUENCY CEPSTRAL COEFFICIENT AND FUSION OF FEATURES FOR SPEAKER AGE CLASSIFICATION
    Osman, Mohammed Muntaz
    Buyuk, Osman
    SIGMA JOURNAL OF ENGINEERING AND NATURAL SCIENCES-SIGMA MUHENDISLIK VE FEN BILIMLERI DERGISI, 2020, 38 (04): : 2177 - 2191
  • [36] Time-frequency analysis of time-variant systems
    2013, Polish Society of Technical Diagnostics (14):
  • [37] Wavelet Packet Based Mel Frequency Cepstral Features for Text Independent Speaker Identification
    Srivastava, Smriti
    Bhardwaj, Saurabh
    Bhandari, Abhishek
    Gupta, Krit
    Bahl, Hitesh
    Gupta, J. R. P.
    INTELLIGENT INFORMATICS, 2013, 182 : 237 - 247
  • [38] A method for radar emitter signal recognition based on time-frequency atom features
    Wang Xi-Qin
    Liu Jing-Yao
    Meng Hua-Dong
    Liu Yi-Min
    JOURNAL OF INFRARED AND MILLIMETER WAVES, 2011, 30 (06) : 566 - 570
  • [39] A method for radar emitter signal recognition based on time-frequency atom features
    Wang X.-Q.
    Liu J.-Y.
    Meng H.-D.
    Liu Y.-M.
    Hongwai Yu Haomibo Xuebao/Journal of Infrared and Millimeter Waves, 2011, 30 (06): : 566 - 570
  • [40] Extraction of Time-Frequency Target Features
    Oesterlein, Tobias G.
    He, Chensong
    Quijano, Jorge E.
    Campbell, Richard L., Jr.
    Zurk, Lisa M.
    Siderius, Martin
    2010 CONFERENCE RECORD OF THE FORTY FOURTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2010, : 2156 - 2163