Variant Time-Frequency Cepstral Features for Speaker Recognition

Cited by: 0
Authors
Zhang, Wei-Qiang [1 ]
Deng, Yan [1 ]
He, Liang [1 ]
Liu, Jia [1 ]
Affiliation
[1] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
Keywords
Speaker recognition (SRE); time-frequency cepstrum (TFC); IDENTIFICATION; MODELS;
DOI
Not available
Chinese Library Classification
TM [Electrical technology]; TN [Electronic and communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract
In speaker recognition (SRE), the commonly used feature vector consists of basic cepstral coefficients concatenated with their delta and double-delta cepstral features. This configuration is borrowed from speech recognition and may not be optimal for SRE. In this paper, we propose variant time-frequency cepstral (TFC) features, which are based on our previous work on language recognition. The feature vector is obtained by performing a temporal discrete cosine transform (DCT) on the cepstrum matrix and selecting the transformed elements in a specific area with large variances. Different shapes and parameters are tested and the optimal configuration is obtained. Experimental results on the 2008 NIST speaker recognition evaluation short2 telephone-short3 telephone test set show that the proposed variant TFC is more effective than the conventional feature vectors.
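The extraction step described in the abstract (a temporal DCT over a block of cepstral frames, followed by selection of a region of the transformed matrix) can be illustrated with a minimal sketch. The abstract does not give the selection shape, the context length, or the number of coefficients, so the triangular region, the `context` parameter, and the function names `dct_ii`, `triangular_mask`, and `tfc_features` below are all assumptions made for illustration, not the authors' configuration.

```python
import math


def dct_ii(x):
    """Orthonormal DCT-II of a 1-D sequence (pure Python, O(n^2))."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[t] * math.cos(math.pi * (t + 0.5) * k / n) for t in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out


def triangular_mask(n_ceps, context):
    """Hypothetical selection region (an assumption, not from the paper):
    keep index pairs (c, k) with c / n_ceps + k / context < 1, i.e. low
    cepstral index c combined with low temporal-DCT index k."""
    return [(c, k)
            for c in range(n_ceps)
            for k in range(context)
            if c * context + k * n_ceps < n_ceps * context]


def tfc_features(cepstra, context=5, mask=None):
    """Return one feature vector per block of `context` consecutive frames.

    cepstra: list of frames, each a list of n_ceps cepstral coefficients.
    """
    n_ceps = len(cepstra[0])
    if mask is None:
        mask = triangular_mask(n_ceps, context)
    features = []
    for start in range(len(cepstra) - context + 1):
        block = cepstra[start:start + context]  # context x n_ceps
        # Temporal DCT: transform each cepstral trajectory along the time axis.
        transformed = [dct_ii([frame[c] for frame in block])
                       for c in range(n_ceps)]  # n_ceps x context
        # Keep only the elements inside the selected region.
        features.append([transformed[c][k] for (c, k) in mask])
    return features
```

In this sketch the mask is fixed in advance. A variance-driven variant, closer to the abstract's wording about areas with large variances, would estimate the variance of each transformed element on training data and keep the highest-variance positions.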
Pages: 2122-2125
Number of pages: 4