PHYSIOLOGICALLY-MOTIVATED FEATURE EXTRACTION FOR SPEAKER IDENTIFICATION

Cited by: 0
Authors
Wang, Jianglin [1 ]
Johnson, Michael T. [1 ]
Affiliations
[1] Marquette Univ, Dept Elect & Comp Engn, Speech & Signal Proc Lab, Milwaukee, WI 53233 USA
Source
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014
Keywords
Speaker distinctive feature; Speaker identification; Glottal source excitation and GMM-UBM; VERIFICATION; PHASE; MFCC;
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
This paper introduces three physiologically-motivated features for speaker identification: Residual Phase Cepstrum Coefficients (RPCC), Glottal Flow Cepstrum Coefficients (GLFCC), and Teager Phase Cepstrum Coefficients (TPCC). These features capture speaker-discriminative characteristics from different aspects of glottal source excitation patterns. The proposed physiologically-driven features give better results at lower model complexities, and they also provide complementary information that can improve overall system performance even with larger amounts of data. Speaker identification results on the YOHO corpus demonstrate that these features are both more accurate than and complementary to traditional mel-frequency cepstral coefficients (MFCC). In particular, incorporating the proposed glottal source features significantly improves the overall robustness and accuracy of speaker identification.
Pages: 5
Related papers
50 in total
  • [31] A Review on Feature Extraction for Speaker Recognition under Degraded Conditions
    Disken, Gokay
    Tufekci, Zekeriya
    Saribulut, Lutfu
    Cevik, Ulus
    IETE TECHNICAL REVIEW, 2017, 34 (03) : 321 - 332
  • [32] Analysis, Feature Extraction, Modeling and Testing Techniques for Speaker Recognition
    Jayanna, H. S.
    Prasanna, S. R. Mahadeva
    IETE TECHNICAL REVIEW, 2009, 26 (03) : 181 - 190
  • [33] Identification of Speaker from Disguised Voice Using MFCC Feature Extraction, Chi-Square and Classification Technique
    Singh, Mahesh K.
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 138 (02) : 973 - 987
  • [34] A Modified MFCC Feature Extraction Technique For Robust Speaker Recognition
    Sharma, Diksha
    Ali, Israj
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 1052 - 1057
  • [35] A dynamic feature extraction based on wavelet transforms for speaker recognition
    Me Chunrong
    Zhang Jianhuan
    Long Fei
    ICEMI 2007: PROCEEDINGS OF 2007 8TH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOL I, 2007, : 595 - 598
  • [36] Speaker Identification in Total Variability Space
    Li, Qiang
    Zhao, Mingbing
    Feng, Yong
    FRONTIERS OF MANUFACTURING SCIENCE AND MEASURING TECHNOLOGY III, PTS 1-3, 2013, 401 : 1489 - +
  • [37] Simplification of I-Vector Extraction for Speaker Identification
    XU Longting
    YANG Zhen
    SUN Linhui
    Chinese Journal of Electronics, 2016, 25 (06) : 1121 - 1126
  • [39] Text-independent speaker identification based on selection of the most similar feature vectors
    Soleymanpour M.
    Marvi H.
    Soleymanpour, Mohammad (Soleimanpour141@gmail.com), 1600, Springer Science and Business Media, LLC (20): : 99 - 108
  • [40] Text-Independent Speaker Identification Through Feature Fusion and Deep Neural Network
    Jahangir, Rashid
    Teh, Ying Wah
    Memon, Nisar Ahmed
    Mujtaba, Ghulam
    Zareei, Mahdi
    Ishtiaq, Uzair
    Akhtar, Muhammad Zaheer
    Ali, Ihsan
    IEEE ACCESS, 2020, 8 : 32187 - 32202