PHYSIOLOGICALLY-MOTIVATED FEATURE EXTRACTION FOR SPEAKER IDENTIFICATION

被引:0
|
作者
Wang, Jianglin [1 ]
Johnson, Michael T. [1 ]
机构
[1] Marquette Univ, Dept Elect & Comp Engn, Speech & Signal Proc Lab, Milwaukee, WI 53233 USA
来源
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年
关键词
Speaker distinctive feature; Speaker identification; Glottal source excitation and GMM-UBM; VERIFICATION; PHASE; MFCC;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper introduces the use of three physiologically-motivated features for speaker identification, Residual Phase Cepstrum Coefficients (RPCC), Glottal Flow Cepstrum Coefficients (GLFCC) and Teager Phase Cepstrum Coefficients (TPCC). These features capture speaker-discriminative characteristics from different aspects of glottal source excitation patterns. The proposed physiologically-driven features give better results with lower model complexities, and also provide complementary information that can improve overall system performance even for larger amounts of data. Results on speaker identification using the YOHO corpus demonstrate that these physiologically-driven features are both more accurate than and complementary to traditional mel-frequency cepstral coefficients (MFCC). In particular, the incorporation of the proposed glottal source features offers significant overall improvement to the robustness and accuracy of speaker identification tasks.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Speaker Identification Using MFCC Feature Extraction ANN Classification Technique
    Singh, Mahesh K.
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 136 (01) : 453 - 467
  • [2] A Hybrid GRU-CNN Feature Extraction Technique for Speaker Identification
    Shihab, Md Shazzad Hossain
    Aditya, Shuvra
    Setu, Jahangir Hossain
    Imtiaz-Ud-Din, K. M.
    Efat, Md Iftekharul Alam
    2020 23RD INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2020), 2020,
  • [3] Speaker Identification based on MFSC voice feature extraction using Transformer
    Bao, Liao
    Zuo, Yi
    2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 1 - 7
  • [4] Speaker Identification based on Hybrid Feature Extraction Techniques
    Abualadas, Feras E.
    Zeki, Akram M.
    Al-Ani, Muzhir Shaban
    Messikh, Az-Eddine
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (03) : 322 - 327
  • [5] Robust speaker modeling using perceptually motivated feature
    Abdulla, Waleed H.
    PATTERN RECOGNITION LETTERS, 2007, 28 (11) : 1333 - 1342
  • [6] Speaker identification features extraction methods: A systematic review
    Tirumala, Sreenivas Sremath
    Shahamiri, Seyed Reza
    Garhwal, Abhimanyu Singh
    Wang, Ruili
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 90 : 250 - 271
  • [7] A new approach to designing a feature extractor in speaker identification based on discriminative feature extraction
    Miyajima, C
    Watanabe, H
    Tokuda, K
    Kitamura, T
    Katagiri, S
    SPEECH COMMUNICATION, 2001, 35 (3-4) : 203 - 218
  • [8] A network model of speaker identification with new feature extraction methods and BLSTM
    Wang, Xingmei
    Xue, Fuzhao
    Wang, Wei
    Liu, Anhua
    NEUROCOMPUTING, 2020, 403 (403) : 167 - 181
  • [9] Further feature extraction for speaker recognition
    Ma, ZY
    Yang, YC
    Wu, ZH
    2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4153 - 4158
  • [10] On the Impact of Deep Learning and Feature Extraction for Arabic Audio Classification and Speaker Identification
    Shahriar, Sakib
    Dara, Rozita
    Hayawi, Kadhim
    2022 IEEE/ACS 19TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2022,