PHYSIOLOGICALLY-MOTIVATED FEATURE EXTRACTION FOR SPEAKER IDENTIFICATION

被引:0
|
作者
Wang, Jianglin [1 ]
Johnson, Michael T. [1 ]
机构
[1] Marquette Univ, Dept Elect & Comp Engn, Speech & Signal Proc Lab, Milwaukee, WI 53233 USA
来源
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年
关键词
Speaker distinctive feature; Speaker identification; Glottal source excitation and GMM-UBM; VERIFICATION; PHASE; MFCC;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper introduces the use of three physiologically-motivated features for speaker identification, Residual Phase Cepstrum Coefficients (RPCC), Glottal Flow Cepstrum Coefficients (GLFCC) and Teager Phase Cepstrum Coefficients (TPCC). These features capture speaker-discriminative characteristics from different aspects of glottal source excitation patterns. The proposed physiologically-driven features give better results with lower model complexities, and also provide complementary information that can improve overall system performance even for larger amounts of data. Results on speaker identification using the YOHO corpus demonstrate that these physiologically-driven features are both more accurate than and complementary to traditional mel-frequency cepstral coefficients (MFCC). In particular, the incorporation of the proposed glottal source features offers significant overall improvement to the robustness and accuracy of speaker identification tasks.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Multimodal Biometrics Using Multiple Feature Representations to Speaker Identification System
    Al-Hmouz, Rami
    Daqrouq, Khaled
    Morfeq, Ali
    Pedrycz, Witold
    2015 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY RESEARCH (ICTRC), 2015, : 314 - 317
  • [22] Physiological feature extraction for text independent speaker identification using non-uniform subband processing
    Lu, Xugang
    Dang, Jianwu
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 461 - +
  • [23] Bionic Cepstral coefficients (BCC): A new auditory feature extraction to noise-robust speaker identification
    Zouhir, Youssef
    Zarka, Mohamed
    Ouni, Kais
    APPLIED ACOUSTICS, 2024, 221
  • [24] Speaker-Specific Articulatory Feature Extraction Based on Knowledge Distillation for Speaker Recognition
    Hong, Qian-Bei
    Wu, Chung-Hsien
    Wang, Hsin-Min
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2023, 12 (02)
  • [25] Comparison of Speaker Adaptation Methods as Feature Extraction for SVM-Based Speaker Recognition
    Ferras, Marc
    Leung, Cheung-Chi
    Barras, Claude
    Gauvain, Jean-Luc
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1366 - 1378
  • [26] Audio-Visual Feature Fusion for Speaker Identification
    Almaadeed, Noor
    Aggoun, Amar
    Amira, Abbes
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT I, 2012, 7663 : 56 - 67
  • [27] A Feature Level Fusion Scheme for Robust Speaker Identification
    Sekkate, Sara
    Khalil, Mohammed
    Adib, Abdellah
    BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018, 2018, 872 : 289 - 300
  • [28] ROBUST FEATURE FRONT-END FOR SPEAKER IDENTIFICATION
    Liu, Gang
    Lei, Yun
    Hansen, John H. L.
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4233 - 4236
  • [29] Evaluating Acoustic Feature Maps in 2D-CNN for Speaker Identification
    Imran, Ali Shariq
    Haflan, Vetle
    Shahrebabaki, Abdolreza Sabzi
    Olfati, Negar
    Svendsen, Torbjorn Karl
    ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 211 - 216
  • [30] Effectiveness of Feature Collaboration in Speaker Identification for Voice Biometrics
    Das, Arunima
    Roy, Lakshi Prosad
    Das, Santos Kumar
    2023 INTERNATIONAL CONFERENCE ON COMPUTER, ELECTRICAL & COMMUNICATION ENGINEERING, ICCECE, 2023,