PHYSIOLOGICALLY-MOTIVATED FEATURE EXTRACTION FOR SPEAKER IDENTIFICATION

被引：0

作者：

Wang, Jianglin ^{[1
]}

Johnson, Michael T. ^{[1
]}

机构：

[1] Marquette Univ, Dept Elect & Comp Engn, Speech & Signal Proc Lab, Milwaukee, WI 53233 USA

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年

关键词：

Speaker distinctive feature; Speaker identification; Glottal source excitation and GMM-UBM; VERIFICATION; PHASE; MFCC;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper introduces the use of three physiologically-motivated features for speaker identification, Residual Phase Cepstrum Coefficients (RPCC), Glottal Flow Cepstrum Coefficients (GLFCC) and Teager Phase Cepstrum Coefficients (TPCC). These features capture speaker-discriminative characteristics from different aspects of glottal source excitation patterns. The proposed physiologically-driven features give better results with lower model complexities, and also provide complementary information that can improve overall system performance even for larger amounts of data. Results on speaker identification using the YOHO corpus demonstrate that these physiologically-driven features are both more accurate than and complementary to traditional mel-frequency cepstral coefficients (MFCC). In particular, the incorporation of the proposed glottal source features offers significant overall improvement to the robustness and accuracy of speaker identification tasks.

引用

页数：5

共 50 条

[21] Multimodal Biometrics Using Multiple Feature Representations to Speaker Identification System
Al-Hmouz, Rami
Daqrouq, Khaled
Morfeq, Ali
Pedrycz, Witold
2015 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY RESEARCH (ICTRC), 2015, : 314 - 317
[22] Physiological feature extraction for text independent speaker identification using non-uniform subband processing
Lu, Xugang
Dang, Jianwu
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 461 - +
[23] Bionic Cepstral coefficients (BCC): A new auditory feature extraction to noise-robust speaker identification
Zouhir, Youssef
Zarka, Mohamed
Ouni, Kais
APPLIED ACOUSTICS, 2024, 221
[24] Speaker-Specific Articulatory Feature Extraction Based on Knowledge Distillation for Speaker Recognition
Hong, Qian-Bei
Wu, Chung-Hsien
Wang, Hsin-Min
APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2023, 12 (02)
[25] Comparison of Speaker Adaptation Methods as Feature Extraction for SVM-Based Speaker Recognition
Ferras, Marc
Leung, Cheung-Chi
Barras, Claude
Gauvain, Jean-Luc
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1366 - 1378
[26] Audio-Visual Feature Fusion for Speaker Identification
Almaadeed, Noor
Aggoun, Amar
Amira, Abbes
NEURAL INFORMATION PROCESSING, ICONIP 2012, PT I, 2012, 7663 : 56 - 67
[27] A Feature Level Fusion Scheme for Robust Speaker Identification
Sekkate, Sara
Khalil, Mohammed
Adib, Abdellah
BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018, 2018, 872 : 289 - 300
[28] ROBUST FEATURE FRONT-END FOR SPEAKER IDENTIFICATION
Liu, Gang
Lei, Yun
Hansen, John H. L.
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4233 - 4236
[29] Evaluating Acoustic Feature Maps in 2D-CNN for Speaker Identification
Imran, Ali Shariq
Haflan, Vetle
Shahrebabaki, Abdolreza Sabzi
Olfati, Negar
Svendsen, Torbjorn Karl
ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 211 - 216
[30] Effectiveness of Feature Collaboration in Speaker Identification for Voice Biometrics
Das, Arunima
Roy, Lakshi Prosad
Das, Santos Kumar
2023 INTERNATIONAL CONFERENCE ON COMPUTER, ELECTRICAL & COMMUNICATION ENGINEERING, ICCECE, 2023,

← 1 2 3 4 5 →