Multilingual Speaker Identification by Combining Evidence from LPR and Multitaper MFCC

Cited by: 2
Authors
Nagaraja, B. [1]
Jayanna, H. [1]
Affiliations
[1] Siddaganga Inst Technol, Dept Informat Sci & Engn, Tumkur 572103, Karnataka, India
Keywords
Speaker identification; mel-frequency cepstral coefficients; multitaper mel-frequency cepstral coefficients; multilingual; linear prediction residual; linear prediction residual phase
DOI
10.1515/jisys-2013-0038
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work demonstrates the benefit of combining evidence from multitaper mel-frequency cepstral coefficients (MFCC), the linear prediction residual (LPR), and the linear prediction residual phase (LPRP) for multilingual speaker identification under limited-data conditions. The LPR is derived from linear prediction analysis, and the LPRP is obtained by dividing the LPR by its Hilbert envelope. Multitaper MFCC features are extracted using sine-weighted cepstrum estimators (SWCE) with six tapers. A Gaussian mixture model-universal background model (GMM-UBM) models each speaker separately for each type of evidence, and the evidence is then combined at the scoring level to improve performance. Monolingual, crosslingual, and multilingual speaker identification experiments were conducted using 30 randomly selected speakers from the IITG multivariability speaker recognition database. The experimental results show that the combined evidence improves performance by nearly 8-10% compared with individual evidence.
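The LPR and LPRP definitions in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes the autocorrelation method with a Levinson-Durbin recursion for the linear prediction step and a DFT-based analytic signal for the Hilbert envelope. The function names, the LP order of 10, the 8 kHz sampling rate, and the 160-sample synthetic frame are all illustrative assumptions that do not come from the record; pre-emphasis and windowing are omitted.

```python
import cmath
import math
import random


def autocorrelation(x, max_lag):
    """Autocorrelation r[0..max_lag] of a real-valued frame."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]


def lpc(x, order):
    """Predictor coefficients a[1..order] via Levinson-Durbin,
    so that x[n] is approximated by sum_k a[k] * x[n - k]."""
    r = autocorrelation(x, order)
    a = [0.0] * (order + 1)  # a[0] is unused
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # degenerate frame (e.g. all zeros): stop early
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        prev = a[:]
        a[i] = k
        for j in range(1, i):
            a[j] = prev[j] - k * prev[i - j]
        err *= 1.0 - k * k
    return a[1:]


def lp_residual(x, coeffs):
    """LPR: e[n] = x[n] - sum_k a[k] * x[n - k], with zeros before the frame."""
    residual = []
    for n in range(len(x)):
        pred = sum(c * x[n - 1 - m] for m, c in enumerate(coeffs) if n - 1 - m >= 0)
        residual.append(x[n] - pred)
    return residual


def hilbert_envelope(x):
    """Magnitude of the analytic signal, using a naive O(N^2) DFT."""
    n = len(x)
    spec = [sum(x[t] * cmath.exp(-2j * math.pi * f * t / n) for t in range(n))
            for f in range(n)]
    # Keep DC (and Nyquist for even n), double positive bins, zero negative bins.
    weight = [0.0] * n
    weight[0] = 1.0
    if n % 2 == 0:
        weight[n // 2] = 1.0
    for f in range(1, (n + 1) // 2):
        weight[f] = 2.0
    analytic = [sum(spec[f] * weight[f] * cmath.exp(2j * math.pi * f * t / n)
                    for f in range(n)) / n for t in range(n)]
    return [abs(z) for z in analytic]


def lp_residual_phase(residual, eps=1e-12):
    """LPRP: the residual divided sample-by-sample by its Hilbert envelope."""
    envelope = hilbert_envelope(residual)
    return [r / max(e, eps) for r, e in zip(residual, envelope)]


# Illustrative usage on a synthetic frame (assumed: 8 kHz, 160 samples, order 10).
rng = random.Random(0)
fs = 8000
frame = [math.sin(2 * math.pi * 200 * n / fs)
         + 0.5 * math.sin(2 * math.pi * 450 * n / fs)
         + 0.01 * rng.gauss(0.0, 1.0) for n in range(160)]
coeffs = lpc(frame, 10)
residual = lp_residual(frame, coeffs)
phase = lp_residual_phase(residual)
print(len(residual), len(phase))
```

Because the envelope is the magnitude of the analytic signal whose real part is the residual, the resulting LPRP samples stay within [-1, 1] up to rounding error. For real speech frames, a library FFT would replace the naive DFT used here for self-containment.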
Pages: 241-251
Page count: 11