Multilingual Speaker Identification by Combining Evidence from LPR and Multitaper MFCC

Cited by: 2
Authors
Nagaraja, B. [1]
Jayanna, H. [1]
Affiliation
[1] Siddaganga Inst Technol, Dept Informat Sci & Engn, Tumkur 572103, Karnataka, India
Keywords
Speaker identification; mel-frequency cepstral coefficients; multitaper mel-frequency cepstral coefficients; multilingual; linear prediction residual; linear prediction residual phase
DOI
10.1515/jisys-2013-0038
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work demonstrates the benefit of combining evidence from multitaper mel-frequency cepstral coefficient (MFCC), linear prediction residual (LPR), and linear prediction residual phase (LPRP) features for multilingual speaker identification under limited-data conditions. The LPR is derived from linear prediction analysis, and the LPRP is obtained by dividing the LPR by its Hilbert envelope. Sine-weighted cepstrum estimators (SWCE) with six tapers are used for multitaper MFCC feature extraction. A Gaussian mixture model-universal background model (GMM-UBM) models each speaker separately for each type of evidence. The evidence is then combined at the scoring level to improve performance. Monolingual, crosslingual, and multilingual speaker identification experiments were conducted using 30 randomly selected speakers from the IITG multivariability speaker recognition database. The experimental results show that the combined evidence improves performance by nearly 8-10% compared with the individual evidence.
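The feature pipeline summarized in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The LP order of 10, the 320-sample synthetic frame, the fusion weights, and every function name are assumptions made for illustration. The multitaper step uses six sine tapers with uniform weights as a simplification and does not reproduce the SWCE weighting. The per-speaker scores are made-up numbers standing in for GMM-UBM log-likelihoods.

```python
import numpy as np

def lp_residual(frame, order=10):
    """LP residual via the autocorrelation method (order=10 is an assumed value)."""
    n = len(frame)
    # Autocorrelation at lags 0..order
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    # Toeplitz normal equations R a = r[1:], lightly regularized for stability
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R + 1e-9 * r[0] * np.eye(order), r[1:])
    # Predict each sample from the previous `order` samples
    pred = np.zeros(n)
    for k in range(order):
        pred[k + 1:] += a[k] * frame[:n - k - 1]
    return frame - pred

def hilbert_envelope(x):
    """Magnitude of the analytic signal, built by zeroing negative frequencies."""
    n = len(x)
    h = np.zeros(n)
    h[0] = 1.0
    if n % 2 == 0:
        h[n // 2] = 1.0
        h[1:n // 2] = 2.0
    else:
        h[1:(n + 1) // 2] = 2.0
    return np.abs(np.fft.ifft(np.fft.fft(x) * h))

def lp_residual_phase(residual, eps=1e-12):
    """LPRP: the LP residual divided by its Hilbert envelope."""
    return residual / (hilbert_envelope(residual) + eps)

def sine_tapers(n, k=6):
    """k orthonormal sine tapers of length n."""
    t = np.arange(1, n + 1)
    return np.array([np.sqrt(2.0 / (n + 1)) * np.sin(np.pi * p * t / (n + 1))
                     for p in range(1, k + 1)])

def multitaper_spectrum(frame, k=6):
    """Average of k tapered periodograms (uniform weights, not SWCE weights)."""
    tapered = sine_tapers(len(frame), k) * frame
    return (np.abs(np.fft.rfft(tapered, axis=1)) ** 2).mean(axis=0)

def fuse_scores(stream_scores, weights):
    """Weighted sum of per-speaker scores across feature streams."""
    fused = np.asarray(weights, dtype=float) @ np.asarray(stream_scores, dtype=float)
    return int(np.argmax(fused)), fused

# Synthetic frame: a sinusoid plus a little noise (stand-in for a speech frame)
rng = np.random.default_rng(0)
frame = np.sin(2 * np.pi * 0.05 * np.arange(320)) + 0.1 * rng.standard_normal(320)

res = lp_residual(frame)
phase = lp_residual_phase(res)
spec = multitaper_spectrum(frame)

# Rows: hypothetical scores from the MFCC, LPR, and LPRP models for 3 speakers
best, fused = fuse_scores([[-41.0, -39.5, -44.0],
                           [-12.0, -12.5, -11.0],
                           [-8.0, -7.0, -9.5]], [0.6, 0.2, 0.2])
```

In a full system, the multitaper spectrum would pass through a mel filterbank, logarithm, and DCT to give MFCCs, and the fusion weights would be tuned on held-out data rather than fixed by hand.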
Pages: 241-251
Number of pages: 11
Related papers
50 in total
  • [21] Noise Robust Speaker Identification by Dividing MFCC
    Matsumoto, Kizuki
    Hayasaka, Noboru
    Iiguni, Youji
    2014 6TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING (ISCCSP), 2014, : 652 - 655
  • [22] Speaker Recognition by Combining MFCC and Phase Information in Noisy Conditions
    Wang, Longbiao
    Minami, Kazue
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (09): : 2397 - 2406
  • [23] Speaker gender recognition based on combining the contribution of MFCC and pitch features
    Engineering Lab on Intelligent Perception for Internet of Things, Shenzhen Graduate School, Peking University, Shenzhen 518055, Guangdong, China
    Huazhong Ligong Daxue Xuebao, 2013, SUPPL.I (108-111+120):
  • [24] ANALYZING NOISE ROBUSTNESS OF MFCC AND GFCC FEATURES IN SPEAKER IDENTIFICATION
    Zhao, Xiaojia
    Wang, DeLiang
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7204 - 7208
  • [25] Text independent speaker identification in multilingual environments
    Luengo, Iker
    Navas, Eva
    Sainz, Inaki
    Saratxaga, Ibon
    Sanchez, Jon
    Odriozola, Igor
    Hernaez, Inma
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1814 - 1817
  • [26] Improved MFCC-Based Feature for Robust Speaker Identification
    吴尊敬
    曹志刚
    Tsinghua Science and Technology, 2005, (02) : 158 - 161
  • [27] Speaker identification using multilingual phone strings
    Jin, Q
    Schultz, T
    Waibel, A
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 145 - 148
  • [28] Efficient Window for Monolingual and Crosslingual Speaker Identification using MFCC
    Nagaraja, B. G.
    Jayanna, H. S.
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2013,
  • [29] Speaker identification based on combination of MFCC and UMRT based features
    Antony, Anett
    Gopikakumari, R.
    8TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2018), 2018, 143 : 250 - 257
  • [30] A Speaker Identification System using MFCC Features with VQ Technique
    Zulfiqar, Ali
    Muhammad, Aslam
    Enriquez A M, Martinez
    2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 3, PROCEEDINGS, 2009, : 115 - +