A Speaker Identification System using MFCC Features with VQ Technique

Cited by: 22
Authors
Zulfiqar, Ali [1]
Muhammad, Aslam [2]
Enriquez A M, Martinez [3]
Affiliations
[1] UoG, Dept CS & IT, Gujrat, Pakistan
[2] UET, Dept CS & E, Lahore, Pakistan
[3] CINVESTAV, IPN, Dept CS, Ciudad De Mexico, DF, Mexico
Keywords
RECOGNITION;
DOI
10.1109/IITA.2009.420
CLC number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
The performance of speaker identification systems has improved due to recent advances in speech processing techniques, but there is still a need for improvement in terms of text-independent speaker identification and suitable modelling techniques for voice feature vectors. It becomes difficult for a person to recognize a voice when uncontrollable noise is added to it. In this paper, feature vectors are extracted from speech using Mel-Frequency Cepstral Coefficients (MFCC), and the Vector Quantization (VQ) technique is implemented through the Linde-Buzo-Gray (LBG) algorithm. Two purpose-built speech databases with added noise, recorded at sampling frequencies of 8000 Hz and 11025 Hz, are used to check the accuracy of the developed speaker identification system in non-ideal conditions. Experiments on the databases also show that the number of vectors in the VQ codebook and the sampling frequency significantly influence the identification accuracy.
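The abstract names the pipeline but not its details, so the following is only a minimal sketch of how a Linde-Buzo-Gray codebook might be trained and used for closed-set identification. It assumes MFCC vectors have already been extracted into NumPy arrays of shape (frames, coefficients). The function names (`lbg_codebook`, `distortion`, `identify`), the additive split offset `eps`, and the fixed iteration count `n_iter` are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def lbg_codebook(features, size, eps=0.01, n_iter=20):
    """Train a VQ codebook by LBG binary splitting (hypothetical helper).

    features: (n_frames, n_coeffs) array, e.g. precomputed MFCC vectors.
    size: target number of codewords; assumed to be a power of two.
    """
    # Start from a single codeword: the centroid of all training vectors.
    codebook = features.mean(axis=0, keepdims=True)
    while codebook.shape[0] < size:
        # Split every codeword into two slightly offset copies.
        codebook = np.vstack([codebook + eps, codebook - eps])
        for _ in range(n_iter):
            # Assign each vector to its nearest codeword (Euclidean distance).
            dists = np.linalg.norm(
                features[:, None, :] - codebook[None, :, :], axis=2
            )
            nearest = dists.argmin(axis=1)
            # Move each codeword to the centroid of its assigned vectors.
            for k in range(codebook.shape[0]):
                members = features[nearest == k]
                if len(members):
                    codebook[k] = members.mean(axis=0)
    return codebook

def distortion(features, codebook):
    """Average distance from each vector to its nearest codeword."""
    dists = np.linalg.norm(
        features[:, None, :] - codebook[None, :, :], axis=2
    )
    return dists.min(axis=1).mean()

def identify(features, codebooks):
    """Return the enrolled speaker whose codebook gives the lowest distortion."""
    return min(codebooks, key=lambda name: distortion(features, codebooks[name]))
```

In this sketch, one codebook is trained per enrolled speaker, and an unknown utterance is attributed to the speaker with the smallest average quantization distortion. The `size` argument corresponds to the number of vectors in the VQ codebook, which the abstract reports as a factor in identification accuracy.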
Pages: 115 / +
Page count: 2
Related papers
50 in total
  • [41] Unsupervised speaker segmentation with residual phase and MFCC features
    Jothilakshmi, S.
    Ramalingam, V.
    Palanivel, S.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (06) : 9799 - 9804
  • [42] LDA combination of pitch and MFCC features in speaker recognition
    Harrag, A
    Mohamadi, T
    Serignat, JF
    INDICON 2005 Proceedings, 2005, : 237 - 240
  • [43] Speaker discriminative weighting method for VQ-based speaker identification
    Kinnunen, T
    Fränti, P
    AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2001, 2091 : 150 - 156
  • [44] Fusion of TEO Phase with MFCC Features for Speaker Verification
    Agrawal, Purvi
    Patil, Hemant A.
    PERCEPTION AND MACHINE INTELLIGENCE, 2015, 2015, : 161 - 166
  • [45] A Comparative Study on Speaker Gender Identification Using MFCC and Statistical Learning Methods
    Xiao, Hanguang
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSAIT 2013), 2014, 255 : 715 - 723
  • [46] Arabic Speaker Identification System using Combination of DWT and LPC Features
    Shah, Shahid Munir
    Ahsan, Syed Nadeem
    2014 INTERNATIONAL CONFERENCE ON OPEN SOURCE SYSTEMS AND TECHNOLOGIES (ICOSST), 2014, : 176 - 181
  • [47] STUDY OF FUSION STRATEGIES AND EXPLOITING THE COMBINATION OF MFCC AND PNCC FEATURES FOR ROBUST BIOMETRIC SPEAKER IDENTIFICATION
    Al-Kaltakchi, M. T. S.
    Woo, W. L.
    Dlay, S. S.
    Chambers, J. A.
    2016 4TH INTERNATIONAL WORKSHOP ON BIOMETRICS AND FORENSICS (IWBF), 2016,
  • [48] Real-Time Speaker Identification System using Cepstral Features
    Barik, Monalisha
    Sarangi, Susanta Kumar
    Sahu, Sushanta Kumar
    2016 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION CONTROL AND INTELLIGENT SYSTEMS (CCIS), 2016, : 89 - 93
  • [49] Closed-set speaker identification using VQ and GMM based models
    Bidhan Barai
    Tapas Chakraborty
    Nibaran Das
    Subhadip Basu
    Mita Nasipuri
    International Journal of Speech Technology, 2022, 25 : 173 - 196
  • [50] Speaker Recognition using MFCC, shifted MFCC with Vector Quantization and Fuzzy
    Bansal, Priyanka
    Imam, Syed Akhtar
    Bharti, Roma
    2015 INTERNATIONAL CONFERENCE ON SOFT COMPUTING TECHNIQUES AND IMPLEMENTATIONS (ICSCTI), 2015,