On the Determination of Optimal Model Order for GMM-Based Text-Independent Speaker Identification

Cited by: 0
Authors
MF Abu El-Yazeed
MA El Gamal
MMH El Ayadi
Affiliations
[1] Cairo University,Department of Electronics & Communications Engineering, Faculty of Engineering
[2] Cairo University,Department of Engineering Physics & Mathematics, Faculty of Engineering
Source
EURASIP Journal on Advances in Signal Processing, vol. 2004
Keywords
Gaussian mixture model; goodness of fit; minimum description length; Akaike information criterion; speaker identification; text-independent speaker identification
DOI
Not available
Abstract
Gaussian mixture models (GMMs) have recently been employed as a robust technique for speaker identification. Determining the number of Gaussian components a model needs to represent a speaker adequately is a crucial but difficult problem. This number is in fact speaker dependent, so assuming a fixed number of Gaussian components for all speakers is not justified. In this paper, we develop a procedure for roughly estimating the maximum possible model order, above which the estimation of model parameters becomes unreliable. In addition, a theoretical goodness of fit (GOF) measure is derived and used to estimate the number of Gaussian components needed to characterize different speakers. The estimation is carried out by exploiting the distribution of the training data for each speaker. Experimental results indicate that the proposed technique yields results comparable to those of other well-known model selection criteria, such as the minimum description length (MDL) and the Akaike information criterion (AIC).
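
The abstract names AIC and MDL as the baseline criteria for choosing the number of Gaussian components. The following is a minimal sketch of how such a per-speaker selection could look, not the authors' procedure: the GOF measure itself is not defined in this record and is not reproduced. It assumes diagonal-covariance GMMs and the common forms AIC = -2 log L + 2k and MDL = -log L + (k/2) log N. The function names, the feature dimension, the frame count, and the log-likelihood values are all hypothetical and chosen only for illustration.

```python
import math


def gmm_num_params(order, dim):
    """Free parameters of a diagonal-covariance GMM (assumed model form):
    order*dim means, order*dim variances, and order-1 independent weights."""
    return 2 * order * dim + (order - 1)


def aic(log_lik, k):
    """Akaike information criterion: -2 log L + 2k (lower is better)."""
    return -2.0 * log_lik + 2.0 * k


def mdl(log_lik, k, n):
    """Two-part MDL score: -log L + (k/2) log N (lower is better)."""
    return -log_lik + 0.5 * k * math.log(n)


def select_order(log_liks, dim, n, criterion):
    """Return (best_order, scores) for one speaker.

    log_liks maps each candidate model order to the total training-data
    log-likelihood of a GMM of that order, fitted elsewhere (e.g. by EM).
    """
    scores = {}
    for order, log_lik in log_liks.items():
        k = gmm_num_params(order, dim)
        if criterion == "aic":
            scores[order] = aic(log_lik, k)
        elif criterion == "mdl":
            scores[order] = mdl(log_lik, k, n)
        else:
            raise ValueError("criterion must be 'aic' or 'mdl'")
    best_order = min(scores, key=scores.get)
    return best_order, scores


# Hypothetical values for a single speaker (not taken from the paper).
EXAMPLE_LOG_LIKS = {4: -52000.0, 8: -50500.0, 16: -49900.0, 32: -49700.0}
FEATURE_DIM = 12    # assumed feature-vector dimension
NUM_FRAMES = 2000   # assumed number of training frames

best_aic, aic_scores = select_order(EXAMPLE_LOG_LIKS, FEATURE_DIM, NUM_FRAMES, "aic")
best_mdl, mdl_scores = select_order(EXAMPLE_LOG_LIKS, FEATURE_DIM, NUM_FRAMES, "mdl")
print("AIC selects order", best_aic)
print("MDL selects order", best_mdl)
```

Running the selection separately for each speaker gives a speaker-dependent order, in line with the abstract's argument against a single fixed order. Because the MDL penalty grows with log N, it generally favors smaller orders than AIC once the training set has more than a handful of frames.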
Related papers
50 in total
  • [1] On the determination of optimal model order for GMM-based text-independent speaker identification
    Abu El-Yazeed, MF
    El Gamal, MA
    El Ayadi, MMH
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (08) : 1078 - 1087
  • [2] Improving Text-independent Speaker Recognition with GMM
    Chakroun, Rania
    Zouari, Leila Beltaifa
    Frikha, Mondher
    Ben Hamida, Ahmed
    2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 693 - 696
  • [3] Text-Independent Speaker Identification Using the Histogram Transform Model
    Ma, Zhanyu
    Yu, Hong
    Tan, Zheng-Hua
    Guo, Jun
    IEEE ACCESS, 2016, 4 : 9733 - 9739
  • [4] DISTRIBUTED AUTOMATIC TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GMM-UBM SPEAKER MODELS
    Chowdhury, Md Foezur Rahman
    Selouani, Sid-Ahmed
    O'Shaughnessy, Douglas
    2009 IEEE 22ND CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1 AND 2, 2009, : 1039 - +
  • [5] Automatic, Text-Independent, Speaker Identification and Verification System Using Mel Cepstrum and GMM
    Al Marashli, Ahmad
    Al Dakkak, Oumayma
    2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 657 - +
  • [6] A Critical Comparison between GMM Classifier and Polynomial Classifier for Text-Independent Speaker Identification
    Sen, Nirmalya
    Basu, T. K.
    FRONTIERS IN COMPUTER EDUCATION, 2012, 133 : 545 - +
  • [7] A GMM-Based Speaker Identification System on FPGA
    Kan, Phak Len Eh
    Allen, Tim
    Quigley, Steven F.
    RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2010, 5992 : 358 - 363
  • [8] Text-independent speaker identification based on deep Gaussian correlation supervector
    Sun, Linhui
    Gu, Ting
    Xie, Keli
    Chen, Jia
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (02) : 449 - 457
  • [9] Text-independent speaker identification based on deep Gaussian correlation supervector
    Linhui Sun
    Ting Gu
    Keli Xie
    Jia Chen
    International Journal of Speech Technology, 2019, 22 : 449 - 457
  • [10] Text-independent speaker identification in noisy background
    Zhou, Y
    Xu, BL
    PROGRESS IN NATURAL SCIENCE, 2001, 11 : S384 - S387