COMPARISON OF TEXT-INDEPENDENT SPEAKER RECOGNITION METHODS USING VECTOR-QUANTIZATION DISTORTION AND DISCRETE AND CONTINUOUS HMMS

Cited by: 0
Authors
MATSUI, T
FURUI, S
Affiliation
[1] NTT Human Interface Laboratories, Musashino
Keywords
SPEAKER RECOGNITION; TEXT-INDEPENDENT; VECTOR QUANTIZATION; ERGODIC HMM; UTTERANCE VARIATION;
DOI
10.1002/ecjc.4430771207
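Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code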
0808 ; 0809 ;
Abstract
Text-independent speaker recognition methods using vector quantization (VQ) distortion and discrete or continuous ergodic hidden Markov models (HMMs) are compared. The effectiveness of these methods is examined from the viewpoint of robustness against utterance variation such as differences in content, temporal variation, and changes in utterance speed. It is shown that the continuous HMM performs much better than the discrete HMM and that its performance is close to that of the VQ distortion method. When the amount of training data is limited, however, the VQ distortion method achieves a better recognition rate than the continuous HMM. The transition information between the states is shown to contribute little to identifying the individual characteristics of a voice. Increasing the number of states and increasing the number of mixture components per state have equivalent effects, and recognition performance is almost completely determined by the product of these two numbers.
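As a rough illustration of the VQ distortion method named in the abstract, the sketch below trains one codebook per speaker and assigns a test utterance to the speaker whose codebook gives the lowest average distortion. It is not the authors' implementation: plain k-means stands in for whatever codebook design they used, and the feature vectors, codebook size, speaker labels, and all function names (`train_codebook`, `vq_distortion`, `identify`, `synthetic_frames`) are assumptions made for illustration.

```python
import random


def sq_dist(x, y):
    """Squared Euclidean distance between two feature vectors."""
    return sum((a - b) ** 2 for a, b in zip(x, y))


def train_codebook(frames, size, n_iter=10, seed=0):
    """Build a speaker codebook with plain k-means (an assumed stand-in)."""
    rng = random.Random(seed)
    codebook = [list(f) for f in rng.sample(frames, size)]
    for _ in range(n_iter):
        cells = [[] for _ in range(size)]
        for f in frames:
            # Assign each frame to its nearest codeword.
            k = min(range(size), key=lambda j: sq_dist(f, codebook[j]))
            cells[k].append(f)
        for k, members in enumerate(cells):
            if members:  # keep the old codeword if a cell is empty
                codebook[k] = [sum(col) / len(members) for col in zip(*members)]
    return codebook


def vq_distortion(frames, codebook):
    """Average distance from each frame to its nearest codeword."""
    return sum(min(sq_dist(f, c) for c in codebook) for f in frames) / len(frames)


def identify(frames, codebooks):
    """Return the speaker label whose codebook yields the lowest distortion."""
    return min(codebooks, key=lambda name: vq_distortion(frames, codebooks[name]))


def synthetic_frames(rng, center, n, dim=4):
    """Hypothetical stand-in for real acoustic feature vectors."""
    return [[rng.gauss(center, 1.0) for _ in range(dim)] for _ in range(n)]


rng = random.Random(1)
codebooks = {
    "speaker_a": train_codebook(synthetic_frames(rng, 0.0, 200), size=8),
    "speaker_b": train_codebook(synthetic_frames(rng, 5.0, 200), size=8),
}
test_utterance = synthetic_frames(rng, 5.0, 50)
print(identify(test_utterance, codebooks))
```

Because the score averages over frames without regard to their order, this kind of model uses no temporal transition information, which is the property the abstract contrasts with the ergodic HMMs.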
Pages: 63-70
Number of pages: 8
Related papers
50 in total
  • [11] A new text-independent speaker identification using Vector Quantization and Multi-layer Perceptron
    Keum, Ji-Soo
    Park, Chan-Ho
    Lee, Hyon-Soo
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 2, PROCEEDINGS, 2006, 3972 : 165 - 171
  • [12] Research on text-independent speaker recognition methods using wavelet neural network
    Bai, Ying
    Zhao, Zhen-Dong
    Qi, Yin-Cheng
    Wang, Bin
    Guo, Jian-Yong
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2006, 28 (06): : 1036 - 1039
  • [13] Hybridization Process for Text-Independent Speaker Identification Based on Vector Quantization Model
    Djeghader, Mohammed
    Huang, Qin
    2016 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2016, : 596 - 601
  • [14] Text-independent speaker verification using Support Vector Machines
    Kharroubi, J
    Chollet, G
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS, 2001, : 4017 - 4017
  • [15] Text-independent speaker verification using speaker clustering and support vector machines
    Hou, FL
    Wang, BX
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 456 - 459
  • [16] Common vector approach and its combination with GMM for text-independent speaker recognition
    Sadic, Selami
    Gulmezoglu, M. Bilginer
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (09) : 11394 - 11400
  • [17] A Comparative Study of Text-Independent Speaker Recognition Systems Using Gaussian Mixture Modeling and i-vector Methods
    Paulose, Suma
    Mathew, Dominic
    Thomas, Abraham
    2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, INSTRUMENTATION AND CONTROL TECHNOLOGIES (ICICICT), 2017, : 444 - 448
  • [18] Comparison of clustering methods: A case study of text-independent speaker modeling
    Kinnunen, Tomi
    Sidoroff, Ilja
    Tuononen, Marko
    Franti, Pasi
    PATTERN RECOGNITION LETTERS, 2011, 32 (13) : 1604 - 1617
  • [19] Text-independent Speaker Recognition Using Radial Basis Function Network
    Yakovenko, Anton
    Malychina, Galina
    ADVANCES IN NEURAL NETWORKS - ISNN 2016, 2016, 9719 : 74 - 81
  • [20] Text-independent speaker recognition using probabilistic SVM with GMM adjustment
    Hou, FL
    Wang, BX
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 305 - 308