COMPARISON OF TEXT-INDEPENDENT SPEAKER RECOGNITION METHODS USING VECTOR-QUANTIZATION DISTORTION AND DISCRETE AND CONTINUOUS HMMS

被引：0

作者：

MATSUI, T

FURUI, S

机构：

[1] NTT Human Interface Laboratories, Musashino

来源：

ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE | 1994年 / 77卷 / 12期

关键词：

SPEAKER RECOGNITION; TEXT-INDEPENDENT; VECTOR QUANTIZATION; ERGODIC HMM; UTTERANCE VARIATION;

D O I：

10.1002/ecjc.4430771207

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The results of speaker recognition methods using vector quantization (VQ) distortion and discrete or continuous ergodic hidden Markov models (HMMs) are compared. The effectiveness of these methods is examined from the viewpoint of robustness against utterance variation such as differences in content, temporal variation, and changes in utterance speed. It is shown that the continuous HMM performs much better than the discrete HMM and its performance is close to that of the VQ distortion method. When the amount of training data is limited, however, the VQ distortion method achieves a better recognition rate than the continuous HMM. The transition information between the states is shown to contribute little to identifying the individual characteristics of a voice. An increase in the number of states or in the number of mixture components in a state both have an equal effect, and recognition performance is almost completely determined by the product of these two numbers.

引用

页码：63 / 70

页数：8

共 50 条

[11] A new text-independent speaker identification using Vector Quantization and Multi-layer Perceptron
Keum, Ji-Soo
Park, Chan-Ho
Lee, Hyon-Soo
ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 2, PROCEEDINGS, 2006, 3972 : 165 - 171
[12] Research on text-independent speaker recognition methods using wavelet neural network
Bai, Ying
Zhao, Zhen-Dong
Qi, Yin-Cheng
Wang, Bin
Guo, Jian-Yong
Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2006, 28 (06): : 1036 - 1039
[13] Hybridization Process for Text-Independent Speaker Identification Based on Vector Quantization Model
Djeghader, Mohammed
Huang, Qin
2016 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2016, : 596 - 601
[14] Text-independent speaker verification using Support Vector Machines
Kharroubi, J
Chollet, G
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4017 - 4017
[15] Text-independent speaker verification using speaker clustering and support vector machines
Hou, FL
Wang, BX
2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 456 - 459
[16] Common vector approach and its combination with GMM for text-independent speaker recognition
Sadic, Selami
Gulmezoglu, M. Bilginer
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (09) : 11394 - 11400
[17] A Comparative Study of Text-Independent Speaker Recognition Systems Using Gaussian Mixture Modeling and i-vector Methods
Paulose, Suma
Mathew, Dominic
Thomas, Abraham
2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, INSTRUMENTATION AND CONTROL TECHNOLOGIES (ICICICT), 2017, : 444 - 448
[18] Comparison of clustering methods: A case study of text-independent speaker modeling
Kinnunen, Tomi
Sidoroff, Ilja
Tuononen, Marko
Franti, Pasi
PATTERN RECOGNITION LETTERS, 2011, 32 (13) : 1604 - 1617
[19] Text-independent Speaker Recognition Using Radial Basis Function Network
Yakovenko, Anton
Malychina, Galina
ADVANCES IN NEURAL NETWORKS - ISNN 2016, 2016, 9719 : 74 - 81
[20] Text-independent speaker recognition using probabilistic SVM with GMM adjustment
Hou, FL
Wang, BX
2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 305 - 308

← 1 2 3 4 5 →