Self-Organizing Mixture Models for Text-Independent Speaker Identification

被引：0

作者：

Bouziane, Ayoub ^{[1
]}

Kharroubi, Jamal ^{[1
]}

Zarghili, Arsalane ^{[1
]}

机构：

[1] Univ Sidi Mohamed Ben Abdellah, Fac Sci & Technol, Lab Intelligent Syst & Applicat, Fes, Morocco

来源：

2014 THIRD IEEE INTERNATIONAL COLLOQUIUM IN INFORMATION SCIENCE AND TECHNOLOGY (CIST'14) | 2014年

关键词：

Speaker Recognition System; Speaker Identification; Gaussian Mixture Model (GMM); Self-Organizing Mixture Models; Speaker Modeling; Mel-frequency Cepstral Coefficients (MFCC); RECOGNITION; BANDWIDTH;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Over the past several years, The Mel-Frequency Cepstral Coefficients (MFCCs) and Gaussian mixture models (GMMs) using the well-known EM algorithm have become the state-of-the-art approach in text-independent speaker recognition applications. However, in recent few years, Self-Organizing Mixture Models which combines the strengths of Self-Organizing Maps and Mixture Models have been proposed in the literature and yielded better results than the classical GMM training in many applications. In this paper, firstly, the implementation and the comparison of the most popular MFCCs variants are done in order to find the best implementation for our speaker identification system. Then, The Self-Organizing Mixture Models are introduced for speaker modeling in text-independent speaker identification. The performance of the Self-Organizing Mixture Models is assessed and compared with the classical Gaussian mixture models using the EM algorithm.

引用

页码：345 / 350

页数：6

共 20 条

[1]

[Anonymous], 2001, DISCRETE TIME SPEECH

[2]

[Anonymous], 2005, Technical report

[3]

Beigi H, 2011, FUNDAMENTALS OF SPEAKER RECOGNITION, P1, DOI 10.1007/978-0-387-77592-0

[4]

Cheng S.-S., 2007, 2007 IEEE WORKSH MAC, P378

[5] Model-Based Clustering by Probabilistic Self-Organizing Maps [J].

Cheng, Shih-Sian ;

Fu, Hsin-Chia ;

Wang, Hsin-Min .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (05) :805-826

[6] COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].

DAVIS, SB ;

MERMELSTEIN, P .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366

[7]

Ganchev T, 2011, SPRINGERBRIEF SPEECH, P1, DOI 10.1007/978-1-4419-8447-0

[8] SELF-ORGANIZED FORMATION OF TOPOLOGICALLY CORRECT FEATURE MAPS [J].

KOHONEN, T .

BIOLOGICAL CYBERNETICS, 1982, 43 (01) :59-69

[9]

Kohonen T., 2001, INFORM SCIENCES

[10]

Muller C., 2007, SPEAKER CLASSIFICATI

← 1 2 →