Speaker identification using a novel combination of sparse representation and Gaussian mixture models

Cited by: 0

Authors:

Ma Yunjie [1]

Affiliations:

[1] Nanjing Univ Posts & Telecommun, Nanjing, Jiangsu, Peoples R China

Source:

AUTOMATIC CONTROL AND MECHATRONIC ENGINEERING III | 2014 / Vol. 615

Keywords:

Speaker identification; GMM; Sparse representation; Learned dictionary; K-SVD; ALGORITHM;

DOI:

10.4028/www.scientific.net/AMM.615.265

Chinese Library Classification (CLC) number:

TH [Machinery and instrument industry]

Discipline classification code:

0802

Abstract:

In recent years, sparse representation has become a popular method for pattern recognition that can outperform traditional methods. This paper presents a novel combination of sparse representation and traditional Gaussian mixture models (GMMs). Each speaker's dictionary, referred to in this paper as a subspace, is learned with the K-SVD algorithm, and its entries are formed from the union of the GMM mean matrices for that speaker. The test utterance is then projected onto each dictionary, and the decision is made according to the reconstruction errors. The experiments are conducted on a database collected in our anechoic chamber. The accuracy of the proposed approach varies with the sparsity level and the dictionary size. With appropriate parameters, the accuracy reaches 98.5%.
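The decision rule described in the abstract (project the test vector onto each speaker's dictionary and pick the smallest reconstruction error) can be sketched as follows. This is a minimal illustration, not the paper's implementation: the abstract does not name the sparse coding solver, so orthogonal matching pursuit is assumed here; the K-SVD learning step is omitted and random unit-norm dictionaries stand in for the learned ones; the names `omp` and `identify` and all sizes are hypothetical.

```python
import numpy as np

def omp(D, x, sparsity):
    """Orthogonal matching pursuit (assumed solver): approximate x
    using at most `sparsity` columns (atoms) of the dictionary D."""
    residual = x.copy()
    support = []
    sol = np.zeros(0)
    for _ in range(sparsity):
        # Pick the atom most correlated with the current residual.
        corr = np.abs(D.T @ residual)
        if support:
            corr[support] = 0.0  # do not select an atom twice
        support.append(int(np.argmax(corr)))
        # Refit all selected atoms jointly by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ sol
        if np.linalg.norm(residual) < 1e-10:
            break
    coef = np.zeros(D.shape[1])
    coef[support] = sol
    return coef

def identify(dictionaries, x, sparsity):
    """Return the index of the dictionary with the smallest
    reconstruction error, together with all errors."""
    errors = [float(np.linalg.norm(x - D @ omp(D, x, sparsity)))
              for D in dictionaries]
    return int(np.argmin(errors)), errors

# Stand-in data: 3 hypothetical speakers, 20-dim features, 30 atoms each.
rng = np.random.default_rng(0)
dictionaries = []
for _ in range(3):
    D = rng.standard_normal((20, 30))
    dictionaries.append(D / np.linalg.norm(D, axis=0))  # unit-norm atoms

# Test vector built from a single atom of speaker 1's dictionary.
x = 2.0 * dictionaries[1][:, 4]
speaker, errors = identify(dictionaries, x, sparsity=3)
print(speaker, errors)
```

In a fuller pipeline, each dictionary would be learned with K-SVD from that speaker's GMM mean vectors, and `sparsity` and the number of atoms would be the two parameters the abstract reports as affecting accuracy.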

Pages: 265-269

Page count: 5
