Speaker identification using a novel combination of sparse representation and Gaussian mixture models

Cited by: 0
Author
Ma Yunjie [1]
Affiliation
[1] Nanjing Univ Posts & Telecommun, Nanjing, Jiangsu, Peoples R China
Source
AUTOMATIC CONTROL AND MECHATRONIC ENGINEERING III | 2014 / Vol. 615
Keywords
Speaker identification; GMM; Sparse representation; Learned dictionary; K-SVD; ALGORITHM;
DOI
10.4028/www.scientific.net/AMM.615.265
Chinese Library Classification number
TH [Machinery and instrument industry];
Discipline classification code
0802;
Abstract
In recent years, sparse representation has become a popular method for pattern recognition that can outperform traditional methods. This paper presents a novel combination of sparse representation and traditional Gaussian mixture models (GMMs). Each speaker's dictionary, termed a subspace in this paper, is learned using the K-SVD algorithm, with the entries being the union of the GMM mean matrices for each speaker. A test utterance is then projected onto each dictionary, and the decision is made according to the reconstruction errors. The experiments are conducted on a database collected in our anechoic chamber. The accuracy of the proposed approach varies with the sparsity and the dictionary size. With appropriate parameters, the accuracy can reach 98.5%.
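The decision rule described in the abstract (code the test vector against each speaker's dictionary and pick the speaker with the smallest reconstruction error) can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the abstract does not name the sparse coding solver, so plain greedy matching pursuit is used as a stand-in; the dictionaries are tiny hand-written toy atoms rather than K-SVD dictionaries learned from GMM means; and the names `reconstruction_error`, `identify`, `speaker_a`, and `speaker_b` are hypothetical.

```python
import math


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def normalize(v):
    """Scale a vector to unit Euclidean norm."""
    n = math.sqrt(dot(v, v))
    return [x / n for x in v]


def reconstruction_error(x, atoms, sparsity):
    """Approximate x with `sparsity` greedy atom picks (matching pursuit,
    an assumed stand-in solver) and return the norm of the residual."""
    residual = list(x)
    for _ in range(sparsity):
        # Pick the unit-norm atom most correlated with the current residual.
        best = max(atoms, key=lambda d: abs(dot(residual, d)))
        coef = dot(residual, best)
        # Remove that atom's contribution from the residual.
        residual = [r - coef * d for r, d in zip(residual, best)]
    return math.sqrt(dot(residual, residual))


def identify(x, dictionaries, sparsity=2):
    """Return the speaker whose dictionary reconstructs x with the smallest
    error, together with the per-speaker errors."""
    errors = {
        speaker: reconstruction_error(x, atoms, sparsity)
        for speaker, atoms in dictionaries.items()
    }
    return min(errors, key=errors.get), errors


# Toy dictionaries (hypothetical data, not learned with K-SVD).
dictionaries = {
    "speaker_a": [normalize(v) for v in ([1, 0, 0, 0], [0, 1, 0, 0])],
    "speaker_b": [normalize(v) for v in ([0, 0, 1, 0], [0, 0, 1, 1])],
}

# Toy test vector lying mostly in the span of speaker_a's atoms.
test_vector = [0.9, 0.4, 0.1, 0.0]
speaker, errors = identify(test_vector, dictionaries)
print(speaker, errors)
```

In this toy setup the `sparsity` argument plays the role of the sparsity parameter mentioned in the abstract, and the number of atoms per speaker plays the role of the dictionary size.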
Pages: 265-269
Page count: 5