基于鉴别性i-vector局部距离保持映射的说话人识别

被引：11

作者：

栗志意

何亮

张卫强

刘加

机构：

[1] 清华大学电子工程系,清华信息科学与技术国家实验室

来源：

清华大学学报(自然科学版) | 2012年 / 52卷 / 05期

关键词：

流形学习; i-vector; 鉴别性; 局部距离保持映射; 说话人识别;

D O I：

10.16511/j.cnki.qhdxxb.2012.05.005

中图分类号：

TN912.34 [语音识别与设备];

学科分类号：

0711 ;

摘要：

为了进一步提高i-vector说话人识别系统的性能,该文提出了一种鉴别性i-vector局部距离保持映射(discriminant i-vector local distance preserving projection,DIVLDPP)的流形学习算法。该算法以i-vector间的Euclid距离作为度量准则,并以最小化同类点间距离同时最大化异类近邻点间距离的鉴别性准则作为优化目标函数,利用求解广义特征值的方法,得到最终的投影映射矩阵。在美国国家标准技术局2008年说话人识别核心数据集上的实验结果表明:该算法可以明显提高目前i-vector说话人识别系统的性能。

引用

页码：598 / 601

页数：4

共 11 条

[1]

Speaker verification using adapted Gaussian mixture models. Reynolds D A,Quatieri T F,Dunn R B. Digital Signal Processing . 2000

[2]

NIST Speaker Recognition Evaluation. http://www.itl.nist.gov/iad/mig/tests/sre/2008/index.html .

[3]

Speaker andsession variability in GMM-based speaker verification. Patrick Kenny,Boulianne G,Ouellet P,et al. IEEE Transactions on Audio,Speech and LanguageProcessing . 2007

[4]

A study ofinter-speaker variability in speaker verification. Kenny P,Ouellet P,Dehak N,et al. IEEE Transactions on Audio,Speech and LanguageProcessing . 2008

[5]

Locality preserving projections. He Xiaofei,Niyogi Partha. Advances in Neural Information Processing Systems 16 . 2004

[6] 集合分类中的鉴别式局部信息距离保持映射 [J].

何亮 ;

栗志意 ;

蔡猛 ;

刘加 .

清华大学学报(自然科学版), 2011, 51 (07) :1010-1016

[7]

Front-End Factor Analysis for speaker verification. N. Dehak,P. J. Kenny,R. Dehak,P. Dumouchel,P. Ouellet. IEEE Trans. Audio, Speech and Languge Processing . 2011

[8]

An overview of text-independent speakerrecognition:From features to supervectors. Kinnunen T,Li H. Space Communications . 2010

[9]

矩阵分析与应用[M]. 清华大学出版社 , 张贤达著, 2004

[10]

The EM algorithm for mixtures of factor analyzers. Ghahramani Z,Hinton GE. Technical Report CRG-TR-96-1 . 1996

← 1 2 →