Selecting Phonotactic Features for Language Recognition

被引:0
作者
Tong, Rong [1 ,2 ]
Ma, Bin [1 ]
Li, Haizhou [1 ,2 ]
Chng, Eng Siong [2 ]
机构
[1] ASTAR, Human Language Technol Dept, Inst Infocomm Res, Singapore 138632, Singapore
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
来源
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 | 2010年
关键词
feature selection; spoken language recognition; support vector machine; separation margin; Chi-Squared test;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies feature selection in phonotactic language recognition. The phonotactic feature is presented by n-gram statistics derived from one or more phone recognizers in the form of high dimensional feature vectors. Two feature selection strategies are proposed to select the n-gram statistics for reducing the dimension of feature vectors, so that higher order n-gram features can be adopted in language recognition. With the proposed feature selection techniques, we achieved equal error rates (EERs) of 1.84% with 4-gram statistics on the 2007 NIST Language Recognition Evaluation 30s closed test sets.
引用
收藏
页码:737 / +
页数:2
相关论文
共 9 条
[1]  
[Anonymous], 1997, ICML
[2]  
Gauvain J. L., 2004, P ICSLP, P1283
[3]  
Lander T., 1995, EUR 1995, P895
[4]  
Matejka P., 2005, INTERSPEECH, P2237
[5]   An introduction to kernel-based learning algorithms [J].
Müller, KR ;
Mika, S ;
Rätsch, G ;
Tsuda, K ;
Schölkopf, B .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2001, 12 (02) :181-201
[6]   Language recognition with discriminative keyword selection [J].
Richardson, F. S. ;
Campbell, W. M. .
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :4145-4148
[7]  
Schwarz P, 2006, INT CONF ACOUST SPEE, P325
[8]   A Target-Oriented Phonotactic Front-End for Spoken Language Recognition [J].
Tong, Rong ;
Ma, Bin ;
Li, Haizhou ;
Chng, Eng Siong .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (07) :1335-1347
[9]  
Torres-Carrasquillo P., 2008, INT 2008, P718