Subspace-Based Feature Representation and Learning for Language Recognition

被引:0
作者
Shih, Yu-Chin [1 ]
Lee, Hung-Shin [1 ]
Wang, Hsin-Min
Jeng, Shyh-Kang [1 ]
机构
[1] Natl Taiwan Univ, Dept Elect Engn, Taipei 10764, Taiwan
来源
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年
关键词
language recognition; subspace-based learning; IDENTIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel subspace-based approach for phonotactic language recognition. The whole framework is divided into two parts: the speech feature representation and the subspace-based learning algorithm. First, the phonetic information as well as the contextual relationship, possessed by spoken utterances, are more abundantly retrieved by likelihood computation and feature concatenation through the decoding processed by an automatic speech recognizer. It is assumed that the extracted phone frames reside in a lower dimensional eigen-subspace, in which the structure of data can be approximately captured. Each utterance is further represented by a fixed-dimensional linear subspace. Second, to measure the similarity between two utterances, suitable non-Euclidean metrics are explored and applied to non-linear discriminant analysis in a kernel fashion, followed by a back-end classifier, such as the k-nearest neighbor (K-NN) classifier. The results of experiments on the OGI-TS database demonstrate that the proposed framework outperforms the well-known vector space modeling based method with relative reductions of 38.90% and 27.13% on the 1-to-50-second and 3-second data sets respectively in equal error rate (EER).
引用
收藏
页码:2059 / 2062
页数:4
相关论文
共 50 条
[21]   A subspace-based method for solving Lagrange-Sylvester interpolation problems [J].
Akcay, Hueseyin ;
Turekay, Semiha .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2007, 29 (02) :377-395
[22]   Subspace-Based Rational Interpolation of Analytic Functions From Phase Data [J].
Akcay, Hueseyin .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (03) :1069-1081
[23]   Noise effects analysis on subspace-based damage detection with neural networks [J].
Rosso, Marco Martino ;
Aloisio, Angelo ;
Melchiorre, Jonathan ;
Huo, Fei ;
Marano, Giuseppe Carlo .
STRUCTURES, 2023, 54 :23-37
[24]   Auxiliary input design for stochastic subspace-based structural damage detection [J].
Ashari, Alireza Esna ;
Mevel, Laurent .
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2013, 34 (1-2) :241-258
[25]   A Robust Equalization Feature for Language Recognition [J].
Song, Wen-Jie ;
Chen, Chen ;
Sun, Tian-Yang ;
Wang, Wei .
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2020, 36 (03) :561-576
[26]   Subspace-based predictive control of Parkinson's disease: A model-based study [J].
Ahmadipour, Mahboubeh ;
Barkhordari-Yazdi, Mojtaba ;
Seydnejad, Saeid R. .
NEURAL NETWORKS, 2021, 142 :680-689
[27]   Research on language recognition algorithm based on improved CFCC feature extraction [J].
Long H. ;
Huang Z. ;
Shao Y. ;
Du Q. ;
Su S. .
Tongxin Xuebao/Journal on Communications, 2022, 43 (12) :211-221
[28]   Parallel Absolute-Relative Feature Based Phonotactic Language Recognition [J].
Liu, Weiwei ;
Zhang, Wei-Qiang ;
Li, Zhiyi ;
Liu, Jia .
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, :59-63
[29]   Local discriminant coding based convolutional feature representation for multimodal finger recognition [J].
Li, Shuyi ;
Zhang, Bob ;
Zhao, Shuping ;
Yang, Jinfeng .
INFORMATION SCIENCES, 2021, 547 :1170-1181
[30]   Handling the Temperature Effect in Vibration Monitoring: Two Subspace-Based Analytical Approaches [J].
Basseville, Michele ;
Bourquin, Frederic ;
Mevel, Laurent ;
Nasser, Houssein ;
Treyssede, Fabien .
JOURNAL OF ENGINEERING MECHANICS, 2010, 136 (03) :367-378