Subspace-Based Feature Representation and Learning for Language Recognition

被引:0
作者
Shih, Yu-Chin [1 ]
Lee, Hung-Shin [1 ]
Wang, Hsin-Min
Jeng, Shyh-Kang [1 ]
机构
[1] Natl Taiwan Univ, Dept Elect Engn, Taipei 10764, Taiwan
来源
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年
关键词
language recognition; subspace-based learning; IDENTIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel subspace-based approach for phonotactic language recognition. The whole framework is divided into two parts: the speech feature representation and the subspace-based learning algorithm. First, the phonetic information as well as the contextual relationship, possessed by spoken utterances, are more abundantly retrieved by likelihood computation and feature concatenation through the decoding processed by an automatic speech recognizer. It is assumed that the extracted phone frames reside in a lower dimensional eigen-subspace, in which the structure of data can be approximately captured. Each utterance is further represented by a fixed-dimensional linear subspace. Second, to measure the similarity between two utterances, suitable non-Euclidean metrics are explored and applied to non-linear discriminant analysis in a kernel fashion, followed by a back-end classifier, such as the k-nearest neighbor (K-NN) classifier. The results of experiments on the OGI-TS database demonstrate that the proposed framework outperforms the well-known vector space modeling based method with relative reductions of 38.90% and 27.13% on the 1-to-50-second and 3-second data sets respectively in equal error rate (EER).
引用
收藏
页码:2059 / 2062
页数:4
相关论文
共 50 条
[31]   Subspace-based identification of power transformer models from frequency response data [J].
Akçay, H ;
Islam, SM ;
Ninness, B .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 1999, 48 (03) :700-704
[32]   Subspace-based noise covariance estimation for Kalman filter in virtual sensing applications [J].
Gres, Szymon ;
Dohler, Michael ;
Dertimanis, Vasilis K. ;
Chatzi, Eleni N. .
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2025, 222
[33]   Aircarft Signal Feature Extraction and Recognition Based on Deep Learning [J].
Wang, Guanhua ;
Zou, Cong ;
Zhang, Chao ;
Pan, Changyong ;
Song, Jian ;
Yang, Fang .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (09) :9625-9634
[34]   A human ear recognition method using nonlinear curvelet feature subspace [J].
Basit, A. ;
Shoaib, M. .
INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2014, 91 (03) :616-624
[35]   Exemplar-Based Sparse Representation for Language Recognition on I-Vectors [J].
Jiang, Bing ;
Song, Yan ;
Guo, Wu ;
Dai, LiRong .
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, :2055-2058
[36]   Weighted Data-Driven Fault Detection and Isolation: A Subspace-Based Approach and Algorithms [J].
Chen, Zhaoxu ;
Fang, Huajing ;
Chang, Yang .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2016, 63 (05) :3290-3298
[37]   A SAMPLE AND FEATURE SELECTION SCHEME FOR GMM-SVM BASED LANGUAGE RECOGNITION [J].
Song, Yan ;
Dai, Li-Rong .
2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, :326-329
[38]   Theoretical and Experimental Identification of Cantilever Beam With Clearances Using Statistical and Subspace-Based Methods [J].
Li, Bing ;
Han, Luofeng ;
Jin, Wei ;
Quan, Shuanglu .
JOURNAL OF COMPUTATIONAL AND NONLINEAR DYNAMICS, 2016, 11 (03)
[39]   Subspace-based linear multi-step predictors in type 1 diabetes mellitus [J].
Cescon, Marzia ;
Johansson, Rolf ;
Renard, Eric .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2015, 22 :99-110
[40]   Differentiation similarities in fractional pseudo-state space representations and the subspace-based methods [J].
Malti, Rachid ;
Thomassin, Magalie .
FRACTIONAL CALCULUS AND APPLIED ANALYSIS, 2013, 16 (01) :273-287