Developments of Machine Learning Schemes for Dynamic Time-Wrapping-Based Speech Recognition

被引:7
作者
Ding, Ing-Jr [1 ]
Yen, Chih-Ta [1 ]
Hsu, Yen-Ming [1 ]
机构
[1] Natl Formosa Univ, Dept Elect Engn, Huwei Township 632, Yunlin County, Taiwan
关键词
SPEAKER ADAPTATION;
D O I
10.1155/2013/542680
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper presents a machine learning scheme for dynamic time-wrapping-based (DTW) speech recognition. Two categories of learning strategies, supervised and unsupervised, were developed for DTW. Two supervised learning methods, incremental learning and priority-rejection learning, were proposed in this study. The incremental learning method is conceptually simple but still suffers from a large database of keywords for matching the testing template. The priority-rejection learning method can effectively reduce the matching time with a slight decrease in recognition accuracy. Regarding the unsupervised learning category, an automatic learning approach, called "most-matching learning," which is based on priority-rejection learning, was developed in this study. Most-matching learning can be used to intelligently choose the appropriate utterances for system learning. The effectiveness and efficiency of all three proposed machine-learning approaches for DTW were demonstrated using keyword speech recognition experiments.
引用
收藏
页数:10
相关论文
共 15 条
[1]   PARTIAL SEQUENCE MATCHING USING AN UNBOUNDED DYNAMIC TIME WARPING ALGORITHM [J].
Anguera, Xavier ;
Macrae, Robert ;
Oliver, Nuria .
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, :3582-3585
[2]  
Chen Q., 2012, P INT C INF SCI SIGN
[3]  
Gaikwad S., 2010, INT J COMPUT APPL, V10, P16
[4]   Applications of support vector machines to speech recognition [J].
Ganapathiraju, A ;
Hamaker, JE ;
Picone, J .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (08) :2348-2355
[5]   State-of-the-Art Intelligent Mechatronics in Human-Machine Interaction [J].
Harashima, Fumio ;
Suzuki, Satoshi .
IEEE INDUSTRIAL ELECTRONICS MAGAZINE, 2010, 4 (02) :9-13
[6]  
Huang LF, 2012, PROC INT CONF ANTI
[7]   Rapid speaker adaptation in eigenvoice space [J].
Kuhn, R ;
Junqua, JC ;
Nguyen, P ;
Niedzielski, N .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (06) :695-707
[8]   A STUDY ON SPEAKER ADAPTATION OF THE PARAMETERS OF CONTINUOUS DENSITY HIDDEN MARKOV-MODELS [J].
LEE, CH ;
LIN, CH ;
JUANG, BH .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (04) :806-814
[9]   MAXIMUM-LIKELIHOOD LINEAR-REGRESSION FOR SPEAKER ADAPTATION OF CONTINUOUS DENSITY HIDDEN MARKOV-MODELS [J].
LEGGETTER, CJ ;
WOODLAND, PC .
COMPUTER SPEECH AND LANGUAGE, 1995, 9 (02) :171-185
[10]  
Lin Y.-S., 2010, P INT C COMP APPL SY, V9, pV418