Robust vocabulary recognition clustering model using an average estimator least mean square filter in noisy environments

被引:6
作者
Ahn, Chan-Shik [1 ]
Oh, Sang-Yeob [2 ]
机构
[1] Seoul Metro Rapid Transit Media Co Ltd, Seoul, South Korea
[2] Gachon Univ, Dept Interact Media, Songnam 461701, Gyeonggi Do, South Korea
关键词
AELMS filter; Clustering model; Noise estimation; Gaussian state model; END-POINT DETECTION; SPEECH SYNTHESIS;
D O I
10.1007/s00779-013-0732-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Noise estimation and detection algorithms must adapt to a changing environment quickly, so they use a least mean square (LMS) filter. However, there is a downside. An LMS filter is very low, and it consequently lowers speech recognition rates. In order to overcome such a weak point, we propose a method to establish a robust speech recognition clustering model for noisy environments. Since this proposed method allows the cancelation of noise with an average estimator least mean square (AELMS) filter in a noisy environment, a robust speech recognition clustering model can be established. With the AELMS filter, which can preserve source features of speech and decrease the degradation of speech information, noise in a contaminated speech signal gets canceled, and a Gaussian state model is clustered as a method to make noise more robust. By composing a Gaussian clustering model, which is a robust speech recognition clustering model, in a noisy environment, recognition performance was evaluated. The study shows that the signal-to-noise ratio of speech, which was improved by canceling environment noise that kept changing, was enhanced by 2.8 dB on average, and recognition rate improved by 4.1 %.
引用
收藏
页码:1295 / 1301
页数:7
相关论文
共 23 条
[1]  
Ahmed B, 2004, ACOUSTICS SPEECH SIG
[2]   Dynamic Reconfiguration Based on Goal-Scenario by Adaptation Strategy [J].
Baek, Su-Jin ;
Han, Jung-Soo ;
Chung, Kyung-Yong .
WIRELESS PERSONAL COMMUNICATIONS, 2013, 73 (02) :309-318
[3]   Effect of facial makeup style recommendation on visual sensibility [J].
Chung, Kyung-Yong .
MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 71 (02) :843-853
[4]  
Elmezain M, 2008, INT C PATT RECOG, P424
[5]  
*ETSI, 2003, 202050V113 ETSI ES
[6]   Policy on literature content based on software as service [J].
Han, Jung-Soo ;
Chung, Kyung-Yong ;
Kim, Gui-Jung .
MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (20) :9087-9096
[7]  
Homer J., 2004, P IEEE 2004 INT C AC
[8]   Development of head detection and tracking systems for visual surveillance [J].
Kang, Sung-Kwan ;
Chung, Kyung-Yong ;
Lee, Jung-Hyun .
PERSONAL AND UBIQUITOUS COMPUTING, 2014, 18 (03) :515-522
[9]   Towards virtualized and automated software performance test architecture [J].
Kim, Gwang-Hun ;
Kim, Yeon-Gyun ;
Chung, Kyung-Yong .
MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (20) :8745-8759
[10]   Ontology-based healthcare context information model to implement ubiquitous environment [J].
Kim, Jonghun ;
Chung, Kyung-Yong .
MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 71 (02) :873-888