Smoothed unit HMM in mandarin speech recognition

被引：0

作者：

He, Q ^{[1
]}

Mao, SY ^{[1
]}

Zhang, YW ^{[1
]}

机构：

[1] Beijing Univ Aeronaut & Astronaut, Dept EE, Beijing 100083, Peoples R China

来源：

2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III | 2000年

关键词：

speech recognition; HMM; demi-syllable; SUHMM;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The base unit in mandarin speech recognition can be phoneme, demi-syllable or syllable. Demi-syllable system has fewer HMM models and need less computation, thus it's suitable for real-time systems. But due to poor description for the acoustic properties of the speech signal, it generally shows a low performance compared to syllable system. While system based on syllable of phoneme (tri-phone or di-phone) has much more HMM models, and needs massive computation in training and recognition. In this paper, a compromised scheme is proposed. The new system is based on demi-syllable, but the two demi-syllable HMMs are connected into a full syllable HMM in training phase,so the data of the whole length of the syllable are used, and smoothing between two demi-syllables is introduced. This can increase the system performance without increasing HMM models, and it fits to real-time systems with DSP kernel.

引用

页码：792 / 795

页数：4

共 50 条

[1] A Study on HMM based Speech Recognition System
Boruah, Saptarshi
Basishtha, Subhash
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2013, : 153 - 157
[2] Speech recognition of mandarin monosyllables
Li, TF
PATTERN RECOGNITION, 2003, 36 (11) : 2713 - 2721
[3] HMM/SVM segmentation and labelling of Arabic speech for speech recognition applications
Frihia H.
Bahi H.
International Journal of Speech Technology, 2017, 20 (3) : 563 - 573
[4] HMM BASED ISOLATED WORD NEPALI SPEECH RECOGNITION
Ssarma, Manish K.
Gajurel, Avaas
Pokhrel, Anup
Joshi, Basanta
PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2017, : 71 - 76
[5] Generalization of linear discriminant analysis used in segmental unit input HMM for speech recognition
Sakai, Makoto
Kitaoka, Norihide
Nakagawa, Seiichi
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 333 - +
[6] Adaptive HMM Topology for Speech Recognition
Ting, Chuan-Wei
Lee, Kuo-Yuan
Chien, Jen-Tzung
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1237 - 1240
[7] Feature extraction for HMM speech recognition systems using DTW
Go, J
Hyun, D
Lee, C
6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING I, 2002, : 241 - 244
[8] Modified Viterbi Decoder for Hmm Based Speech Recognition System
Kumar, Y. Rajeev
Babu, A. Venkatesh
Kumar, K. A. Naveen
Alex, John Sahaya Rani
2014 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), 2014, : 470 - 474
[9] Heuristic Improvements of the HMM Use in Isolated Word Speech Recognition
Dimov, Dimo
Azmanov, Ivan
CYBERNETICS AND INFORMATION TECHNOLOGIES, 2007, 7 (03) : 73 - 88
[10] Design and Development of Speech Recognition System Based on HMM Algorithm
Xu Zhengru
PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (ICCSE 2017), 2017, 81 : 124 - 128

← 1 2 3 4 5 →