Smoothed unit HMM in mandarin speech recognition

被引:0
作者
He, Q [1 ]
Mao, SY [1 ]
Zhang, YW [1 ]
机构
[1] Beijing Univ Aeronaut & Astronaut, Dept EE, Beijing 100083, Peoples R China
来源
2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III | 2000年
关键词
speech recognition; HMM; demi-syllable; SUHMM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The base unit in mandarin speech recognition can be phoneme, demi-syllable or syllable. Demi-syllable system has fewer HMM models and need less computation, thus it's suitable for real-time systems. But due to poor description for the acoustic properties of the speech signal, it generally shows a low performance compared to syllable system. While system based on syllable of phoneme (tri-phone or di-phone) has much more HMM models, and needs massive computation in training and recognition. In this paper, a compromised scheme is proposed. The new system is based on demi-syllable, but the two demi-syllable HMMs are connected into a full syllable HMM in training phase,so the data of the whole length of the syllable are used, and smoothing between two demi-syllables is introduced. This can increase the system performance without increasing HMM models, and it fits to real-time systems with DSP kernel.
引用
收藏
页码:792 / 795
页数:4
相关论文
共 50 条
  • [1] A Study on HMM based Speech Recognition System
    Boruah, Saptarshi
    Basishtha, Subhash
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2013, : 153 - 157
  • [2] Speech recognition of mandarin monosyllables
    Li, TF
    PATTERN RECOGNITION, 2003, 36 (11) : 2713 - 2721
  • [3] HMM/SVM segmentation and labelling of Arabic speech for speech recognition applications
    Frihia H.
    Bahi H.
    International Journal of Speech Technology, 2017, 20 (3) : 563 - 573
  • [4] HMM BASED ISOLATED WORD NEPALI SPEECH RECOGNITION
    Ssarma, Manish K.
    Gajurel, Avaas
    Pokhrel, Anup
    Joshi, Basanta
    PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2017, : 71 - 76
  • [5] Generalization of linear discriminant analysis used in segmental unit input HMM for speech recognition
    Sakai, Makoto
    Kitaoka, Norihide
    Nakagawa, Seiichi
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 333 - +
  • [6] Adaptive HMM Topology for Speech Recognition
    Ting, Chuan-Wei
    Lee, Kuo-Yuan
    Chien, Jen-Tzung
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1237 - 1240
  • [7] Feature extraction for HMM speech recognition systems using DTW
    Go, J
    Hyun, D
    Lee, C
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING I, 2002, : 241 - 244
  • [8] Modified Viterbi Decoder for Hmm Based Speech Recognition System
    Kumar, Y. Rajeev
    Babu, A. Venkatesh
    Kumar, K. A. Naveen
    Alex, John Sahaya Rani
    2014 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), 2014, : 470 - 474
  • [9] Heuristic Improvements of the HMM Use in Isolated Word Speech Recognition
    Dimov, Dimo
    Azmanov, Ivan
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2007, 7 (03) : 73 - 88
  • [10] Design and Development of Speech Recognition System Based on HMM Algorithm
    Xu Zhengru
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (ICCSE 2017), 2017, 81 : 124 - 128