IMPROVED TONE MODELING BY EXPLOITING ARTICULATORY FEATURES FOR MANDARIN SPEECH RECOGNITION

被引:0
作者
Chao, Hao [1 ]
Yang, Zhanlei [1 ]
Liu, Wenju [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
来源
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012年
关键词
tone modeling; Mandarin; speech recognition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
For the same tone pattern, different articulatory characteristics may make the pitch contour change. This paper applies articulatory features, which represent the articulatory information, as well as prosodic features to the tone modeling. Three kinds of tone models are trained to verify the effectiveness of articulatory features. Tone recognition experiments indicate significant improvement can be achieved when using both articulatory features and prosodic features. After the first pass search of a speech recognition system, tone models using new tonal features are employed to rescoring the N-best hypotheses, and a 6.5% relative reduction of character error rate is achieved.
引用
收藏
页码:4741 / 4744
页数:4
相关论文
共 8 条
  • [1] [Anonymous], LIBSVM LIB SUPPORT V
  • [2] Lee T., 2002, ACM Transactions on Asian Language Information Processing (TALIP), V1, P83, DOI DOI 10.1145/595576.595581
  • [3] Lei X, 2006, INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, P1237
  • [4] Tone recognition in continuous Cantonese speech using supratone models
    Qian, Yao
    Lee, Tan
    Soong, Frank K.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 121 (05) : 2936 - 2945
  • [5] A Multi-Space Distribution (MSD) and two-stream tone modeling approach to Mandarin speech recognition
    Qian, Yao
    Soong, Frank K.
    [J]. SPEECH COMMUNICATION, 2009, 51 (12) : 1169 - 1179
  • [6] TIAN Y, 2004, P ICASSP 2004, P105
  • [7] Wei HX, 2008, INT CONF ACOUST SPEE, P4549
  • [8] Young S., 2000, HTK BOOK