A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH

被引:0
|
作者
He, Lei [1 ]
Hao, Jie [1 ]
机构
[1] Toshiba China Res & Dev Ctr, Beijing 100738, Peoples R China
来源
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年
关键词
speech recognition; tone recognition; feature selection;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a tone recognition framework for continuous Mandarin speech. To model the variations of F0 pattern caused by co-articulation and phonetic effects, a set of discriminating features are extracted: 1) outlined features from the F0 contours of target syllable and neighboring syllables are combined; 2) contextual tone information is utilized within an iterative process; 3) phonetic information from target and neighboring syllables is incorporated. These features are put into a decision tree for tone classification, which follows an HMM-based toneless decoder. The results in 5-tone recognition experiments show more than 40% relative error rate reduction against the baseline local outlined features. Moreover, the proposed method obviously outperforms HMM-based tone model in speaker-independent evaluation.
引用
收藏
页码:1575 / 1578
页数:4
相关论文
共 50 条
  • [21] Can voice quality improve Mandarin tone recognition?
    Surendran, Dinoj
    Levow, Gina-Anne
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4177 - 4180
  • [22] Continuous Automatic Speech Recognition System using MapReduce Framework
    Vikram, M.
    Reddy, N. Sudhakar
    Madhavi, K.
    2017 7TH IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2017, : 80 - 83
  • [23] Mandarin Connected Digits Recognition for Whispered Speech
    Ru Tingting
    Xie Xiang
    Yin Hui
    Kuang Jingming
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1141 - 1144
  • [24] A simple statistical speech recognition of mandarin monosyllables
    Li, Tze Fen
    Chang, Shui-Ching
    Lee, Chung-Bow
    APPLIED MATHEMATICS AND COMPUTATION, 2006, 177 (02) : 644 - 651
  • [25] Continuous Mandarin speech recognition for Chinese language with large vocabulary based on segmental probability model
    Shen, JL
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1998, 145 (05): : 309 - 315
  • [26] Smoothed unit HMM in mandarin speech recognition
    He, Q
    Mao, SY
    Zhang, YW
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 792 - 795
  • [27] Mandarin Tone Recognition using Affine-Invariant Prosodic Features and Tone Posteriorgram
    Wang, Yow-Bang
    Lee, Lin-Shan
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2854 - 2857
  • [28] A Mandarin E-Learning System Based on Speech Recognition and Evaluation
    Ming, Yue
    Bai, Zongshan
    COMPUTER APPLICATIONS IN ENGINEERING EDUCATION, 2011, 19 (04) : 651 - 659
  • [29] Tone recognition of continuous Thai speech under tonal assimilation and declination effects using half-tone model
    Thubthong, N
    Kijsirikul, B
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2001, 9 (06) : 815 - 825
  • [30] Improved Large Vocabulary Mandarin Speech Recognition by Selectively Using Tone Information with a Two-stage Prosodic Model
    Cheng, Li-Wei
    Lee, Lin-Shan
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1137 - 1140