A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH

被引:0
作者
He, Lei [1 ]
Hao, Jie [1 ]
机构
[1] Toshiba China Res & Dev Ctr, Beijing 100738, Peoples R China
来源
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年
关键词
speech recognition; tone recognition; feature selection;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a tone recognition framework for continuous Mandarin speech. To model the variations of F0 pattern caused by co-articulation and phonetic effects, a set of discriminating features are extracted: 1) outlined features from the F0 contours of target syllable and neighboring syllables are combined; 2) contextual tone information is utilized within an iterative process; 3) phonetic information from target and neighboring syllables is incorporated. These features are put into a decision tree for tone classification, which follows an HMM-based toneless decoder. The results in 5-tone recognition experiments show more than 40% relative error rate reduction against the baseline local outlined features. Moreover, the proposed method obviously outperforms HMM-based tone model in speaker-independent evaluation.
引用
收藏
页码:1575 / 1578
页数:4
相关论文
共 50 条
  • [41] A Comparative Study of the Classification Techniques in Isolated Mandarin Syllable Tone Recognition
    Dong, Jiatang
    Li, Cen
    PROCEEDINGS OF THE 49TH ANNUAL ASSOCIATION FOR COMPUTING MACHINERY SOUTHEAST CONFERENCE (ACMSE '11), 2011, : 263 - 269
  • [42] A Mandarin Tone Recognition Algorithm Based on Random Forest and Feature Fusion
    Yan, Jiameng
    Meng, Qiang
    Tian, Lan
    Wang, Xiaoyu
    Liu, Junhui
    Li, Meng
    Zeng, Ming
    Xu, Huifang
    MATHEMATICS, 2023, 11 (08)
  • [43] Mandarin tone recognition in English speakers with normal hearing and with cochlear implants
    Nie, Kaibao
    Hannaford, Sophia
    Director, Hannah M.
    Nishigaki, Micah A.
    Drennan, Ward R.
    Rubinstein, Jay T.
    INTERNATIONAL JOURNAL OF AUDIOLOGY, 2019, 58 (12) : 913 - 922
  • [44] Influence of Emotional Speech on Continuous Speech Recognition
    Zgank, Andrej
    Maucec, Mirjam Sepesy
    13TH INTERNATIONAL CONFERENCE ON ELEKTRO (ELEKTRO 2020), 2020,
  • [45] The implementation of a practical high performance mandarin and sichuan dialect continuous speech recognition system for parcels checking task
    Shan, YX
    Zhang, HT
    Li, HS
    Zhong, L
    Zhang, J
    PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 409 - 412
  • [46] Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data
    Wang, HM
    Ho, TH
    Yang, RC
    Shen, JL
    Bai, BR
    Hong, JC
    Chen, WP
    Yu, TL
    Lee, LS
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (02): : 195 - 200
  • [47] A framework for secure speech recognition
    Smaragdis, Paris
    Shashanka, Madhusudana V. S.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 969 - +
  • [48] A Framework for Speech Recognition Benchmarking
    Dernoncourt, Franck
    Trung Bui
    Chang, Walter
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 169 - 170
  • [49] A framework for secure speech recognition
    Smaragdis, Paris
    Shashanka, Madhusudana
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1404 - 1413
  • [50] A MANDARIN SPEECH RECOGNITION RESULT EVALUATION ALGORITHM ON MIXED WORDS
    Liu, Gang
    Chen, Wei
    Guo, Yujing
    Guo, Jun
    2009 IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT, PROCEEDINGS, 2009, : 695 - 700