A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH

被引:0
作者
He, Lei [1 ]
Hao, Jie [1 ]
机构
[1] Toshiba China Res & Dev Ctr, Beijing 100738, Peoples R China
来源
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年
关键词
speech recognition; tone recognition; feature selection;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a tone recognition framework for continuous Mandarin speech. To model the variations of F0 pattern caused by co-articulation and phonetic effects, a set of discriminating features are extracted: 1) outlined features from the F0 contours of target syllable and neighboring syllables are combined; 2) contextual tone information is utilized within an iterative process; 3) phonetic information from target and neighboring syllables is incorporated. These features are put into a decision tree for tone classification, which follows an HMM-based toneless decoder. The results in 5-tone recognition experiments show more than 40% relative error rate reduction against the baseline local outlined features. Moreover, the proposed method obviously outperforms HMM-based tone model in speaker-independent evaluation.
引用
收藏
页码:1575 / 1578
页数:4
相关论文
共 50 条
  • [31] Selective MCE training strategy in mandarin speech recognition
    Zhao, JM
    Zhu, XZ
    Xu, HY
    2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 679 - 683
  • [32] Research on tone recognition in Chinese spontaneous speech
    Liu Zhao-Jie
    Shao Jian
    Zhang Peng-Yuan
    Zhao Qing-Wei
    Yan Yong-Hong
    Feng Ji
    ACTA PHYSICA SINICA, 2007, 56 (12) : 7064 - 7069
  • [33] Robust mandarin speech recognition for car navigation interface
    Ding, Pei
    He, Lei
    Yan, Xiang
    Zhao, Rui
    Hao, Jie
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2006, PROCEEDINGS, 2006, 4261 : 302 - +
  • [34] Adaptive data augmentation for mandarin automatic speech recognition
    Ding, Kai
    Li, Ruixuan
    Xu, Yuelin
    Du, Xingyue
    Deng, Bin
    APPLIED INTELLIGENCE, 2024, 54 (07) : 5674 - 5687
  • [35] Prosodic Modeling in Large Vocabulary Mandarin Speech Recognition
    Huang, Jui-Ting
    Lee, Lin-shan
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1241 - 1244
  • [36] MODELING CHARACTERS VERSUS WORDS FOR MANDARIN SPEECH RECOGNITION
    Luo, Jun
    Lamel, Lori
    Gauvain, Jean-Luc
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4325 - 4328
  • [37] An Improved Lexicon Generation Method for Mandarin Speech Recognition
    Zhang, Yike
    Zhang, Pengyuan
    Zhao, Qingwei
    Yan, Yonghong
    Dong, Zhenjiang
    Jia, Xia
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017, : 661 - 665
  • [38] Speech Recognition of Accented Mandarin Based on Improved Conformer
    Yang, Xing-Yao
    Zhang, Shao-Dong
    Xiao, Rui
    Yu, Jiong
    Li, Zi-Yang
    SENSORS, 2023, 23 (08)
  • [39] A Combined Speaker Adaptation Method for Mandarin Speech Recognition
    徐向华
    朱杰
    Journal of Shanghai Jiaotong University, 2004, (04) : 21 - 24
  • [40] MANDARIN TONE RECOGNITION BASED ON WAVELET TRANSFORM AND HIDDEN MARKOV MODELING
    Cheng Jun Yi Kechu Li Bingbing (National Key Laboratory on ISN
    JournalofElectronics(China), 2000, (01) : 1 - 8