A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH

Cited by: 0

Authors:

He, Lei [1]

Hao, Jie [1]

Affiliations:

[1] Toshiba China Res & Dev Ctr, Beijing 100738, Peoples R China

Source:

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006

Keywords:

speech recognition; tone recognition; feature selection

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

In this paper, we present a tone recognition framework for continuous Mandarin speech. To model the variations in F0 patterns caused by co-articulation and phonetic effects, a set of discriminative features is extracted: 1) outlined features from the F0 contours of the target syllable and its neighboring syllables are combined; 2) contextual tone information is used within an iterative process; 3) phonetic information from the target and neighboring syllables is incorporated. These features are fed into a decision tree for tone classification, which follows an HMM-based toneless decoder. In 5-tone recognition experiments, the results show a relative error rate reduction of more than 40% against the baseline of local outlined features. Moreover, the proposed method clearly outperforms an HMM-based tone model in speaker-independent evaluation.
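
The feature combination and iterative use of contextual tones described in the abstract can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration, not the authors' implementation: the function names (`outline_features`, `build_features`, `iterative_tone_decode`, `toy_classify`), the choice of mean and linear slope as the "outlined" summary of an F0 contour, the use of 0 as an "unknown tone" placeholder, and the stopping rule. The decision tree is replaced by an arbitrary callable, a toy threshold rule stands in for a trained classifier, and the phonetic features (item 3) are omitted.

```python
from typing import Callable, List, Sequence


def outline_features(f0: Sequence[float]) -> List[float]:
    """Summarize one non-empty F0 contour as [mean, least-squares slope].

    Hypothetical stand-in for the paper's outlined features.
    """
    n = len(f0)
    mean = sum(f0) / n
    if n < 2:
        return [mean, 0.0]
    t_mean = (n - 1) / 2
    num = sum((t - t_mean) * (v - mean) for t, v in enumerate(f0))
    den = sum((t - t_mean) ** 2 for t in range(n))
    return [mean, num / den]


def build_features(contours: Sequence[Sequence[float]],
                   tones: Sequence[int], i: int) -> List[float]:
    """Concatenate previous/target/next outline features and neighbor tones.

    Missing neighbors at utterance edges are padded with zeros (assumption).
    """
    pad = [0.0, 0.0]
    prev_f = outline_features(contours[i - 1]) if i > 0 else pad
    next_f = outline_features(contours[i + 1]) if i + 1 < len(contours) else pad
    prev_t = tones[i - 1] if i > 0 else 0
    next_t = tones[i + 1] if i + 1 < len(tones) else 0
    return (prev_f + outline_features(contours[i]) + next_f
            + [float(prev_t), float(next_t)])


def iterative_tone_decode(contours: Sequence[Sequence[float]],
                          classify: Callable[[List[float]], int],
                          max_iters: int = 5) -> List[int]:
    """Re-classify every syllable until the tone sequence stops changing."""
    tones = [0] * len(contours)  # 0 = tone not yet known
    for _ in range(max_iters):
        new_tones = [classify(build_features(contours, tones, i))
                     for i in range(len(contours))]
        if new_tones == tones:
            break
        tones = new_tones
    return tones


def toy_classify(features: List[float]) -> int:
    """Toy rule on the target slope only; a trained tree would go here."""
    slope = features[3]  # index 3 = slope of the target syllable
    if slope > 0.5:
        return 2  # rising contour
    if slope < -0.5:
        return 4  # falling contour
    return 1      # roughly level contour


if __name__ == "__main__":
    syllables = [[100, 110, 120], [200, 200, 200], [180, 150, 120]]
    print(iterative_tone_decode(syllables, toy_classify))
```

In this sketch the toy rule ignores the neighbor tones, so the loop converges after the second pass; a classifier trained on the full feature vector could change its decision for a syllable once its neighbors' tones are filled in, which is the point of iterating.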

Pages: 1575-1578

Page count: 4
