A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH

被引：0

作者：

He, Lei ^{[1
]}

Hao, Jie ^{[1
]}

机构：

[1] Toshiba China Res & Dev Ctr, Beijing 100738, Peoples R China

来源：

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年

关键词：

speech recognition; tone recognition; feature selection;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a tone recognition framework for continuous Mandarin speech. To model the variations of F0 pattern caused by co-articulation and phonetic effects, a set of discriminating features are extracted: 1) outlined features from the F0 contours of target syllable and neighboring syllables are combined; 2) contextual tone information is utilized within an iterative process; 3) phonetic information from target and neighboring syllables is incorporated. These features are put into a decision tree for tone classification, which follows an HMM-based toneless decoder. The results in 5-tone recognition experiments show more than 40% relative error rate reduction against the baseline local outlined features. Moreover, the proposed method obviously outperforms HMM-based tone model in speaker-independent evaluation.

引用

页码：1575 / 1578

页数：4

共 50 条

[41] A Comparative Study of the Classification Techniques in Isolated Mandarin Syllable Tone Recognition
Dong, Jiatang
Li, Cen
PROCEEDINGS OF THE 49TH ANNUAL ASSOCIATION FOR COMPUTING MACHINERY SOUTHEAST CONFERENCE (ACMSE '11), 2011, : 263 - 269
[42] A Mandarin Tone Recognition Algorithm Based on Random Forest and Feature Fusion
Yan, Jiameng
Meng, Qiang
Tian, Lan
Wang, Xiaoyu
Liu, Junhui
Li, Meng
Zeng, Ming
Xu, Huifang
MATHEMATICS, 2023, 11 (08)
[43] Mandarin tone recognition in English speakers with normal hearing and with cochlear implants
Nie, Kaibao
Hannaford, Sophia
Director, Hannah M.
Nishigaki, Micah A.
Drennan, Ward R.
Rubinstein, Jay T.
INTERNATIONAL JOURNAL OF AUDIOLOGY, 2019, 58 (12) : 913 - 922
[44] Influence of Emotional Speech on Continuous Speech Recognition
Zgank, Andrej
Maucec, Mirjam Sepesy
13TH INTERNATIONAL CONFERENCE ON ELEKTRO (ELEKTRO 2020), 2020,
[45] The implementation of a practical high performance mandarin and sichuan dialect continuous speech recognition system for parcels checking task
Shan, YX
Zhang, HT
Li, HS
Zhong, L
Zhang, J
PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 409 - 412
[46] Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data
Wang, HM
Ho, TH
Yang, RC
Shen, JL
Bai, BR
Hong, JC
Chen, WP
Yu, TL
Lee, LS
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (02): : 195 - 200
[47] A framework for secure speech recognition
Smaragdis, Paris
Shashanka, Madhusudana V. S.
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 969 - +
[48] A Framework for Speech Recognition Benchmarking
Dernoncourt, Franck
Trung Bui
Chang, Walter
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 169 - 170
[49] A framework for secure speech recognition
Smaragdis, Paris
Shashanka, Madhusudana
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1404 - 1413
[50] A MANDARIN SPEECH RECOGNITION RESULT EVALUATION ALGORITHM ON MIXED WORDS
Liu, Gang
Chen, Wei
Guo, Yujing
Guo, Jun
2009 IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT, PROCEEDINGS, 2009, : 695 - 700

← 1 2 3 4 5 →