A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH

被引：0

作者：

He, Lei ^{[1
]}

Hao, Jie ^{[1
]}

机构：

[1] Toshiba China Res & Dev Ctr, Beijing 100738, Peoples R China

来源：

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年

关键词：

speech recognition; tone recognition; feature selection;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a tone recognition framework for continuous Mandarin speech. To model the variations of F0 pattern caused by co-articulation and phonetic effects, a set of discriminating features are extracted: 1) outlined features from the F0 contours of target syllable and neighboring syllables are combined; 2) contextual tone information is utilized within an iterative process; 3) phonetic information from target and neighboring syllables is incorporated. These features are put into a decision tree for tone classification, which follows an HMM-based toneless decoder. The results in 5-tone recognition experiments show more than 40% relative error rate reduction against the baseline local outlined features. Moreover, the proposed method obviously outperforms HMM-based tone model in speaker-independent evaluation.

引用

页码：1575 / 1578

页数：4

共 50 条

[21] Can voice quality improve Mandarin tone recognition?
Surendran, Dinoj
Levow, Gina-Anne
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4177 - 4180
[22] Continuous Automatic Speech Recognition System using MapReduce Framework
Vikram, M.
Reddy, N. Sudhakar
Madhavi, K.
2017 7TH IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2017, : 80 - 83
[23] Mandarin Connected Digits Recognition for Whispered Speech
Ru Tingting
Xie Xiang
Yin Hui
Kuang Jingming
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1141 - 1144
[24] A simple statistical speech recognition of mandarin monosyllables
Li, Tze Fen
Chang, Shui-Ching
Lee, Chung-Bow
APPLIED MATHEMATICS AND COMPUTATION, 2006, 177 (02) : 644 - 651
[25] Continuous Mandarin speech recognition for Chinese language with large vocabulary based on segmental probability model
Shen, JL
IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1998, 145 (05): : 309 - 315
[26] Smoothed unit HMM in mandarin speech recognition
He, Q
Mao, SY
Zhang, YW
2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 792 - 795
[27] Mandarin Tone Recognition using Affine-Invariant Prosodic Features and Tone Posteriorgram
Wang, Yow-Bang
Lee, Lin-Shan
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2854 - 2857
[28] A Mandarin E-Learning System Based on Speech Recognition and Evaluation
Ming, Yue
Bai, Zongshan
COMPUTER APPLICATIONS IN ENGINEERING EDUCATION, 2011, 19 (04) : 651 - 659
[29] Tone recognition of continuous Thai speech under tonal assimilation and declination effects using half-tone model
Thubthong, N
Kijsirikul, B
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2001, 9 (06) : 815 - 825
[30] Improved Large Vocabulary Mandarin Speech Recognition by Selectively Using Tone Information with a Two-stage Prosodic Model
Cheng, Li-Wei
Lee, Lin-Shan
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1137 - 1140

← 1 2 3 4 5 →