Robust Mandarin speech recognition in car environments for embedded navigation system

被引：7

作者：

Ding, Pei ^{[1
]}

He, Lei ^{[1
]}

Yan, Xiang ^{[1
]}

Zhao, Rui ^{[1
]}

Hao, Jie ^{[1
]}

机构：

[1] Toshiba China, Speech Grp, Res & Dev Ctr, Beijing 100738, Peoples R China

来源：

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS | 2008年 / 54卷 / 02期

关键词：

speech recognition; noise robustness; accented Mandarin; car navigation;

D O I：

10.1109/TCE.2008.4560134

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

A low-cost robust Mandarin speech recognition system is investigated for embedded car navigation application. In the front-end, log-spectral minimum mean-square error (LogMMSE) estimation algorithm is applied to suppress the background noise, and a piece-wise linear function is used to approximate the traditional Taylor expansion in its gain function calculation to reduce the computational complexity. After speech enhancement, spectral smoothing is implemented in both time and frequency indexes with geometric sequence weights to further compensate the spectral components distorted by noise over-reduction. In acoustic model training, an immunity learning scheme is applied, in which pre-recorded car noise is artificially added to clean. training utterances to simulate the in-car environment. In the context of Mandarin speech recognition, a special difficulty is the diversity of Chinese dialects, i.e. the pronunciation difference among accents degrades the recognition performance if the acoustic models are trained with a mismatched accented database. We propose to train the models with multiple accented Mandarin databases to deal with this problem. Evaluation results of isolated phrase recognition confirm the effectivity of the proposed technologies(1).

引用

页码：584 / 590

页数：7

共 50 条

[1] Robust mandarin speech recognition for car navigation interface
Ding, Pei
He, Lei
Yan, Xiang
Zhao, Rui
Hao, Jie
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2006, PROCEEDINGS, 2006, 4261 : 302 - +
[2] Robust Automatic Speech Recognition for Accented Mandarin in Car Environments
Pei Ding
Lei He
Xiang Yan
Jie Hao
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2542 - 2545
[3] Robust speech recognition in car environments
Shozakai, M
Nakamura, S
Shikano, K
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 269 - 272
[4] Efficient Embedded Speech Recognition for Very Large Vocabulary Mandarin Car-Navigation Systems
Qian, Yanmin
Liu, Jia
Johnson, Michael T.
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (03) : 1496 - 1500
[5] Feature masking in an embedded Mandarin speech recognition system
Tang, YZ
Wang, X
Cao, Y
Ding, F
2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 245 - 248
[6] An embedded multilingual speech recognition system for Mandarin, Cantonese, and English
Wang, X
Cao, Y
Ding, F
Tang, YZ
2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 758 - 764
[7] Robust technologies towards automatic speech recognition in car noise environments
Ding, Pei
He, Lei
Yan, Xiang
Zhao, Rui
Hao, Jie
2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 776 - +
[8] Two-channel noise reduction for robust speech recognition in car environments
Jeong, S.
Yang, H.
Hahn, M.
ELECTRONICS LETTERS, 2008, 44 (17) : 1042 - U54
[9] Spectral subtraction using spectral harmonics for robust speech recognition in car environments
Beh, J
Ko, H
COMPUTATIONAL SCIENCE - ICCS 2003, PT IV, PROCEEDINGS, 2003, 2660 : 1109 - 1116
[10] Fast Speech Recognition for Voice Destination Entry in a Car Navigation System
Chung, Hoon
Park, JeonGue
Jeon, HyeonBae
Lee, YunKeun
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 979 - 982

← 1 2 3 4 5 →