Robust Mandarin speech recognition in car environments for embedded navigation system

被引:7
|
作者
Ding, Pei [1 ]
He, Lei [1 ]
Yan, Xiang [1 ]
Zhao, Rui [1 ]
Hao, Jie [1 ]
机构
[1] Toshiba China, Speech Grp, Res & Dev Ctr, Beijing 100738, Peoples R China
关键词
speech recognition; noise robustness; accented Mandarin; car navigation;
D O I
10.1109/TCE.2008.4560134
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A low-cost robust Mandarin speech recognition system is investigated for embedded car navigation application. In the front-end, log-spectral minimum mean-square error (LogMMSE) estimation algorithm is applied to suppress the background noise, and a piece-wise linear function is used to approximate the traditional Taylor expansion in its gain function calculation to reduce the computational complexity. After speech enhancement, spectral smoothing is implemented in both time and frequency indexes with geometric sequence weights to further compensate the spectral components distorted by noise over-reduction. In acoustic model training, an immunity learning scheme is applied, in which pre-recorded car noise is artificially added to clean. training utterances to simulate the in-car environment. In the context of Mandarin speech recognition, a special difficulty is the diversity of Chinese dialects, i.e. the pronunciation difference among accents degrades the recognition performance if the acoustic models are trained with a mismatched accented database. We propose to train the models with multiple accented Mandarin databases to deal with this problem. Evaluation results of isolated phrase recognition confirm the effectivity of the proposed technologies(1).
引用
收藏
页码:584 / 590
页数:7
相关论文
共 50 条
  • [1] Robust mandarin speech recognition for car navigation interface
    Ding, Pei
    He, Lei
    Yan, Xiang
    Zhao, Rui
    Hao, Jie
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2006, PROCEEDINGS, 2006, 4261 : 302 - +
  • [2] Robust Automatic Speech Recognition for Accented Mandarin in Car Environments
    Pei Ding
    Lei He
    Xiang Yan
    Jie Hao
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2542 - 2545
  • [3] Robust speech recognition in car environments
    Shozakai, M
    Nakamura, S
    Shikano, K
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 269 - 272
  • [4] Efficient Embedded Speech Recognition for Very Large Vocabulary Mandarin Car-Navigation Systems
    Qian, Yanmin
    Liu, Jia
    Johnson, Michael T.
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (03) : 1496 - 1500
  • [5] Feature masking in an embedded Mandarin speech recognition system
    Tang, YZ
    Wang, X
    Cao, Y
    Ding, F
    2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 245 - 248
  • [6] An embedded multilingual speech recognition system for Mandarin, Cantonese, and English
    Wang, X
    Cao, Y
    Ding, F
    Tang, YZ
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 758 - 764
  • [7] Robust technologies towards automatic speech recognition in car noise environments
    Ding, Pei
    He, Lei
    Yan, Xiang
    Zhao, Rui
    Hao, Jie
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 776 - +
  • [8] Two-channel noise reduction for robust speech recognition in car environments
    Jeong, S.
    Yang, H.
    Hahn, M.
    ELECTRONICS LETTERS, 2008, 44 (17) : 1042 - U54
  • [9] Spectral subtraction using spectral harmonics for robust speech recognition in car environments
    Beh, J
    Ko, H
    COMPUTATIONAL SCIENCE - ICCS 2003, PT IV, PROCEEDINGS, 2003, 2660 : 1109 - 1116
  • [10] Fast Speech Recognition for Voice Destination Entry in a Car Navigation System
    Chung, Hoon
    Park, JeonGue
    Jeon, HyeonBae
    Lee, YunKeun
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 979 - 982