Fast speech recognition to access a very large list of items on embedded devices

被引:4
|
作者
Chung, Hoon [1 ]
Park, Jeon Gue [1 ]
Lee, Yun Keun [1 ]
Chung, Ikjoo [2 ]
机构
[1] ETRI, Spoken Language Proc Team, Taejon 305700, South Korea
[2] Kangwon Natl Univ, Dept Elect Engn, Chunchon 200701, South Korea
关键词
fast decoding; HSR;
D O I
10.1109/TCE.2008.4560163
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a fast decoding algorithm to recognize a very large number of item names on a resource-limited embedded device. The proposed algorithm is based on a multi-pass search scheme. The algorithm is composed of a two-stage HMM-based coarse match and a detailed match. The two-stage HMM-based coarse match is aimed at rapidly selecting a small set of candidates that are assumed to contain a correct hypothesis with high probability, and the detailed match re-ranks the candidates by performing acoustic rescoring. The proposed algorithm is implemented on an in-car navigation system with a 32-bit fixed-point processor operating at 620MHz. The experimental result shows that the proposed method runs at maximum speed 1.74 times real-time on the embedded device while minimizing the degradation of the recognition accuracy for a 220K Korean Point-of-Interest (POI),recognition domain.
引用
收藏
页码:803 / 807
页数:5
相关论文
共 45 条
  • [1] Two-pass search strategy for large list recognition on embedded speech recognition platforms
    Novak, M
    Hampl, R
    Krbec, R
    Bergl, V
    Sedivy, J
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 200 - 203
  • [2] A fast HMM match algorithm for very large vocabulary speech recognition
    Seward, A
    SPEECH COMMUNICATION, 2004, 42 (02) : 191 - 206
  • [3] Efficient Embedded Speech Recognition for Very Large Vocabulary Mandarin Car-Navigation Systems
    Qian, Yanmin
    Liu, Jia
    Johnson, Michael T.
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (03) : 1496 - 1500
  • [4] A scalable architecture for multilingual speech recognition on embedded devices
    Raab, Martin
    Gruhn, Rainer
    Noeth, Elmar
    SPEECH COMMUNICATION, 2011, 53 (01) : 62 - 74
  • [5] Lip reading for robust speech recognition on embedded devices
    Perez, JFG
    Frangi, AF
    Solano, EL
    Lukas, K
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 473 - 476
  • [6] Recognition Unit Determination of Interactive Chinese Speech Recognition for Embedded Devices
    Jang, Gil-Jin
    Pan, Chunghsi
    Park, Jae-Hyun
    Park, Jeong-sik
    Kim, Ji-Hwan
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (04) : 1353 - 1358
  • [7] LEXICAL ACCESS TO LARGE VOCABULARIES FOR SPEECH RECOGNITION
    FISSORE, L
    LAFACE, P
    MICCA, G
    PIERACCINI, R
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (08): : 1197 - 1213
  • [8] Chinese speech recognition system with very large vocabulary
    Qin, Y
    Mo, FY
    Li, CL
    Guan, DH
    ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1996, : 817 - 820
  • [9] Profiling Large-Vocabulary Continuous Speech Recognition on Embedded Devices: A Hardware Resource Sensitivity Analysis
    Yu, Kai
    Rutenbar, Rob A.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1903 - 1906
  • [10] Fast script word recognition with very large vocabulary
    Schambach, MP
    EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 9 - 13