Design of a Low-Power Coprocessor for Mid-Size Vocabulary Speech Recognition Systems

被引:9
|
作者
Li, Peng [1 ]
Tang, Hua [1 ]
机构
[1] Univ Minnesota, Dept Elect & Comp Engn, Duluth, MN 55812 USA
关键词
Coprocessor; custom design; field-programmable gate array (FPGA); hardware implementation; hidden Markov model (HMM); speech recognition; VLSI; VLSI IMPLEMENTATION;
D O I
10.1109/TCSI.2010.2090569
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech recognition systems have gained popularity in consumer electronics. This paper presents a custom-designed coprocessor for output probability calculation (OPC), which is the most computation-intensive processing step in continuous hidden Markov model (CHMM)-based speech recognition algorithms. To save hardware resource and reduce power consumption, a polynomial addition-based method is used to compute add-log instead of the traditional look-up table-based method. In addition, the optimal tradeoff between speech processing delay, energy consumption, and hardware resources is explored for the coprocessor. The proposed coprocessor has been implemented and tested in Xilinx Spartan-3A DSP XC3SD3400A, and also validated using the standard-cell-based approach in IBM 0.13 mu m technology. To implement an entire speech recognition system, SAMSUNG S3C44b0X (containing an ARM7) is used as the micro-controller to execute the rest of speech processing. Tested with a 358-state 3-mixture 27-feature 800-word HMM, S3C44b0X operates at 40 MHz and coprocessor at 10 MHz to meet the real-time requirement, and the recognition accuracy is 95.2%. Power consumption of the micro-controller is 10 mW, and that of the coprocessor 15.2 mW. The overall speech recognition system achieves the lowest energy consumption per word recognition among many reported designs. Experiment and analysis show that the speech recognition system based on the proposed coprocessor is especially suitable for mid-size vocabulary (100-1000 words) recognition tasks.
引用
收藏
页码:961 / 970
页数:10
相关论文
共 50 条
  • [1] Design of a Low-Power Asynchronous Elliptic Curve Cryptography Coprocessor
    Zeidler, Steffen
    Goderbauer, Michael
    Krstic, Milos
    2013 IEEE 20TH INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS, AND SYSTEMS (ICECS), 2013, : 569 - 572
  • [2] A low-power VLSI design of a HMM based speech recognition system
    Yoshizawa, S
    Miyanaga, Y
    Wada, N
    2002 45TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, CONFERENCE PROCEEDINGS, 2002, : 489 - 492
  • [3] Mid-size nuclear power reactor
    不详
    NUCLEAR PLANT JOURNAL, 2008, 26 (01) : 8 - 8
  • [4] LOW POWER EMBEDDED SPEECH RECOGNITION SYSTEM BASED ON A MCU AND A COPROCESSOR
    Li, Peng
    Tang, Hua
    Liang, Weiqian
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 625 - +
  • [5] Architecture for low power large vocabulary speech recognition
    Chandra, Dhruba
    Pazhayaveetil, Ullas
    Franzon, Paul D.
    IEEE INTERNATIONAL SOC CONFERENCE, PROCEEDINGS, 2006, : 25 - +
  • [6] The Design of a Low-Power Asynchronous DES Coprocessor for Sensor Network Encryption
    Liu, Yijun
    Chen, Pinghua
    Xie, Guobo
    Liu, Zhusong
    Li, Zhenkun
    ISCSCT 2008: INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY, VOL 2, PROCEEDINGS, 2008, : 190 - 193
  • [7] QualServe pilots program for mid-size systems
    Crotty, P
    Arndt
    Moss
    JOURNAL AMERICAN WATER WORKS ASSOCIATION, 1998, 90 (11): : 32 - +
  • [8] A low-power integrated circuit for remote speech recognition
    Borgatti, M
    Felici, M
    Ferrari, A
    Guerrieri, R
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1998, 33 (07) : 1082 - 1089
  • [9] A Low-Power Hardware Search Architecture for Speech Recognition
    Bourke, Patrick J.
    Rutenbar, Rob A.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2102 - 2105
  • [10] A low-power VLSI feature extractor for speech recognition
    Felici, M
    Borgatti, M
    Ferrari, A
    Guerrieri, R
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 3061 - 3064