Design of a Low-Power Coprocessor for Mid-Size Vocabulary Speech Recognition Systems

Cited by: 9
Authors
Li, Peng [1 ]
Tang, Hua [1 ]
Affiliations
[1] Univ Minnesota, Dept Elect & Comp Engn, Duluth, MN 55812 USA
Keywords
Coprocessor; custom design; field-programmable gate array (FPGA); hardware implementation; hidden Markov model (HMM); speech recognition; VLSI; VLSI implementation
DOI
10.1109/TCSI.2010.2090569
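Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract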
Speech recognition systems have gained popularity in consumer electronics. This paper presents a custom-designed coprocessor for output probability calculation (OPC), which is the most computation-intensive processing step in continuous hidden Markov model (CHMM)-based speech recognition algorithms. To save hardware resources and reduce power consumption, a polynomial addition-based method is used to compute add-log instead of the traditional look-up table-based method. In addition, the optimal tradeoff between speech processing delay, energy consumption, and hardware resources is explored for the coprocessor. The proposed coprocessor has been implemented and tested on a Xilinx Spartan-3A DSP XC3SD3400A, and also validated using the standard-cell-based approach in IBM 0.13 µm technology. To implement an entire speech recognition system, a SAMSUNG S3C44b0X (containing an ARM7) is used as the micro-controller to execute the rest of the speech processing. Tested with a 358-state, 3-mixture, 27-feature, 800-word HMM, the S3C44b0X operates at 40 MHz and the coprocessor at 10 MHz to meet the real-time requirement, and the recognition accuracy is 95.2%. The micro-controller consumes 10 mW and the coprocessor 15.2 mW. The overall speech recognition system achieves the lowest energy consumption per recognized word among many reported designs. Experiments and analysis show that a speech recognition system based on the proposed coprocessor is especially suitable for mid-size vocabulary (100-1000 words) recognition tasks.
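As a rough illustration of the add-log step mentioned in the abstract, the sketch below computes log(exp(a) + exp(b)) as max(a, b) plus a correction term log(1 + exp(-d)), where d = |a - b|, and replaces that correction with a small polynomial instead of a look-up table. This is not the paper's design: the quadratic, its interpolation nodes, the cutoff `D_MAX`, the diagonal-covariance assumption, and all function names are hypothetical choices made only for this example.

```python
import math

D_MAX = 4.0  # hypothetical cutoff: beyond this gap the correction is treated as 0


def exact_add_log(a, b):
    """Reference value of log(exp(a) + exp(b)), computed stably."""
    hi, lo = max(a, b), min(a, b)
    return hi + math.log1p(math.exp(lo - hi))


def _fit_quadratic():
    """Quadratic through log(1 + exp(-d)) at d = 0, D_MAX/2, D_MAX (Newton form)."""
    x1, x2 = D_MAX / 2.0, D_MAX
    y0, y1, y2 = (math.log1p(math.exp(-x)) for x in (0.0, x1, x2))
    f01 = (y1 - y0) / x1
    f12 = (y2 - y1) / (x2 - x1)
    f012 = (f12 - f01) / x2
    return y0, f01, f012, x1


_Y0, _F01, _F012, _X1 = _fit_quadratic()


def poly_add_log(a, b):
    """Approximate add-log: max plus a polynomial correction, no table."""
    hi, lo = max(a, b), min(a, b)
    d = hi - lo
    if d >= D_MAX:
        return hi  # the smaller term contributes almost nothing
    return hi + _Y0 + d * (_F01 + _F012 * (d - _X1))


def log_output_prob(obs, mixtures, add_log=poly_add_log):
    """Log output probability of one state for one feature vector.

    Assumes diagonal-covariance Gaussians; each mixture is a tuple
    (log_const, means, inv_vars), where log_const already folds in the
    log mixture weight and the Gaussian normalisation term.
    """
    total = None
    for log_const, means, inv_vars in mixtures:
        dist = sum((o - m) ** 2 * iv for o, m, iv in zip(obs, means, inv_vars))
        term = log_const - 0.5 * dist
        total = term if total is None else add_log(total, term)
    return total
```

For the model size quoted in the abstract, `log_output_prob` would be called once per state per frame with 3 mixtures and 27-element vectors. In this sketch the approximation error is confined to the correction term, so it can be traded against polynomial degree without changing the rest of the calculation.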
Pages: 961-970
Page count: 10
Related papers
50 in total
  • [41] Low-Power FPGA Design Using Memoization-Based Approximate Computing
    Sinha, Sharad
    Zhang, Wei
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2016, 24 (08) : 2665 - 2678
  • [42] Algorithm and VLSI Architecture Design of Low-Power SPIHT Decoder for mHealth Applications
    Hsieh, Jui-Hung
    Shih, Meng-Ju
    Huang, Xin-Hao
    IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2018, 12 (06) : 1450 - 1457
  • [43] Design and Validation of a Very Low-Power Phasor Measurement Unit for the Distribution System
    Lythgoe, Zachary J.
    Long, Thomas F.
    Buchholz, Michael J.
    Livernois, Anthony R.
    Kanuteh, Kebba
    Allee, David R.
    Pal, Anamitra
    Graham, Ian R.
    Drummond, Zachary D.
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2025, 61 (02) : 3553 - 3562
  • [44] Enhancing Speech Recognition In Developing Language Learning Systems For Low Cost Androids
    Jayakumar, Akshay
    Raghunath, Meera
    Sakthipriya, M. S.
    Akhila, S.
    Sadanandan, Anuja
    Nedungadi, Prema
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL TECHNIQUES IN INFORMATION AND COMMUNICATION TECHNOLOGIES (ICCTICT), 2016
  • [45] Design optimization of low-power high-performance DSP building blocks
    Gemmeke, T
    Gansen, M
    Stockmans, HJ
    Noll, TG
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2004, 39 (07) : 1131 - 1139
  • [46] Low-power VLSI design for motion estimation using adaptive pixel truncation
    He, ZL
    Tsui, CY
    Chan, KK
    Liou, ML
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2000, 10 (05) : 669 - 678
  • [47] VCO-based Feature Extraction Architecture for Low Power Speech Recognition Applications
    Gutierrez, Eric
    Perez, Carlos
    Hernandez, Fernando
    Hernandez, Luis
    2019 IEEE 62ND INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2019, : 1175 - 1178
  • [48] Design and analysis of hybrid NEMS-CMOS circuits for ultra low-power applications
    Dadgour, Hamed F.
    Banerjee, Kaustav
    2007 44TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, VOLS 1 AND 2, 2007, : 306 - +
  • [49] A New Approach to Zero-Crossing and LPC Speech Detection for Low Power Terminals Using Speech Recognition Technology
    Ema K.
    Yasuda A.
    Oshima F.
    IEEJ Transactions on Electronics, Information and Systems, 2021, 141 (12) : 1424 - 1429
  • [50] The analysis and design of architecture systems for speech recognition on modern handheld-computing devices
    Hagen, A
    Connors, DA
    Pellom, BL
    CODES+ISSS 2003: FIRST IEEE/ACM/IFIP INTERNATIONAL CONFERENCE ON HARDWARE/SOFTWARE CODESIGN & SYSTEM SYNTHESIS, 2003, : 65 - 70