Design of a Low-Power Coprocessor for Mid-Size Vocabulary Speech Recognition Systems

Cited by: 9

Authors:

Li, Peng [1]

Tang, Hua [1]

Affiliation:

[1] Univ Minnesota, Dept Elect & Comp Engn, Duluth, MN 55812 USA

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS | 2011 / Vol. 58 / Issue 05

Keywords:

Coprocessor; custom design; field-programmable gate array (FPGA); hardware implementation; hidden Markov model (HMM); speech recognition; VLSI; VLSI IMPLEMENTATION;

DOI:

10.1109/TCSI.2010.2090569

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Speech recognition systems have gained popularity in consumer electronics. This paper presents a custom-designed coprocessor for output probability calculation (OPC), which is the most computation-intensive processing step in continuous hidden Markov model (CHMM)-based speech recognition algorithms. To save hardware resources and reduce power consumption, a polynomial addition-based method is used to compute add-log instead of the traditional look-up table-based method. In addition, the optimal tradeoff between speech processing delay, energy consumption, and hardware resources is explored for the coprocessor. The proposed coprocessor has been implemented and tested in Xilinx Spartan-3A DSP XC3SD3400A, and also validated using the standard-cell-based approach in IBM 0.13 μm technology. To implement an entire speech recognition system, SAMSUNG S3C44b0X (containing an ARM7) is used as the micro-controller to execute the rest of the speech processing. Tested with a 358-state 3-mixture 27-feature 800-word HMM, the S3C44b0X operates at 40 MHz and the coprocessor at 10 MHz to meet the real-time requirement, and the recognition accuracy is 95.2%. Power consumption of the micro-controller is 10 mW, and that of the coprocessor is 15.2 mW. The overall speech recognition system achieves the lowest energy consumption per word recognition among many reported designs. Experiment and analysis show that the speech recognition system based on the proposed coprocessor is especially suitable for mid-size vocabulary (100-1000 words) recognition tasks.
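The add-log operation mentioned in the abstract combines mixture terms in the log domain through the identity ln(e^a + e^b) = max(a, b) + ln(1 + e^(-|a - b|)). The abstract does not describe the authors' polynomial method in detail, so the following is only a minimal sketch of the general idea: the correction term ln(1 + e^(-d)) is replaced by a low-order polynomial instead of a look-up table. The interpolation nodes (0, 2, 4), the cutoff at d = 4, and all function names are hypothetical choices made for illustration, not values from the paper.

```python
import math

def log_add_exact(a, b):
    """Reference: ln(e^a + e^b), computed without overflow."""
    hi, lo = max(a, b), min(a, b)
    return hi + math.log1p(math.exp(lo - hi))

def _correction(d):
    """Exact correction term ln(1 + e^-d) for d >= 0."""
    return math.log1p(math.exp(-d))

# Assumption: a quadratic through three illustrative nodes, written in
# Newton divided-difference form. These nodes are NOT from the paper.
X0, X1, X2 = 0.0, 2.0, 4.0
F0, F1, F2 = _correction(X0), _correction(X1), _correction(X2)
C1 = (F1 - F0) / (X1 - X0)                      # first divided difference
C2 = ((F2 - F1) / (X2 - X1) - C1) / (X2 - X0)   # second divided difference

def correction_poly(d):
    """Quadratic approximation of ln(1 + e^-d); treated as 0 for d >= X2."""
    if d >= X2:
        return 0.0
    return F0 + (d - X0) * (C1 + (d - X1) * C2)

def log_add_poly(a, b):
    """Approximate ln(e^a + e^b) using only compare, subtract, multiply, add."""
    hi, lo = max(a, b), min(a, b)
    return hi + correction_poly(hi - lo)
```

With these illustrative nodes the approximation is coarse (errors of a few hundredths in the log domain); a practical design would choose the polynomial order and coefficients against its own accuracy and hardware budget.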


Pages: 961 - 970

Page count: 10
