An Ultra Low-Power Hardware Accelerator for Automatic Speech Recognition

被引:0
作者
Yazdani, Reza [1 ]
Segura, Albert [1 ]
Arnau, Jose-Maria [1 ]
Gonzalez, Antonio [1 ]
机构
[1] Univ Politecn Cataluna, Comp Architecture Dept, E-08028 Barcelona, Spain
来源
2016 49TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO) | 2016年
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic Speech Recognition (ASR) is becoming increasingly ubiquitous, especially in the mobile segment. Fast and accurate ASR comes at a high energy cost which is not affordable for the tiny power budget of mobile devices. Hardware acceleration can reduce power consumption of ASR systems, while delivering high-performance. In this paper, we present an accelerator for large-vocabulary, speaker-independent, continuous speech recognition. It focuses on the Viterbi search algorithm, that represents the main bottleneck in an ASR system. The proposed design includes innovative techniques to improve the memory subsystem, since memory is identified as the main bottleneck for performance and power in the design of these accelerators. We propose a prefetching scheme tailored to the needs of an ASR system that hides main memory latency for a large fraction of the memory accesses with a negligible impact on area. In addition, we introduce a novel bandwidth saving technique that removes 20% of the off-chip memory accesses issued during the Viterbi search. The proposed design outperforms software implementations running on the CPU by orders of magnitude and achieves 1.7x speedup over a highly optimized CUDA implementation running on a high-end Geforce GTX 980 GPU, while reducing by two orders of magnitude (287x) the energy required to convert the speech into text.
引用
收藏
页数:12
相关论文
共 50 条
[41]   Comparison of Speech Processing Strategies for the Design of an Ultra Low-Power Analog Bionic Ear [J].
Sawigun, Chutham ;
Ngamkham, Wannaya ;
Serdijn, Wouter A. .
2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, :1374-1377
[42]   A Framework for Ultra Low-Power Hardware Accelerators Using NNs for Embedded Time Series Classification [J].
Reiser, Daniel ;
Reichel, Peter ;
Pechmann, Stefan ;
Mallah, Maen ;
Oppelt, Maximilian ;
Hagelauer, Amelie ;
Breiling, Marco ;
Fey, Dietmar ;
Reichenbach, Marc .
JOURNAL OF LOW POWER ELECTRONICS AND APPLICATIONS, 2022, 12 (01)
[43]   Low-Power HW Accelerator for AI Edge-Computing in Human Activity Recognition Systems [J].
De Vita, Antonio ;
Pau, Danilo ;
Parrella, Claudio ;
Di Benedetto, Luigi ;
Rubino, Alfedo ;
Licciardo, Gian Domenico .
2020 2ND IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2020), 2020, :291-295
[44]   Ultra-low Power and Area-efficient Hardware Accelerator for Adaptive Neural Signal Compression [J].
Ma, Qier ;
Guo, Liyuan ;
Zeinolabedin, Seyed Mohammad Ali ;
Mayr, Christian .
2021 IEEE BIOMEDICAL CIRCUITS AND SYSTEMS CONFERENCE (IEEE BIOCAS 2021), 2021,
[45]   An Ultra-Low Power Hardware Accelerator Architecture for Wearable Computers Using Dynamic Time Warping [J].
Lotfian, Reza ;
Jafari, Roozbeh .
DESIGN, AUTOMATION & TEST IN EUROPE, 2013, :913-916
[46]   Design of a Low-Power Coprocessor for Mid-Size Vocabulary Speech Recognition Systems [J].
Li, Peng ;
Tang, Hua .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2011, 58 (05) :961-970
[47]   Low-Power Image Recognition Challenge [J].
Lu, Yung-Hsiang ;
Berg, Alexander C. ;
Chen, Yiran .
AI MAGAZINE, 2018, 39 (02) :87-88
[48]   Low-Power Image Recognition Challenge [J].
Gauen, Kent ;
Rangan, Rohit ;
Mohan, Anup ;
Lu, Yung-Hsiang ;
Liu, Wei ;
Berg, Alexander C. .
2017 22ND ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2017, :99-104
[49]   LOCATOR: Low-power ORB accelerator for autonomous cars [J].
Taranco, Raul ;
Arnau, Jose-Maria ;
Gonzalez, Antonio .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 174 :32-45
[50]   Low-Power Manycore Accelerator for Personalized Biomedical Applications [J].
Page, Adam ;
Attaran, Nasrin ;
Shea, Colin ;
Homayoun, Houman ;
Mohsenin, Tinoosh .
2016 INTERNATIONAL GREAT LAKES SYMPOSIUM ON VLSI (GLSVLSI), 2016, :63-68