Stochastic Computing Convolutional Neural Network Architecture Reinvented for Highly Efficient Artificial Intelligence Workload on Field-Programmable Gate Array

被引:5
作者
Lee, Yang Yang [1 ]
Halim, Zaini Abdul [1 ]
Wahab, Mohd Nadhir Ab [2 ]
Almohamad, Tarik Adnan [3 ]
机构
[1] Univ Sains Malaysia, Sch Elect & Elect Engn, Nibong Tebal 14300, Malaysia
[2] Univ Sains Malaysia, Sch Comp Sci, Gelugor 11800, Malaysia
[3] Karabuk Univ, Fac Engn, Elect Elect Engn Dept, TR-78050 Karabuk, Turkiye
关键词
CIRCUITS; DESIGN;
D O I
10.34133/research.0307
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Stochastic computing (SC) has a substantial amount of study on application-specific integrated circuit (ASIC) design for artificial intelligence (AI) edge computing, especially the convolutional neural network (CNN) algorithm. However, SC has little to no optimization on field-programmable gate array (FPGA). Scaling up the ASIC logic without FPGA-oriented designs is inefficient, while aggregating thousands of bitstreams is still challenging in the conventional SC. This research has reinvented several FPGA-efficient function generator, and binary rectified linear unit, and successfully scaled and implemented a fully parallel CNN model on Kintex7 FPGA. The proposed SC hardware only compromises 0.14% accuracy compared to binary computing on the handwriting Modified National Institute of Standards and Technology classification task and achieved at least 99.72% energy saving per image feedforward and 31x more data throughput than modern hardware. Unique to SC, early decision termination pushed the performance baseline exponentially with minimum accuracy loss, making SC CNN extremely lucrative for AI edge computing but limited to classification tasks. The SC's inherent noise heavily penalizes CNN regression performance, rendering SC unsuitable for regression tasks.
引用
收藏
页数:25
相关论文
共 68 条
[1]   The Promise and Challenge of Stochastic Computing [J].
Alaghi, Armin ;
Qian, Weikang ;
Hayes, John P. .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 37 (08) :1515-+
[2]   Survey of Stochastic Computing [J].
Alaghi, Armin ;
Hayes, John P. .
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2013, 12
[3]   VLSI Implementation of Deep Neural Network Using Integral Stochastic Computing [J].
Ardakani, Arash ;
Leduc-Primeau, Francois ;
Onizawa, Naoya ;
Hanyu, Takahiro ;
Gross, Warren J. .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2017, 25 (10) :2688-2699
[4]   Late Breaking Results: LDFSM: A Low-Cost Bit-Stream Generator for Low-Discrepancy Stochastic Computing [J].
Asadi, Sina ;
Najafi, M. Hassan .
PROCEEDINGS OF THE 2020 57TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2020,
[5]  
Asadi S, 2021, PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), P908, DOI 10.23919/DATE51398.2021.9474143
[6]   Design of Large-Scale Stochastic Computing Adders and their Anomalous Behavior [J].
Baker, Timothy ;
Hayes, John P. .
2023 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2023,
[7]   CeMux: Maximizing the Accuracy of Stochastic Mux Adders and an Application to Filter Design [J].
Baker, Timothy J. ;
Hayes, John P. .
ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2022, 27 (03)
[8]   ON A SIMPLE STOCHASTIC NEURON - LIKE UNIT [J].
BANZHAF, W .
BIOLOGICAL CYBERNETICS, 1988, 60 (02) :153-160
[9]   FINN-R: An End-to-End Deep-Learning Framework for Fast Exploration of Quantized Neural Networks [J].
Blott, Michaela ;
Preusser, Thomas B. ;
Fraser, Nicholas J. ;
Gambardella, Giulio ;
O'Brien, Kenneth ;
Umuroglu, Yaman ;
Leeser, Miriam ;
Vissers, Kees .
ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2018, 11 (03)
[10]   Sustainable electronics: On the trail of reconfigurable computing [J].
Bossuet, Lilian .
SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2014, 4 (03) :196-202