Generic model implementation of deep neural network activation functions using GWO-optimized SCPWL model on FPGA

被引:13
作者
Al-Rikabi, Hussein M. H. [1 ]
Al-Ja'afari, Mohannad A. M. [2 ]
Ali, Ameer H. [2 ]
Abdulwahed, Saif H. [2 ]
机构
[1] Univ Kufa, Fac Engn, Dept Elect & Commun Engn, Najaf, Iraq
[2] Al Furat Al Awsat Tech Univ, Najaf Tech Inst, Najaf, Iraq
关键词
Deep neural network; FPGA; Activation function; SCPWL model; Optimization; HARDWARE IMPLEMENTATION;
D O I
10.1016/j.micpro.2020.103141
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The implementation of non-linear Activation Functions (AFs) within the Artificial Neural Network (ANN) on the Field Programmable Gate Array (FPGA) is substantial due to the various applications it performs. Accuracy, speed and complexity are the most crucial factors considered in this implementation. Building non-linear AFs in a reconfigurable ANN requires either sequential operations and/or additional complexity. In this paper, a generic model for three types of non-linear AFs (Logistic sigmoid (LogSig), Tan sigmoid (TanSig) and Radial Basis Function (RBF)) has been designed based on Simplicial Canonical Piecewise Linear (SCPWL) model that is optimized using Grey Wolf Optimizer (GWO(Algorithm. The designed model has been achieved by nine segments of the SCPWL model. The input of the AFs is ranging from (-8 to 8). Matlab has been deployed to design, optimize, simulate and validate this model. The maximum errors were 5.2e-3, 15.4e-3 and 7e-3 for LogSig, TanSig and RBF respectively. And, the Mean Square Error (MSE) were 1.81e-6, 1.22e-5 and 1.42e-5 for LogSig, TanSig and RBF respectively. The Matlab/HDL Coder has been used to generate the VHDL codes. The Xilinx Arty A7 (Xc7a35ticsg324-1L) FPGA kit is used to validate the designed model on Vivado Design Suite software. It has been noticed that it takes 581 Look-Up Tables (LUTs), nine DSP slices and a delay of (35.346 ns) to implement the nine SCPWL segments for any linear and non-linear AF. For validation, a complete ANN has been built with three hidden layers, each layer contain with one of the proposed AF models. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:9
相关论文
共 32 条
[21]   FPGA Implementation of the Multilayer Neural Network for the Speed Estimation of the Two-Mass Drive System [J].
Orlowska-Kowalska, Teresa ;
Kaminski, Marcin .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2011, 7 (03) :436-445
[22]   Adaptive activation functions in convolutional neural networks [J].
Qian, Sheng ;
Liu, Hua ;
Liu, Cheng ;
Wu, Si ;
Wong, Hau San .
NEUROCOMPUTING, 2018, 272 :204-212
[23]  
Renteria-Cedano J, 2016, EUR MICROW CONF, P854, DOI 10.1109/EuMC.2016.7824478
[24]  
SAHIN S, 2006, LECT NOTES COMPUTER, V4234
[25]   Nonlinear Modeling for Super-RENS Disc Using a Simplicial Canonical Piecewise-Linear Adaptive Filter [J].
Seo, Manjung ;
Im, Sungbin .
IEEE TRANSACTIONS ON MAGNETICS, 2011, 47 (03) :572-575
[26]   A novel hybrid GWO-SCA approach for optimization problems [J].
Singh, N. ;
Singh, S. B. .
ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2017, 20 (06) :1586-1601
[27]   Hardware implementation of neural network with Sigmoidal activation functions using CORDIC [J].
Tiwari, Vipin ;
Khare, Nilay .
MICROPROCESSORS AND MICROSYSTEMS, 2015, 39 (06) :373-381
[28]   Hierarchy Strengthened Grey Wolf Optimizer for Numerical Optimization and Feature Selection [J].
Tu, Qiang ;
Chen, Xuechen ;
Liu, Xingcheng .
IEEE ACCESS, 2019, 7 :78012-78028
[29]   Design and field programmable gate array implementation of cascade neural network based flux estimator for speed estimation in induction motor drives [J].
Venkadesan, Arunachalam ;
Himavathi, Srinivasan ;
Sedhuraman, Karthikeyan ;
Muthuramalingam, A. .
IET ELECTRIC POWER APPLICATIONS, 2017, 11 (01) :121-131
[30]  
Xie ZZ, 2012, 2012 IEEE FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), P221, DOI 10.1109/ICACI.2012.6463155