Sigmoid Activation Implementation for Neural Networks Hardware Accelerators Based on Reconfigurable Computing Environments for Low-Power Intelligent Systems

被引:5
作者
Shatravin, Vladislav [1 ]
Shashev, Dmitriy [1 ]
Shidlovskiy, Stanislav [1 ]
机构
[1] Tomsk State Univ, Fac Innovat Technol, Tomsk 634050, Russia
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 10期
基金
俄罗斯科学基金会;
关键词
deep neural networks; hardware accelerators; low-power systems; homogeneous structures; reconfigurable environments; parallel processing; DESIGN;
D O I
10.3390/app12105216
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The remarkable results of applying machine learning algorithms to complex tasks are well known. They open wide opportunities in natural language processing, image recognition, and predictive analysis. However, their use in low-power intelligent systems is restricted because of high computational complexity and memory requirements. This group includes a wide variety of devices, from smartphones and Internet of Things (IoT)smart sensors to unmanned aerial vehicles (UAVs), self-driving cars, and nodes of Edge Computing systems. All of these devices have severe limitations to their weight and power consumption. To apply neural networks in these systems efficiently, specialized hardware accelerators are used. However, hardware implementation of some neural network operations is a challenging task. Sigmoid activation is popular in the classification problem and is a notable example of such a complex operation because it uses division and exponentiation. The paper proposes efficient implementations of this activation for dynamically reconfigurable accelerators. Reconfigurable computing environments (RCE) allow achieving reconfigurability of accelerators. The paper shows the advantages of applying such accelerators in low-power systems, proposes the centralized and distributed hardware implementations of the sigmoid, presents comparisons with the results of other studies, and describes application of the proposed approaches to other activation functions. Timing simulations of the developed Verilog modules show low delay (14-18.5 ns) with acceptable accuracy (average absolute error is 4 x 10(-3)).
引用
收藏
页数:16
相关论文
共 32 条
[1]  
[Anonymous], ARXIV
[2]   Design of a Model of a Reconfigurable Computing Environment for Determining Image Gradient Characteristics [J].
Bondarchuk, A. S. ;
Shashev, D., V ;
Shidlovskiy, S., V .
OPTOELECTRONICS INSTRUMENTATION AND DATA PROCESSING, 2021, 57 (02) :132-140
[3]  
Brown TB, 2020, ADV NEUR IN, V33
[4]   A Review of Deep Learning Methods and Applications for Unmanned Aerial Vehicles [J].
Carrio, Adrian ;
Sampedro, Carlos ;
Rodriguez-Ramos, Alejandro ;
Campoy, Pascual .
JOURNAL OF SENSORS, 2017, 2017
[5]   GPU based model-predictive path control for self-driving vehicles [J].
Chajan, Eduard ;
Schulte-Tigges, Joschua ;
Reke, Michael ;
Ferrein, Alexander ;
Matheis, Dominik ;
Walter, Thomas .
2021 32ND IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2021, :1243-1248
[6]  
Chen J., MAKE EVERY FEATURE B
[7]   Eyeriss v2: A Flexible Accelerator for Emerging Deep Neural Networks on Mobile Devices [J].
Chen, Yu-Hsin ;
Yange, Tien-Ju ;
Emer, Joel S. ;
Sze, Vivienne .
IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2019, 9 (02) :292-308
[8]   Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks [J].
Chen, Yu-Hsin ;
Krishna, Tushar ;
Emer, Joel S. ;
Sze, Vivienne .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2017, 52 (01) :127-138
[9]  
Evreinov E.V, 1981, Homogeneous Computing Systems, Structures and Environments
[10]  
Faiedh H, 2001, INT C MICROELECTRON, P189