Hardware Acceleration of Multilayer Perceptron Based on Inter-Layer Optimization

被引:3
作者
Chen, Shenggang [1 ]
Lu, Zhonghai [2 ]
机构
[1] Natl Univ Def Technol, Sch Comp, Changsha, Peoples R China
[2] KTH Royal Inst Technol, Dept Elect & Embedded Syst, Stockholm, Sweden
来源
2019 IEEE 37TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2019) | 2019年
关键词
Multilayer Perceptron; Hardware Accelerator; Inter-Layer Optimization; RECONFIGURABLE ARCHITECTURE; NEURAL-NETWORK; PERFORMANCE;
D O I
10.1109/ICCD46524.2019.00028
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Multilayer Perceptron (MLP) is used in a broad range of applications. Hardware acceleration of MLP is one most promising way to provide better performance-energy efficiency. Previous works focused on the intra-layer optimization and layer-after-layer processing, while leaving the inter-layer optimization never studied. In this paper, we propose hardware acceleration of MLPs based on inter-layer optimization which allows us to overlap the execution of MLP layers. First we describe the inter-layer optimization from software and mathematical perspectives. Then, a reference Two-Neuron architecture which is efficient to support the inter-layer optimization is proposed and implemented. Discussions about area cost, performance and energy consumption are carried out to explore the scalability of the Two-Neuron architecture. Results show that the proposed MLP design optimized across layers achieves better performance and energy efficiency than the conventional intra-layer optimized designs. As such, the inter-layer optimization provides another possible direction other than the intra-layer optimization to gain further performance and energy improvements for the hardware acceleration of MLPs.
引用
收藏
页码:164 / 172
页数:9
相关论文
共 23 条
[1]  
[Anonymous], P ANN DES AUT C NY U
[2]  
[Anonymous], 2017, ARXIV170404861
[3]  
Bahoura M., 2011, 2011 18th IEEE International Conference on Electronics, Circuits and Systems (ICECS 2011), P426, DOI 10.1109/ICECS.2011.6122304
[4]   DianNao: A Small-Footprint High-Throughput Accelerator for Ubiquitous Machine-Learning [J].
Chen, Tianshi ;
Du, Zidong ;
Sun, Ninghui ;
Wang, Jia ;
Wu, Chengyong ;
Chen, Yunji ;
Temam, Olivier .
ACM SIGPLAN NOTICES, 2014, 49 (04) :269-283
[5]   Eyeriss: A Spatial Architecture for Energy-Efficient Dataflow for Convolutional Neural Networks [J].
Chen, Yu-Hsin ;
Emer, Joel ;
Sze, Vivienne .
2016 ACM/IEEE 43RD ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2016, :367-379
[6]  
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
[7]  
do A. Ferreira Antonyus P., 2010, Proceedings of the 2010 17th IEEE International Conference on Electronics, Circuits and Systems (ICECS 2010), P742, DOI 10.1109/ICECS.2010.5724619
[8]  
Domingos P. O., 2005, Proceedings. 2005 International Conference on Field Programmable Logic and Applications (IEEE Cat. No.05EX1155), P89
[9]   Neural Acceleration for General-Purpose Approximate Programs [J].
Esmaeilzadeh, Hadi ;
Sampson, Adrian ;
Ceze, Luis ;
Burger, Doug .
2012 IEEE/ACM 45TH INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-45), 2012, :449-460
[10]   Artificial neural networks (the multilayer perceptron) - A review of applications in the atmospheric sciences [J].
Gardner, MW ;
Dorling, SR .
ATMOSPHERIC ENVIRONMENT, 1998, 32 (14-15) :2627-2636