Hardware Acceleration of Multilayer Perceptron Based on Inter-Layer Optimization

被引：3

作者：

Chen, Shenggang ^{[1
]}

Lu, Zhonghai ^{[2
]}

机构：

[1] Natl Univ Def Technol, Sch Comp, Changsha, Peoples R China

[2] KTH Royal Inst Technol, Dept Elect & Embedded Syst, Stockholm, Sweden

来源：

2019 IEEE 37TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2019) | 2019年

关键词：

Multilayer Perceptron; Hardware Accelerator; Inter-Layer Optimization; RECONFIGURABLE ARCHITECTURE; NEURAL-NETWORK; PERFORMANCE;

D O I：

10.1109/ICCD46524.2019.00028

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multilayer Perceptron (MLP) is used in a broad range of applications. Hardware acceleration of MLP is one most promising way to provide better performance-energy efficiency. Previous works focused on the intra-layer optimization and layer-after-layer processing, while leaving the inter-layer optimization never studied. In this paper, we propose hardware acceleration of MLPs based on inter-layer optimization which allows us to overlap the execution of MLP layers. First we describe the inter-layer optimization from software and mathematical perspectives. Then, a reference Two-Neuron architecture which is efficient to support the inter-layer optimization is proposed and implemented. Discussions about area cost, performance and energy consumption are carried out to explore the scalability of the Two-Neuron architecture. Results show that the proposed MLP design optimized across layers achieves better performance and energy efficiency than the conventional intra-layer optimized designs. As such, the inter-layer optimization provides another possible direction other than the intra-layer optimization to gain further performance and energy improvements for the hardware acceleration of MLPs.

引用

页码：164 / 172

页数：9

共 23 条

[1]

[Anonymous], P ANN DES AUT C NY U

[2]

[Anonymous], 2017, ARXIV170404861

[3]

Bahoura M., 2011, 2011 18th IEEE International Conference on Electronics, Circuits and Systems (ICECS 2011), P426, DOI 10.1109/ICECS.2011.6122304

[4] DianNao: A Small-Footprint High-Throughput Accelerator for Ubiquitous Machine-Learning [J].

Chen, Tianshi ;

Du, Zidong ;

Sun, Ninghui ;

Wang, Jia ;

Wu, Chengyong ;

Chen, Yunji ;

Temam, Olivier .

ACM SIGPLAN NOTICES, 2014, 49 (04) :269-283

[5] Eyeriss: A Spatial Architecture for Energy-Efficient Dataflow for Convolutional Neural Networks [J].

Chen, Yu-Hsin ;

Emer, Joel ;

Sze, Vivienne .

2016 ACM/IEEE 43RD ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2016, :367-379

[6]

Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274

[7]

do A. Ferreira Antonyus P., 2010, Proceedings of the 2010 17th IEEE International Conference on Electronics, Circuits and Systems (ICECS 2010), P742, DOI 10.1109/ICECS.2010.5724619

[8]

Domingos P. O., 2005, Proceedings. 2005 International Conference on Field Programmable Logic and Applications (IEEE Cat. No.05EX1155), P89

[9] Neural Acceleration for General-Purpose Approximate Programs [J].

Esmaeilzadeh, Hadi ;

Sampson, Adrian ;

Ceze, Luis ;

Burger, Doug .

2012 IEEE/ACM 45TH INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-45), 2012, :449-460

[10] Artificial neural networks (the multilayer perceptron) - A review of applications in the atmospheric sciences [J].

Gardner, MW ;

Dorling, SR .

ATMOSPHERIC ENVIRONMENT, 1998, 32 (14-15) :2627-2636

← 1 2 3 →