A Survey of Convolutional Neural Networks on Edge with Reconfigurable Computing

被引:81
作者
Vestias, Mario P. [1 ]
机构
[1] Inst Politecn Lisboa, Inst Super Engn Lisboa, INESC ID, P-1500335 Lisbon, Portugal
关键词
deep learning; convolutional neural network; reconfigurable computing; field-programmable gate array; edge inference;
D O I
10.3390/a12080154
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The convolutional neural network (CNN) is one of the most used deep learning models for image detection and classification, due to its high accuracy when compared to other machine learning algorithms. CNNs achieve better results at the cost of higher computing and memory requirements. Inference of convolutional neural networks is therefore usually done in centralized high-performance platforms. However, many applications based on CNNs are migrating to edge devices near the source of data due to the unreliability of a transmission channel in exchanging data with a central server, the uncertainty about channel latency not tolerated by many applications, security and data privacy, etc. While advantageous, deep learning on edge is quite challenging because edge devices are usually limited in terms of performance, cost, and energy. Reconfigurable computing is being considered for inference on edge due to its high performance and energy efficiency while keeping a high hardware flexibility that allows for the easy adaption of the target computing platform to the CNN model. In this paper, we described the features of the most common CNNs, the capabilities of reconfigurable computing for running CNNs, the state-of-the-art of reconfigurable computing implementations proposed to run CNN models, as well as the trends and challenges for future edge reconfigurable platforms.
引用
收藏
页数:24
相关论文
共 77 条
[1]   Cnvlutin: Ineffectual-Neuron-Free Deep Neural Network Computing [J].
Albericio, Jorge ;
Judd, Patrick ;
Hetherington, Tayler ;
Aamodt, Tor ;
Jerger, Natalie Enright ;
Moshovos, Andreas .
2016 ACM/IEEE 43RD ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2016, :1-13
[2]  
[Anonymous], PROC CVPR IEEE
[3]  
[Anonymous], FLEX LOG IMPR DEEP L
[4]  
[Anonymous], ARXIV190104988
[5]  
[Anonymous], 2016, ARXIV160207360
[6]  
[Anonymous], 2016, ARXIV161207119
[7]  
[Anonymous], P APPT 2017 OSL NORW
[8]  
[Anonymous], 1980, ARITHMETIC COMPLEXIT, DOI [DOI 10.1137/1.9781611970364, 10.1137/1.9781611970364]
[9]  
[Anonymous], 2015, 32 ICML
[10]  
[Anonymous], 2009, VISUALIZING HIGHER L