Real-Time Fixed-Point Hardware Accelerator of Convolutional Neural Network on FPGA Based

被引:3
作者
Ozkilbac, Bahadir [1 ]
Ozbek, Ibrahim Yucel [1 ]
Karacali, Tevhit [1 ]
机构
[1] Ataturk Univ, Elect Elect Engn, Erzurum, Turkey
来源
5TH INTERNATIONAL CONFERENCE ON COMPUTING AND INFORMATICS (ICCI 2022) | 2022年
关键词
FPGA; ARM; CNN; hardware accelerator;
D O I
10.1109/ICCI54321.2022.9756093
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional neural networks (CNN), which have the advantage of automatically detecting the important features of the input data without any human interfere, are widely used in many applications such as face recognition, speech recognition, image classification and object detection. In real-time CNN applications, computation speed is very important as well as accuracy. However, in some applications with high computational complexity, available systems are insufficient to meet the high-speed performance demand at low power consumption. In this study, the design of the CNN accelerator hardware in FPGA is presented to meet the speed demand. In this design, CNN is considered as a streaming interface application. Thus, temporary storage amount and memory latency are reduced. Each layer is designed with maximum parallelism, taking advantage of the FPGA. Because fixed-point number representation has the advantage of low latency, it is preferred in design with negligible sacrifice of accuracy. Thus, forward propagation of a CNN can be executed at high speed in FPGA. In order to compare real-time performance, digit classification application is executed in this hardware designed in FPGA and ARM processor on the same chip. The real-time results show that the application in the hardware designed in the FPGA is 30x faster than the ARM processor. (Abstract)
引用
收藏
页码:1 / 5
页数:5
相关论文
共 22 条
[1]   DaDianNao: A Machine-Learning Supercomputer [J].
Chen, Yunji ;
Luo, Tao ;
Liu, Shaoli ;
Zhang, Shijin ;
He, Liqiang ;
Wang, Jia ;
Li, Ling ;
Chen, Tianshi ;
Xu, Zhiwei ;
Sun, Ninghui ;
Temam, Olivier .
2014 47TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2014, :609-622
[2]  
Chetlur Sharan, 2014, CUDNN EFFICIENT PRIM
[3]  
Di Carlo S., 2011, 2011 IEEE 6th International Design and Test Workshop, P88, DOI 10.1109/IDT.2011.6123108
[4]   ShiDianNao: Shifting Vision Processing Closer to the Sensor [J].
Du, Zidong ;
Fasthuber, Robert ;
Chen, Tianshi ;
Ienne, Paolo ;
Li, Ling ;
Luo, Tao ;
Feng, Xiaobing ;
Chen, Yunji ;
Temam, Olivier .
2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2015, :92-104
[5]  
Guo J, 2020, J MACH LEARN RES, V21
[6]   Deception in the eyes of deceiver: A computer vision and machine learning based automated deception detection [J].
Khan, Wasiq ;
Crockett, Keeley ;
O'Shea, James ;
Hussain, Abir ;
Khan, Bilal M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 169
[7]   Voice Recognition and Document Classification-Based Data Analysis for Voice Phishing Detection [J].
Kim, Jeong-Wook ;
Hong, Gi-Wan ;
Chang, Hangbae .
HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2021, 11
[8]   A Survey of Deep Learning Applications to Autonomous Vehicle Control [J].
Kuutti, Sampo ;
Bowden, Richard ;
Jin, Yaochu ;
Barber, Phil ;
Fallah, Saber .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (02) :712-733
[9]   Backpropagation Applied to Handwritten Zip Code Recognition [J].
LeCun, Y. ;
Boser, B. ;
Denker, J. S. ;
Henderson, D. ;
Howard, R. E. ;
Hubbard, W. ;
Jackel, L. D. .
NEURAL COMPUTATION, 1989, 1 (04) :541-551
[10]   A High Performance FPGA-based Accelerator for Large-Scale Convolutional Neural Networks [J].
Li, Huimin ;
Fan, Xitian ;
Jiao, Li ;
Cao, Wei ;
Zhou, Xuegong ;
Wang, Lingli .
2016 26TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2016,