Evaluating Fast Algorithms for Convolutional Neural Networks on FPGAs

被引:109
作者
Liang, Yun [1 ,2 ]
Lu, Liqiang [3 ]
Xiao, Qingcheng [3 ]
Yan, Shengen [4 ]
机构
[1] Peking Univ, Sch EECS, Beijing 100871, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[3] Peking Univ, Ctr Energy Efficient Comp & Applicat, Beijing 100871, Peoples R China
[4] SenseTime, Algorithm Platform Dept, Hong Kong, Peoples R China
基金
北京市自然科学基金;
关键词
Field programmable gate arrays; Convolution; Space exploration; Prediction algorithms; Transforms; Analytical models; Convolutional neural networks; Convolutional neural network (CNN); fast algorithm; fast Fourier transformation (FFT); field-programmable gate array (FPGA); Winograd; HIGH-LEVEL SYNTHESIS; PERFORMANCE;
D O I
10.1109/TCAD.2019.2897701
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, convolutional neural networks (CNNs) have become widely adopted for computer vision tasks. Field-programmable gate arrays (FPGAs) have been adequately explored as a promising hardware accelerator for CNNs due to its high performance, energy efficiency, and reconfigurability. However, prior FPGA solutions based on the conventional convolutional algorithm is often bounded by the computational capability of FPGAs (e.g., the number of DSPs). To address this problem, the feature maps are transformed to a special domain using fast algorithms to reduce the arithmetic complexity. Winograd and fast Fourier transformation (FFT), as fast algorithm representatives, first transform input data and filter to Winograd or frequency domain, then perform element-wise multiplication, and apply inverse transformation to get the final output. In this paper, we propose a novel architecture for implementing fast algorithms on FPGAs. Our design employs line buffer structure to effectively reuse the feature map data among different tiles. We also effectively pipeline the Winograd/FFT processing element (PE) engine and initiate multiple PEs through parallelization. Meanwhile, there exists a complex design space to explore. We propose an analytical model to predict the resource usage and the performance. Then, we use the model to guide a fast design space exploration. Experiments using the state-of-the-art CNNs demonstrate the best performance and energy efficiency on FPGAs. We achieve 854.6 and 2479.6 GOP/s for AlexNet and VGG16 on Xilinx ZCU102 platform using Winograd. We achieve 130.4 GOP/s for Resnet using Winograd and 201.1 GOP/s for YOLO using FFT on Xilinx ZC706 platform.
引用
收藏
页码:857 / 870
页数:14
相关论文
共 50 条
  • [31] Convolutional neural networks combined with classification algorithms for the diagnosis of periodontitis
    Dai, Fang
    Liu, Qiangdong
    Guo, Yuchen
    Xie, Ruixiang
    Wu, Jingting
    Deng, Tian
    Zhu, Hongbiao
    Deng, Libin
    Song, Li
    ORAL RADIOLOGY, 2024, 40 (03) : 357 - 366
  • [32] Review of Image Classification Algorithms Based on Convolutional Neural Networks
    Chen, Leiyu
    Li, Shaobo
    Bai, Qiang
    Yang, Jing
    Jiang, Sanlong
    Miao, Yanming
    REMOTE SENSING, 2021, 13 (22)
  • [33] Applications and Comparisons of Optimization Algorithms Used in Convolutional Neural Networks
    Seyyarer, Ebubekir
    Uckan, Taner
    Hark, Cengiz
    Ayata, Faruk
    Inan, Mevlut
    Karci, Ali
    2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP 2019), 2019,
  • [34] Evaluating fish feeding intensity in aquaculture with convolutional neural networks
    Ubina, Naomi
    Cheng, Shyi-Chyi
    Chang, Chin-Chun
    Chen, Hung-Yuan
    AQUACULTURAL ENGINEERING, 2021, 94
  • [35] DeepFire2: A Convolutional Spiking Neural Network Accelerator on FPGAs
    Aung, Myat Thu Linn
    Gerlinghoff, Daniel
    Qu, Chuping
    Yang, Liwei
    Huang, Tian
    Goh, Rick Siow Mong
    Luo, Tao
    Wong, Weng-Fai
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (10) : 2847 - 2857
  • [36] Fast battery capacity estimation using convolutional neural networks
    Li, Yihuan
    Li, Kang
    Liu, Xuan
    Zhang, Li
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2020,
  • [37] RadioUNet: Fast Radio Map Estimation With Convolutional Neural Networks
    Levie, Ron
    Yapar, Cagkan
    Kutyniok, Gitta
    Caire, Giuseppe
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (06) : 4001 - 4015
  • [38] An Efficient Task Assignment Framework to Accelerate DPU-Based Convolutional Neural Network Inference on FPGAs
    Zhu, Jiang
    Wang, Lizan
    Liu, Haolin
    Tian, Shujuan
    Deng, Qingyong
    Li, Jianqi
    IEEE ACCESS, 2020, 8 : 83224 - 83237
  • [39] Adaptive Integer Quantisation for Convolutional Neural Networks through Evolutionary Algorithms
    Wang, Ziwei
    Trefzer, Martin A.
    Bale, Simon J.
    Tyrrell, Andy M.
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [40] Evaluating Convolutional Neural Networks for No-Reference Image Quality Assessment
    Apostolidis, Kyriakos D.
    Polyzos, Theodore
    Grigoriadis, Ioannis
    Papakostas, George A.
    2021 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INFORMATION SECURITY (ICSPIS), 2021,