Optimizing OpenCL-Based CNN Design on FPGA with Comprehensive Design Space Exploration and Collaborative Performance Modeling

Cited by: 9
Authors
Mu, Jiandong [1]
Zhang, Wei [1]
Liang, Hao [2]
Sinha, Sharad [3]
Affiliations
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Indian Inst Technol IIT, Veling, Goa, India
Keywords
CNN; modeling; hardware design; design space exploration;
DOI
10.1145/3397514
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Subject classification code
0812;
Abstract
Recent success in applying convolutional neural networks (CNNs) to object detection and classification has sparked great interest in accelerating CNNs with hardware such as field-programmable gate arrays (FPGAs). However, finding an efficient FPGA design for a given CNN model and FPGA board is not trivial, since it requires a strong background in hardware design and detailed knowledge of the target board. In this work, we address this problem through design space exploration with a collaborative framework. Our framework consists of three main parts: FPGA design generation, coarse-grained modeling, and fine-grained modeling. In FPGA design generation, we propose a novel data structure, LoopTree, to capture the details of an FPGA design for CNN applications without writing the source code. Different LoopTrees, which represent different FPGA designs, are generated automatically in this process. A coarse-grained model evaluates LoopTrees at the operation level (e.g., add, mult) so that the most efficient LoopTrees can be selected. A fine-grained model, which is based on the source code, then refines the selected design in a cycle-accurate manner. A comprehensive set of OpenCL-based designs has been implemented on board to verify our framework. Average estimation errors of 8.87% and 4.8% have been observed for our coarse-grained and fine-grained models, respectively. These errors are much lower than those of the prevalent operation-statistics-based estimation, which is obtained from a predefined formula for specific loop schedules.
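The abstract does not define LoopTree or the coarse-grained model, so the following is only an illustrative sketch of the general idea: a loop nest held as a tree and evaluated at the operation level without any kernel source code. The class `LoopNode`, its fields, the functions `count_ops` and `estimate_cycles`, and the one-cycle-per-iteration cost rule are all assumptions made for this example and are not taken from the paper.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class LoopNode:
    """One loop of a loop nest (hypothetical layout, not the paper's LoopTree)."""
    name: str
    trip_count: int
    unroll: int = 1  # assumed number of parallel copies of the loop body
    # operations issued once per iteration by this loop's own body
    ops: Dict[str, int] = field(default_factory=dict)
    children: List["LoopNode"] = field(default_factory=list)


def count_ops(node: LoopNode) -> Dict[str, int]:
    """Total number of operations of each kind executed by the subtree."""
    per_iteration = dict(node.ops)
    for child in node.children:
        for op, n in count_ops(child).items():
            per_iteration[op] = per_iteration.get(op, 0) + n
    return {op: n * node.trip_count for op, n in per_iteration.items()}


def estimate_cycles(node: LoopNode) -> int:
    """Crude cycle estimate, assuming a leaf body iteration costs one cycle."""
    body = sum(estimate_cycles(c) for c in node.children) or 1
    iterations = -(-node.trip_count // node.unroll)  # ceiling division
    return body * iterations


# A toy convolution: 4 output channels, 9 output pixels, a 2x2 kernel window.
window = LoopNode("kernel_window", trip_count=4, unroll=2,
                  ops={"mult": 1, "add": 1})
pixels = LoopNode("output_pixels", trip_count=9, children=[window])
conv = LoopNode("output_channels", trip_count=4, children=[pixels])

print(count_ops(conv))
print(estimate_cycles(conv))
```

In this toy setting, changing `unroll` or reordering the nodes yields a different candidate design whose operation counts and estimated cycles can be compared before any source code exists, which is the role the abstract assigns to coarse-grained evaluation.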
Pages: 28
Related papers
50 in total
  • [31] Design of an Efficient CNN-Based Cough Detection System on Lightweight FPGA
    Peng, Peng
    Jiang, Kai
    You, Mingyu
    Xie, Jialin
    Zhou, Hongjun
    Xu, Weisheng
    Lu, Jicheng
    Li, Xiayu
    Xu, Yun
    IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2023, 17 (01) : 116 - 128
  • [32] Design space exploration of multi-core RTL via high level synthesis from OpenCL models
    Roozmeh, Mehdi
    Lavagno, Luciano
    MICROPROCESSORS AND MICROSYSTEMS, 2018, 63 : 199 - 208
  • [33] Design Space Exploration of Application Specific Number Formats Targeting an FPGA Implementation of SPICE
    Gehrunger, Jonas
    Hochberger, Christian
    APPLIED RECONFIGURABLE COMPUTING. ARCHITECTURES, TOOLS, AND APPLICATIONS, ARC 2023, 2023, 14251 : 66 - 80
  • [34] SpikeExplorer: Hardware-Oriented Design Space Exploration for Spiking Neural Networks on FPGA
    Padovano, Dario
    Carpegna, Alessio
    Savino, Alessandro
    Di Carlo, Stefano
    ELECTRONICS, 2024, 13 (09)
  • [35] Modeling of Collaborative Design Based on Colored Petri Nets
    Hou Junming
    Chong, Su
    Liang, Tang
    Wang Wanshan
    PROCEEDINGS OF THE 27TH CHINESE CONTROL CONFERENCE, VOL 6, 2008, : 428 - 432
  • [36] Enhanced function-means modeling supporting design space exploration
    Mueller, Jakob R.
    Isaksson, Ola
    Landahl, Jonas
    Raja, Visakha
    Panarotto, Massimo
    Levandowski, Christoffer
    Raudberget, Dag
    AI EDAM-ARTIFICIAL INTELLIGENCE FOR ENGINEERING DESIGN ANALYSIS AND MANUFACTURING, 2019, 33 (04): : 502 - 516
  • [37] Machine Learning for Microarchitecture Power Modeling and Design Space Exploration: A Survey
    Zhai, Jianwang
    Ling, Zichao
    Bai, Chen
    Zhao, Kang
    Yu, Bei
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (06): : 1351 - 1369
  • [38] Machine Learning Based Design Space Exploration for Hybrid Main-Memory Design
    Sen, Satyabrata
    Imam, Neena
    MEMSYS 2019: PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS, 2019, : 480 - 489
  • [39] Investigating Methods for ASPmT-Based Design Space Exploration in Evolutionary Product Design
    Mueller, Luise
    Wanko, Philipp
    Haubelt, Christian
    Schaub, Torsten
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2024, 52 (1-2) : 59 - 92
  • [40] Investigating Methods for ASPmT-Based Design Space Exploration in Evolutionary Product Design
    Luise Müller
    Philipp Wanko
    Christian Haubelt
    Torsten Schaub
    International Journal of Parallel Programming, 2024, 52 : 59 - 92