Optimizing OpenCL-Based CNN Design on FPGA with Comprehensive Design Space Exploration and Collaborative Performance Modeling

被引:9
|
作者
Mu, Jiandong [1 ]
Zhang, Wei [1 ]
Liang, Hao [2 ]
Sinha, Sharad [3 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Indian Inst Technol IIT, Veling, Goa, India
关键词
CNN; modeling; hardware design; design space exploration;
D O I
10.1145/3397514
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent success in applying convolutional neural networks (CNNs) to object detection and classification has sparked great interest in accelerating CNNs using hardware-like field-programmable gate arrays (FPGAs). However, finding an efficient FPGA design for a given CNN model and FPGA board is not trivial since a strong background in hardware design and detailed knowledge of the target board are required. In this work, we try to solve this problem by design space exploration with a collaborative framework. Our framework consists of three main parts: FPGA design generation, coarse-grained modeling, and fine-grained modeling. In the FPGA design generation, we propose a novel data structure, LoopTree, to capture the details of the FPGA design for CNN applications without writing down the source code. Different LoopTrees, which indicate different FPGA designs, are automatically generated in this process. A coarse-grained model will evaluate LoopTrees at the operation level, e.g., add, mult, and so on, so that the most efficient LoopTrees can be selected. A fine-grained model, which is based on the source code, will then refine the selected design in a cycle-accurate manner. A set of comprehensive OpenCL-based designs have been implemented on board to verify our framework. An average estimation error of 8.87% and 4.8% has been observed for our coarse-grained model and fine-grained model, respectively. This is much lower than the prevalent operation-statistics-based estimation, which is obtained according to a predefined formula for specific loop schedules.
引用
收藏
页数:28
相关论文
共 50 条
  • [21] Security Based Design Space Exploration for CPS
    Gressl, Lukas
    Rech, Alexander
    Steger, Christian
    Sinnhofer, Andreas
    Weissnegger, Ralph
    PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 593 - 595
  • [22] A Feedback-based Design Space Exploration Subsystem for the Automation of Architectures Synthesis on Proprietary FPGA Toolchains
    Pappalardo, Alessandro
    Natale, Giuseppe
    Santambrogio, Marco Domenico
    2017 EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD), 2017, : 151 - 154
  • [23] A Time Efficient Comprehensive Model of Approximate Multipliers for Design Space Exploration
    Cui, Ziying
    Chen, Ke
    Wu, Bi
    Yan, Chenggang
    Gong, Yu
    Liu, Weiqiang
    PROCEEDINGS 2024 IEEE 31ST SYMPOSIUM ON COMPUTER ARITHMETIC, ARITH 2024, 2024, : 116 - 123
  • [24] A Comprehensive Model for Efficient Design Space Exploration of Imprecise Computational Blocks
    Javadi, Mohammad Haji Seyed
    Faryabi, Mohsen
    Mahdiani, Hamid Reza
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (06)
  • [25] Efficient architectural design space exploration via predictive modeling
    Ipek, Engin
    McKee, Sally A.
    Singh, Karan
    Caruana, Rich
    De Supinski, Bronis R.
    Schulz, Martin
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2007, 4 (04) : 1 - 34
  • [26] Accelerating convolutional neural networks on FPGA platforms: a high-performance design methodology using OpenCL
    Gdaim, Soufien
    Mtibaa, Abdellatif
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2025, 22 (02)
  • [27] Optimizing RTL to C Abstraction Methodologies to improve HLS Design Space Exploration
    Mahapatra, Anushree
    Schafer, Benjamin Carrion
    2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [28] AERO: Design Space Exploration Framework for Resource-Constrained CNN Mapping on Tile-Based Accelerators
    Yang, Simei
    Bhattacharjee, Debjyoti
    Kumar, Vinay B. Y.
    Chatterjee, Saikat
    De, Sayandip
    Debacker, Peter
    Verkest, Diederik
    Mallik, Arindam
    Catthoor, Francky
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2022, 12 (02) : 508 - 521
  • [29] Model-Based Design Space Exploration for FPGA-based Image Processing Applications Employing Parameterizable Approximations
    Conrady, Simon
    Kreddig, Arne
    Manuel, Manu
    Nguyen Anh Vu Doan
    Stechele, Walter
    MICROPROCESSORS AND MICROSYSTEMS, 2021, 87
  • [30] Fast Performance Estimation and Design Space Exploration of Manycore-based Neural Processors
    Kang, Jintaek
    Jung, Dowhan
    Chung, Kwanghyun
    Ha, Soonhoi
    PROCEEDINGS OF THE 2019 56TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2019,