Fast Design Exploration for Performance, Power and Accuracy Tradeoffs in FPGA-Based Accelerators

被引:5
|
作者
Ulusel, Onur [1 ]
Nepal, Kumud [1 ]
Bahar, R. Iris [1 ]
Reda, Sherief [1 ]
机构
[1] Brown Univ, Sch Engn, Providence, RI 02912 USA
关键词
Design; Performance; Block-matching; design space exploration; fast regression analysis; hardware accelerators; image deblur; multi-objective co-exploration; real time image processing; SPACE EXPLORATION;
D O I
10.1145/2567661
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The ease-of-use and reconfigurability of FPGAs makes them an attractive platform for accelerating algorithms. However, accelerating becomes a challenging task as the large number of possible design parameters lead to different accelerator variants. In this article, we propose techniques for fast design exploration and multi-objective optimization to quickly identify both algorithmic and hardware parameters that optimize these accelerators. This information is used to run regression analysis and train mathematical models within a nonlinear optimization framework to identify the optimal algorithm and design parameters under various objectives and constraints. To automate and improve the model generation process, we propose the use of L-1-regularized least squares regression techniques. We implement two real-time image processing accelerators as test cases: one for image deblurring and one for block matching. For these designs, we demonstrate that by sampling only a small fraction of the design space (0.42% and 1.1%), our modeling techniques are accurate within 2%-4% for area and throughput, 8%-9% for power, and 5%-6% for arithmetic accuracy. We show speedups of 340x and 90x in time for the test cases compared to brute-force enumeration. We also identify the optimal set of parameters for a number of scenarios (e.g., minimizing power under arithmetic inaccuracy bounds).
引用
收藏
页数:22
相关论文
共 50 条
  • [41] Optimization of FPGA-based CNN accelerators using metaheuristics
    Sadiq M. Sait
    Aiman El-Maleh
    Mohammad Altakrouri
    Ahmad Shawahna
    The Journal of Supercomputing, 2023, 79 : 4493 - 4533
  • [42] A generic execution framework for shared FPGA-based accelerators
    Alexandru, Dumitru Laurentiu
    Maniu, Rares
    2017 INTERNATIONAL CONFERENCE ON OPTIMIZATION OF ELECTRICAL AND ELECTRONIC EQUIPMENT (OPTIM) & 2017 INTL AEGEAN CONFERENCE ON ELECTRICAL MACHINES AND POWER ELECTRONICS (ACEMP), 2017, : 803 - 808
  • [43] Families of FPGA-based accelerators for approximate string matching
    Van Court, Tom
    Herbordt, Martin C.
    MICROPROCESSORS AND MICROSYSTEMS, 2007, 31 (02) : 135 - 145
  • [44] A compact shader for FPGA-based volume rendering accelerators
    Knittel, G.
    RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2007, 4419 : 271 - 282
  • [45] Optimization of FPGA-based CNN accelerators using metaheuristics
    Sait, Sadiq M.
    El-Maleh, Aiman
    Altakrouri, Mohammad
    Shawahna, Ahmad
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (04): : 4493 - 4533
  • [46] A Survey and Taxonomy of FPGA-based Deep Learning Accelerators
    Blaiech, Ahmed Ghazi
    Ben Khalifa, Khaled
    Valderrama, Carlos
    Fernandes, Marcelo A. C.
    Bedoui, Mohamed Hedi
    JOURNAL OF SYSTEMS ARCHITECTURE, 2019, 98 : 331 - 345
  • [47] A survey of FPGA-based accelerators for convolutional neural networks
    Mittal, Sparsh
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (04): : 1109 - 1139
  • [48] An FPGA-based Emulation Platform for Edge Computing Node Design Exploration
    Soriano, Theo
    Novo, David
    Benoit, Pascal
    PROCEEDINGS OF THE 2021 32ND INTERNATIONAL WORKSHOP ON RAPID SYSTEM PROTOTYPING (RSP): SHORTENING THE PATH FROM SPECIFICATION TO PROTOTYPE, 2021, : 8 - 14
  • [49] Design Space Exploration of FPGA-Based Deep Convolutional Neural Networks
    Motamedi, Mohammad
    Gysel, Philipp
    Akella, Venkatesh
    Ghiasi, Soheil
    2016 21ST ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2016, : 575 - 580
  • [50] Accuracy-Performance Tradeoffs on an FPGA Through Overclocking
    Shi, Kan
    Boland, David
    Constantinides, George A.
    2013 IEEE 21ST ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2013, : 29 - 36