Fast Design Exploration for Performance, Power and Accuracy Tradeoffs in FPGA-Based Accelerators

Cited by: 6
Authors
Ulusel, Onur [1]
Nepal, Kumud [1]
Bahar, R. Iris [1]
Reda, Sherief [1]
Affiliation
[1] Brown Univ, Sch Engn, Providence, RI 02912 USA
Keywords
Design; Performance; Block-matching; design space exploration; fast regression analysis; hardware accelerators; image deblur; multi-objective co-exploration; real time image processing; SPACE EXPLORATION
DOI
10.1145/2567661
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The ease of use and reconfigurability of FPGAs make them an attractive platform for accelerating algorithms. However, acceleration becomes challenging because the large number of possible design parameters leads to many different accelerator variants. In this article, we propose techniques for fast design exploration and multi-objective optimization to quickly identify both algorithmic and hardware parameters that optimize these accelerators. Data from a sampled subset of the design space is used to run regression analysis and train mathematical models within a nonlinear optimization framework to identify the optimal algorithm and design parameters under various objectives and constraints. To automate and improve the model generation process, we propose the use of L1-regularized least squares regression techniques. We implement two real-time image processing accelerators as test cases: one for image deblurring and one for block matching. For these designs, we demonstrate that by sampling only a small fraction of the design space (0.42% and 1.1%), our modeling techniques are accurate to within 2%-4% for area and throughput, 8%-9% for power, and 5%-6% for arithmetic accuracy. We show speedups of 340x and 90x for the two test cases compared to brute-force enumeration. We also identify the optimal set of parameters for a number of scenarios (e.g., minimizing power under arithmetic inaccuracy bounds).
Pages: 22
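Illustrative sketch
The flow summarized in the abstract (sample a few designs, fit L1-regularized least squares models, then optimize on the models under a constraint) can be sketched as follows. This is a minimal illustration, not the authors' implementation. Everything in it is an assumption made for the example: the three parameters (bit width, parallel units, window size), the synthetic `true_power` and `true_error` functions that stand in for real synthesis measurements, the quadratic feature set, the regularization weight, the 40-point sample (a larger fraction than the paper reports), and the error bound of 0.044. The constrained search is a plain enumeration over model predictions rather than a nonlinear optimization framework.

```python
import itertools
import numpy as np

def quad_features(z):
    """Expand scaled parameters z (n x d) into 1, z_i, and z_i*z_j terms."""
    n, d = z.shape
    cols = [np.ones(n)] + [z[:, i] for i in range(d)]
    cols += [z[:, i] * z[:, j] for i in range(d) for j in range(i, d)]
    return np.column_stack(cols)

def lasso_fit(A, y, lam=0.005, sweeps=1000):
    """Coordinate descent for min_w 0.5*||A w - y/s||^2 + lam*||w||_1, s = max|y|.

    The target is rescaled so one lam suits metrics of different magnitude.
    For simplicity the intercept is penalized like any other coefficient.
    """
    scale = np.abs(y).max()
    resid = y / scale                     # residual t - A w, starting from w = 0
    w = np.zeros(A.shape[1])
    col_sq = (A ** 2).sum(axis=0)
    for _ in range(sweeps):
        for j in range(A.shape[1]):
            resid += A[:, j] * w[j]       # remove feature j's contribution
            rho = A[:, j] @ resid
            w[j] = np.sign(rho) * max(abs(rho) - lam, 0.0) / col_sq[j]  # soft threshold
            resid -= A[:, j] * w[j]       # add the updated contribution back
    return w * scale                      # coefficients in the original units

# Hypothetical design space: bit width x parallel units x window size (416 points).
bits, units, wins = np.arange(4, 17), np.arange(1, 9), np.array([3, 5, 7, 9])
space = np.array(list(itertools.product(bits, units, wins)), dtype=float)
lo, hi = space.min(axis=0), space.max(axis=0)
Z = 2 * (space - lo) / (hi - lo) - 1      # scale every parameter to [-1, 1]

# Synthetic stand-ins for measured power and arithmetic error (assumptions).
def true_power(z):
    return 2.0 + 0.3 * z[:, 0] + 0.6 * z[:, 1] + 0.2 * z[:, 2] + 0.15 * z[:, 0] * z[:, 1]

def true_error(z):
    return 0.05 - 0.03 * z[:, 0] + 0.01 * z[:, 0] ** 2 - 0.005 * z[:, 2]

# "Measure" a small random sample of designs, with 0.5% multiplicative noise.
rng = np.random.default_rng(0)
idx = rng.choice(len(space), size=40, replace=False)
A = quad_features(Z[idx])
p_meas = true_power(Z[idx]) * (1 + 0.005 * rng.standard_normal(40))
e_meas = true_error(Z[idx]) * (1 + 0.005 * rng.standard_normal(40))

# Fit one sparse model per metric, then predict over the whole design space.
w_p, w_e = lasso_fit(A, p_meas), lasso_fit(A, e_meas)
A_all = quad_features(Z)
p_hat, e_hat = A_all @ w_p, A_all @ w_e

# Minimize predicted power subject to a bound on predicted arithmetic error.
bound = 0.044
feasible = e_hat <= bound
best = np.flatnonzero(feasible)[np.argmin(p_hat[feasible])]
print("chosen (bits, units, window):", space[best])
print("predicted power, error:", p_hat[best], e_hat[best])
```

Because the models are cheap to evaluate, the full space can be scored from the 40 sampled designs alone. In a real flow each sampled point would instead require a synthesis and simulation run, and the selected design should be re-measured to confirm it meets the bound.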