Performance Modeling for FPGAs: Extending the Roofline Model with High-Level Synthesis Tools

被引:43
作者
da Silva, Bruno [1 ]
Braeken, An [1 ]
D'Hollander, Erik H. [2 ]
Touhafi, Abdellah [1 ,3 ]
机构
[1] Vrije Univ Brussel, INDI Dept, B-1050 Brussels, Belgium
[2] Univ Ghent, ELIS Dept, B-9000 Ghent, Belgium
[3] Vrije Univ Brussel, ETRO Dept, B-1050 Brussels, Belgium
关键词
High level synthesis;
D O I
10.1155/2013/428078
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The potential of FPGAs as accelerators for high-performance computing applications is very large, but many factors are involved in their performance. The design for FPGAs and the selection of the proper optimizations when mapping computations to FPGAs lead to prohibitively long developing time. Alternatives are the high-level synthesis (HLS) tools, which promise a fast design space exploration due to design at high-level or analytical performance models which provide realistic performance expectations, potential impediments to performance, and optimization guidelines. In this paper we propose the combination of both, in order to construct a performance model for FPGAs which is able to visually condense all the helpful information for the designer. Our proposed model extends the roofline model, by considering the resource consumption and the parameters used in the HLS tools, to maximize the performance and the resource utilization within the area of the FPGA. The proposed model is applied to optimize the design exploration of a class of window-based image processing applications using two different HLS tools. The results show the accuracy of the model as well as its flexibility to be combined with any HLS tool.
引用
收藏
页数:10
相关论文
共 18 条
[1]   Performance Analysis Framework for High-Level Language Applications in Reconfigurable Computing [J].
Curreri, John ;
Koehler, Seth ;
George, Alan D. ;
Holland, Brian ;
Garcia, Rafael .
ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2010, 3 (01)
[2]  
da Silva B., 2013, FPGA, P274
[3]  
da Silva B., 2013, PERFORMANCE RESOURCE
[4]   Accurate Area, Time and Power Models for FPGA-Based Implementations [J].
Deng, Lanping ;
Sobti, Kanwaldeep ;
Zhang, Yuanrui ;
Chakrabarti, Chaitali .
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2011, 63 (01) :39-50
[5]   Input data reuse in compiling window operations onto reconfigurable hardware [J].
Guo, Z ;
Buyukkurt, B ;
Najjar, W .
ACM SIGPLAN NOTICES, 2004, 39 (07) :249-256
[6]  
Holland B., 2009, ACM T RECONFIG TECHN, V1, P4
[7]  
Jia HP, 2012, LECT NOTES COMPUT SC, V7484, P920, DOI 10.1007/978-3-642-32820-6_90
[8]   Performance analysis and optimization of three-dimensional FDTD on GPU using roofline model [J].
Kim, Ki-Hwan ;
Kim, KyoungHo ;
Park, Q-Han .
COMPUTER PHYSICS COMMUNICATIONS, 2011, 182 (06) :1201-1207
[9]  
Lorenzo J. A., 2011, P INT C PAR DISTR PR
[10]   The Boat Hull Model: Adapting the Roofline Model to Enable Performance Prediction for Parallel Computing [J].
Nugteren, Cedric ;
Corporaal, Henk .
ACM SIGPLAN NOTICES, 2012, 47 (08) :291-292