Exploring Portability and Performance of OpenCL FPGA Kernels on Intel HARPv2

被引:5
作者
Cabrera, Anthony M. [1 ]
Chamberlain, Roger D. [1 ]
机构
[1] Washington Univ, St Louis, MO 63110 USA
来源
PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON OPENCL (IWOCL'19) | 2019年
关键词
High Level Synthesis; Needleman-Wunsch; FPGA; Design Space Search; Shared Virtual Memory;
D O I
10.1145/3318170.3318180
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
FPGAs offer a heterogenous compute solution to the continuous desire for increased performance by enabling the creation of application-specific hardware that accelerates computation. While the barrier to entry has historically been steep, advances in High Level Synthesis (HLS) are making FPGAs more accessible. Specifically, the Intel FPGA OpenCL SDK allows software designers to abstract away low level details of architecting hardware on an FPGA and allows them to author computational kernels in a higher level language. Furthermore, Intel has developed a system that incorporates both a multicore Xeon CPU and Arria 10 FPGA into the same chip package as part of the Heterogeneous Accelerator Research Program (HARP) that can be targeted by their SDK. In this work, we target the second iteration of the HARP platform (HARPv2) using HLS through porting of OpenCL kernels originally written for FPGAs connected via a PCIe bus. We evaluate the HARPv2 system's performance against previously reported results, explore the portability of kernels through a hardware design space search, and empirically show the benefits of using the shared virtual memory (SVM) abstraction over explicit reads and writes.
引用
收藏
页数:10
相关论文
共 34 条
  • [1] Alves Fredy Augusto M, 2017, P INT C REC COMP EPG
  • [2] [Anonymous], 2019, Amazon EC2 Instance Types
  • [3] Bachrach J, 2012, DES AUT CON, P1212
  • [4] Cabrera AnthonyM., 2019, Exploring Portability and Performance of OpenCL FPGA Kernels on Intel HARPv2: Research Artifacts, DOI [DOI 10.7936/M2YQ-A123, 10.7936/m2yq-a123.]
  • [5] A Programmable Parallel Accelerator for Learning and Classification
    Cadambi, Srihari
    Majumdar, Abhinandan
    Becchi, Michela
    Chakradhar, Srimat
    Graf, Hans Peter
    [J]. PACT 2010: PROCEEDINGS OF THE NINETEENTH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2010, : 273 - 283
  • [6] Canis A, 2011, FPGA 11: PROCEEDINGS OF THE 2011 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD PROGRAMMABLE GATE ARRAYS, P33
  • [7] Caulfield Adrian M., 2016, P 49 IEEE ACM INT S
  • [8] Che SA, 2009, I S WORKL CHAR PROC, P44, DOI 10.1109/IISWC.2009.5306797
  • [9] A Quantitative Analysis on Microarchitectures of Modern CPU-FPGA Platforms
    Choi, Young-Kyu
    Cong, Jason
    Fang, Zhenman
    Hao, Yuchen
    Reinman, Glenn
    Wei, Peng
    [J]. 2016 ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2016,
  • [10] Faict Thomas, 2018, THESIS