A Study of Data Partitioning on OpenCL-based FPGAs

Citations: 0
Authors
Wang, Zeke [1]
He, Bingsheng [1]
Zhang, Wei [2]
Affiliations
[1] Nanyang Technological University, Singapore
[2] Hong Kong University of Science and Technology, Hong Kong, People's Republic of China
Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Considerable research effort has been devoted to accelerating relational database applications on FPGAs, owing to their high energy efficiency and high throughput. Most existing studies are based on hardware description languages (HDLs). Recently, FPGA vendors have started to develop OpenCL SDKs for much better programmability. In this paper, we investigate the performance of relational database applications on OpenCL-based FPGAs. As a start, we study the performance of data partitioning, a core operation widely used in relational databases. Because of its random memory accesses, data partitioning is time-consuming and can become a major bottleneck for database operators such as hash join. We start with the state-of-the-art OpenCL implementation, which was originally designed for CPUs/GPUs, and find that it suffers from lock overheads and memory bandwidth overheads. To reduce lock overheads, we develop a simple yet efficient multi-kernel approach that leverages two emerging features of the Altera OpenCL SDK, namely task kernels and channels. Moreover, on-chip buckets are employed to reduce the number of memory transactions. We further develop a cost model to guide parameter configuration. We evaluate the proposed design on a recent Altera Stratix V FPGA. Our results demonstrate that 1) our cost model accurately predicts the performance of data partitioning under different parameter settings; and 2) our multi-kernel approach achieves a 10.7X speedup over the existing OpenCL implementation. In addition, experiments with three case studies show that the optimized implementations achieve a 4-12X performance improvement over the original implementations.
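The idea of using buckets to reduce memory transactions can be illustrated with a small software sketch. This is not the authors' OpenCL design: it is a plain Python model written for illustration, and the function names (`partition_direct`, `partition_buffered`), the modulo hash, and the `bucket_size` parameter are all assumptions. It compares writing every tuple straight to its output partition against staging tuples in small per-partition buckets and flushing each bucket as one batched write.

```python
def partition_direct(keys, num_partitions):
    """Baseline: every tuple is written individually to its partition."""
    out = [[] for _ in range(num_partitions)]
    writes = 0
    for k in keys:
        out[k % num_partitions].append(k)  # hypothetical hash: key mod partitions
        writes += 1                        # one modeled memory transaction per tuple
    return out, writes


def partition_buffered(keys, num_partitions, bucket_size):
    """Stage tuples in small per-partition buckets; flush each bucket as one write."""
    out = [[] for _ in range(num_partitions)]
    buckets = [[] for _ in range(num_partitions)]  # stands in for on-chip storage
    writes = 0
    for k in keys:
        p = k % num_partitions
        buckets[p].append(k)
        if len(buckets[p]) == bucket_size:
            out[p].extend(buckets[p])      # one batched transaction for a full bucket
            buckets[p].clear()
            writes += 1
    # Flush buckets that are only partially filled at the end of the input.
    for p in range(num_partitions):
        if buckets[p]:
            out[p].extend(buckets[p])
            buckets[p].clear()
            writes += 1
    return out, writes


if __name__ == "__main__":
    keys = list(range(64))
    direct, direct_writes = partition_direct(keys, 4)
    buffered, buffered_writes = partition_buffered(keys, 4, 8)
    print(direct == buffered, direct_writes, buffered_writes)
```

Both functions produce the same partitions; only the modeled number of write transactions differs, which is the quantity the bucket technique aims to reduce.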
Pages: 8