A Study of Data Partitioning on OpenCL-based FPGAs

Cited by: 0
Authors
Wang, Zeke [1]
He, Bingsheng [1]
Zhang, Wei [2]
Affiliations
[1] Nanyang Technol Univ, Singapore, Singapore
[2] Hong Kong Univ Sci & Technol, Hong Kong, Hong Kong, Peoples R China
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Considerable research effort has been devoted to accelerating relational database applications on FPGAs, owing to their high energy efficiency and high throughput. Most existing studies are based on hardware description languages (HDLs). Recently, FPGA vendors have started to develop OpenCL SDKs for much better programmability. In this paper, we investigate the performance of relational database applications on OpenCL-based FPGAs. As a start, we study the performance of data partitioning, a core operation widely used in relational databases. Due to random memory accesses, data partitioning is time-consuming and can become a major bottleneck for database operators such as hash join. We start with the state-of-the-art OpenCL implementation, which was originally designed for CPUs/GPUs, and find that it suffers from lock overheads and memory bandwidth overheads. To reduce lock overheads, we develop a simple yet efficient multi-kernel approach that leverages two emerging features of the Altera OpenCL SDK, namely task kernels and channels. Moreover, on-chip buckets are employed to reduce the number of memory transactions. We further develop a cost model to guide the parameter configuration. We evaluate the proposed design on a recent Altera Stratix V FPGA. Our results demonstrate that 1) our cost model can accurately predict the performance of data partitioning under different parameter settings, and 2) our proposed multi-kernel approach can achieve a 10.7X speedup over the existing OpenCL implementation. In addition, experiments with three case studies show that the optimized implementations can achieve a 4-12X performance improvement over the original implementations.
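The idea of using buckets to cut memory transactions can be illustrated with a minimal host-side sketch in Python. It is not the paper's OpenCL kernel code: the modulo hash, the function names, the `bucket_size` parameter, and the way a "transaction" is counted (one per write to the output partitions) are all assumptions made for illustration.

```python
def partition_direct(keys, num_partitions):
    """Scatter each key straight to its partition: one write per tuple."""
    partitions = [[] for _ in range(num_partitions)]
    transactions = 0
    for key in keys:
        partitions[key % num_partitions].append(key)  # assumed hash: modulo
        transactions += 1
    return partitions, transactions


def partition_buffered(keys, num_partitions, bucket_size):
    """Stage keys in small per-partition buckets and flush full buckets in one burst."""
    partitions = [[] for _ in range(num_partitions)]
    buckets = [[] for _ in range(num_partitions)]  # stand-in for on-chip buckets
    transactions = 0
    for key in keys:
        p = key % num_partitions
        buckets[p].append(key)
        if len(buckets[p]) == bucket_size:
            partitions[p].extend(buckets[p])  # one burst write per full bucket
            buckets[p].clear()
            transactions += 1
    # Flush whatever is left in partially filled buckets.
    for p, bucket in enumerate(buckets):
        if bucket:
            partitions[p].extend(bucket)
            transactions += 1
    return partitions, transactions


keys = list(range(64))
direct_parts, direct_tx = partition_direct(keys, num_partitions=4)
buffered_parts, buffered_tx = partition_buffered(keys, num_partitions=4, bucket_size=8)
print(direct_tx, buffered_tx)
```

Both functions produce the same partitions; under these assumptions the buffered version issues one write per full bucket instead of one per tuple, which is the effect the abstract attributes to on-chip buckets.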
Pages: 8