Accelerating Equi-Join on a CPU-FPGA Heterogeneous Platform

被引:17
|
作者
Chen, Ren [1 ]
Prasanna, Viktor K. [1 ]
机构
[1] Univ Southern Calif, Ming Hsieh Dept Elect Engn, Los Angeles, CA 90089 USA
来源
2016 IEEE 24TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM) | 2016年
关键词
Keywords Database Operation; Heterogeneous Platform; Hardware Acceleration; CPU-FPGA; Sorting; Join; Selection;
D O I
10.1109/FCCM.2016.62
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accelerating database applications using FPGAs has recently been an area of growing interest in both academia and industry. Equi-join is one of the key database operations whose performance highly depends on sorting, which exhibits high memory usage on FPGA. A fully pipelined N-key merge sorter consists of log N sorting stages using 0(N) memory totally. For large data sets, external memory has to be employed to perform data buffering between the sorting stages. This introduces pipeline stalls as well as several iterations between FPGA and external memory, causing significant performance degradation. In this paper, we speed-up equi-join using a hybrid CPU-FPGA heterogeneous platform. To alleviate the performance impact of limited memory, we propose a merge sort based hybrid design where the first few sorting stages in the merge sort tree are replaced with "folded" bitonic sorting networks. These "folded" bitonic sorting networks operate in parallel on the FPGA. The partial results are then merged on the CPU to produce the final sorted result. Based on this hybrid sorting design, we develop two streaming join algorithms by optimizing the classic CPU-based nested-loop join and sort-merge join algorithms. On a range of data set sizes, our design achieves throughput improvement of 3.1x and 1.9x compared with software-only and FPGA only implementations, respectively. Our design sustains 21.6% of the peak bandwidth, which is 3.9x utilization obtained by the stateof-the-art FPGA equi-join implementation.
引用
收藏
页码:212 / 219
页数:8
相关论文
共 30 条
  • [21] High-Parallelism Hash-Merge Architecture for Accelerating Join Operation on FPGA
    Wu, Wen-Qi
    Xue, Mei-Ting
    Xing, Qian-Jian
    Yu, Feng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (07) : 2650 - 2654
  • [22] Mathematical programming models for scheduling in a CPU/FPGA architecture with heterogeneous communication delays
    Abdessamad Ait El Cadi
    Omar Souissi
    Rabie Ben Atitallah
    Nicolas Belanger
    Abdelhakim Artiba
    Journal of Intelligent Manufacturing, 2018, 29 : 629 - 640
  • [23] Mathematical programming models for scheduling in a CPU/FPGA architecture with heterogeneous communication delays
    El Cadi, Abdessamad Ait
    Souissi, Omar
    Ben Atitallah, Rabie
    Belanger, Nicolas
    Artiba, Abdelhakim
    JOURNAL OF INTELLIGENT MANUFACTURING, 2018, 29 (03) : 629 - 640
  • [24] Development of a CPU-GPU heterogeneous platform based on a nonlinear parallel algorithm
    Ma, Haifeng
    NONLINEAR ENGINEERING - MODELING AND APPLICATION, 2022, 11 (01): : 215 - 222
  • [25] A High Performance FPGA-GPU-CPU Platform for a Real-Time Locating System
    Alawieh, Mohammad
    Kasparek, Maximilian
    Franke, Norbert
    Hupfer, Jochen
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 1576 - 1580
  • [26] A comparison of two metaheuristic algorithms for scheduling problem on a heterogeneous CPU/FPGA architecture with communication delays
    Abdallah, Fadel
    Tanougast, Camel
    Kacem, Imed
    Diou, Camille
    Singer, Daniel
    2017 4TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2017, : 294 - 299
  • [27] An Efficient Design Flow for Accelerating Complicated-connected CNNs on a Multi-FPGA Platform
    Wang, Deguang
    Shen, Junzhong
    Wen, Mei
    Zhang, Chunyuan
    PROCEEDINGS OF THE 48TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP 2019), 2019,
  • [28] Analysis method for seismic response of high-rise structure based on CPU-GPU heterogeneous platform
    Li, Hong-Yu
    Teng, Jun
    Li, Zuo-Hua
    Zhendong yu Chongji/Journal of Vibration and Shock, 2014, 33 (13): : 86 - 91
  • [29] Design and Implementation of Object Detection Acceleration Module Based on an ARM+FPGA Heterogeneous Platform
    Li F.
    Cao J.
    Li P.
    Xie H.
    Zhao X.
    Wang Y.
    Zhang X.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2022, 58 (06): : 1035 - 1041
  • [30] Fast and Accurate Evaluation of Multilayered Green's Functions by Extracting Surface and Leaky Wave Poles Based on GPU/CPU Heterogeneous Platform
    Song, Zhe
    2017 INTERNATIONAL WORKSHOP ON ELECTROMAGNETICS: APPLICATIONS AND STUDENT INNOVATION COMPETITION (IEEE IWEM 2017), 2017, : 63 - 64