Accelerating Equi-Join on a CPU-FPGA Heterogeneous Platform

被引:17
|
作者
Chen, Ren [1 ]
Prasanna, Viktor K. [1 ]
机构
[1] Univ Southern Calif, Ming Hsieh Dept Elect Engn, Los Angeles, CA 90089 USA
来源
2016 IEEE 24TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM) | 2016年
关键词
Keywords Database Operation; Heterogeneous Platform; Hardware Acceleration; CPU-FPGA; Sorting; Join; Selection;
D O I
10.1109/FCCM.2016.62
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accelerating database applications using FPGAs has recently been an area of growing interest in both academia and industry. Equi-join is one of the key database operations whose performance highly depends on sorting, which exhibits high memory usage on FPGA. A fully pipelined N-key merge sorter consists of log N sorting stages using 0(N) memory totally. For large data sets, external memory has to be employed to perform data buffering between the sorting stages. This introduces pipeline stalls as well as several iterations between FPGA and external memory, causing significant performance degradation. In this paper, we speed-up equi-join using a hybrid CPU-FPGA heterogeneous platform. To alleviate the performance impact of limited memory, we propose a merge sort based hybrid design where the first few sorting stages in the merge sort tree are replaced with "folded" bitonic sorting networks. These "folded" bitonic sorting networks operate in parallel on the FPGA. The partial results are then merged on the CPU to produce the final sorted result. Based on this hybrid sorting design, we develop two streaming join algorithms by optimizing the classic CPU-based nested-loop join and sort-merge join algorithms. On a range of data set sizes, our design achieves throughput improvement of 3.1x and 1.9x compared with software-only and FPGA only implementations, respectively. Our design sustains 21.6% of the peak bandwidth, which is 3.9x utilization obtained by the stateof-the-art FPGA equi-join implementation.
引用
收藏
页码:212 / 219
页数:8
相关论文
共 30 条
  • [1] Bloom Filter-Based Parallel Architecture for Accelerating Equi-Join Operation on FPGA
    He, Binhao
    Xue, Meiting
    Liu, Shubiao
    Luo, Wei
    ELECTRONICS, 2021, 10 (15)
  • [2] PH-CF: A Phased Hybrid Algorithm for Accelerating Subgraph Matching Based on CPU-FPGA Heterogeneous Platform
    Zhang, Xian
    Xiao, Guoqing
    Duan, Mingxing
    Chen, Yuedan
    Li, Kenli
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (07) : 8362 - 8373
  • [3] Simplifying HW/SW Integration to Deploy Multiple Accelerators for CPU-FPGA Heterogeneous Platforms
    Braganca, Lucas
    Alves, Fredy
    Penha, Jeronimo Costa
    Coimbra, Gabriel
    Ferreira, Ricardo
    Nacif, Jose Augusto M.
    2018 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION (SAMOS XVIII), 2018, : 97 - 104
  • [4] Equi-join for Multiple Datasets Based on Time Cost Evaluation Model
    Zhu, Hong
    Xia, Libo
    Xie, Mieyi
    Yan, Ke
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2014, PT II, 2014, 8631 : 122 - 135
  • [5] AKGF: Automatic Kernel Generation for DNN on CPU-FPGA
    Dong, Dong
    Jiang, Hongxu
    Diao, Boyu
    COMPUTER JOURNAL, 2023, 67 (05) : 1619 - 1627
  • [6] HeteroKV: A Scalable Line-rate Key-Value Store on Heterogeneous CPU-FPGA Platforms
    Yang, Haichang
    Li, Zhaoshi
    Wang, Jiawei
    Yin, Shouyi
    Wei, Shaojun
    Liu, Leibo
    PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 834 - 837
  • [7] System Design of ATSC3.0 Broadcast Gateway Based on CPU-FPGA
    Ding, Jianhao
    Xiong, Shuai
    Liu, Yifan
    He, Dazhi
    Zhang, Wenjun
    Wang, Kaikai
    2018 IEEE 88TH VEHICULAR TECHNOLOGY CONFERENCE (VTC-FALL), 2018,
  • [8] High-performance Face Detection with CPU-FPGA Acceleration
    Mohanty, Abinash
    Suda, Naveen
    Kim, Minkyu
    Vrudhula, Sarma
    Seo, Jae-sun
    Cao, Yu
    2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 117 - 120
  • [9] Resource-Aware Collaborative Allocation for CPU-FPGA Cloud Environments
    Jordan, Michael Guilherme
    Korol, Guilherme
    Rutzig, Mateus Beck
    Beck, Antonio Carlos Schneider
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (05) : 1655 - 1659
  • [10] Resource Provisioning for CPU-FPGA Environments with Adaptive HLS-Versioning and DVFS
    Jordan, Michael Guilherme
    Korol, Guilherme
    Knorst, Tiago
    Rutzig, Mateus Beck
    Schneider Beck, Antonio Carlos
    2023 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, ISVLSI, 2023, : 127 - 132