Accelerating the merge phase of sort-merge join

被引：14

作者：

Papaphilippou, Philippos ^{[1
]}

Pirk, Holger ^{[1
]}

Luk, Wayne ^{[1
]}

机构：

[1] Imperial Coll London, Dept Comp, London, England

来源：

2019 29TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL) | 2019年

基金：

英国工程与自然科学研究理事会;

关键词：

Sort-merge join; stream processing; operators; database acceleration; high-throughput; FPGA; MPSoC; big data;

D O I：

10.1109/FPL.2019.00025

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

We present an efficient, high-throughput and scalable hardware design for accelerating the merge phase of the sort-merge join operation. Sort-merge join is one of the fundamental join algorithms and among the most frequently executed operations in relational databases. It has been the focus of various recent pieces of research, each having different shortcomings and usually only focusing on the sort phase. In this paper, a new parallel sort-merge join architecture is developed, that provides data streaming functionality and high throughput. The key idea of the paper is the use of a novel design for a co-grouping engine, with which the input data are summarised on-the-fly. In this way, the operation is performed on streams of data, preserving the linear access pattern for faster data movement and also eliminates the need for a replay buffer/cache. In contrast to related work, our approach does not make assumptions about the value distribution of the input data and applies to any input size and width. We evaluate the design on a Zynq UltraScale+ based platform, and show that there is up to 3.1 times speedup over the host processor, even without accelerating the sort phase, while still taking into account the data transfers from and to main memory in Linux.

引用

页码：100 / 105

页数：6

共 18 条

[1]

Abadi DJ, 2007, PROC INT CONF DATA, P441

[2]

Batcher KE., 1968, Proceedings of the Spring Joint Computer Conference, P307, DOI DOI 10.1145/1468075.1468121

[3]

Bundala D, 2014, LECT NOTES COMPUT SC, V8370, P236, DOI 10.1007/978-3-319-04921-2_19

[4]

Casper J., 2014, P FPGA 2014 NEW YORK, P151, DOI DOI 10.1145/2554688.2554787

[5] Saturn: A terabit packet switch using dual round-robin [J].

Chao, J .

IEEE COMMUNICATIONS MAGAZINE, 2000, 38 (12) :78-84

[6] Accelerating Equi-Join on a CPU-FPGA Heterogeneous Platform [J].

Chen, Ren ;

Prasanna, Viktor K. .

2016 IEEE 24TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2016, :212-219

[7]

Farmahini-Farahani A., 2011, Proceedings 2011 IEEE 9th Symposium on Application Specific Processors (SASP 2011), P38, DOI 10.1109/SASP.2011.5941075

[8]

Halstead R., 2015, CIDR

[9]

Hong Diep Nguyen, 2011, 2011 International Conference on Field Programmable Logic and Applications, P232, DOI 10.1109/FPL.2011.49

[10] FPGA-based Data Partitioning [J].

Kara, Kaan ;

Giceva, Jana ;

Alonso, Gustavo .

SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, :433-445

← 1 2 →