Fast Join Project Query Evaluation using Matrix Multiplication

被引:12
作者
Deep, Shaleen [1 ]
Hu, Xiao [2 ]
Koutris, Paraschos [1 ]
机构
[1] Univ Wisconsin, Madison, WI 53706 USA
[2] Duke Univ, Durham, NC 27706 USA
来源
SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2020年
关键词
D O I
10.1145/3318464.3380607
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the last few years, much effort has been devoted to developing join algorithms to achieve worst-case optimality for join queries over relational databases. Towards this end, the database community has had considerable success in developing efficient algorithms that achieve worst-case optimal runtime for full join queries, i.e., joins without projections. However, not much is known about join evaluation with projections beyond some simple techniques of pushing down the projection operator in the query execution plan. Such queries have a large number of applications in entity matching, graph analytics and searching over compressed graphs. In this paper, we study how a class of join queries with projections can be evaluated faster using worst-case optimal algorithms together with matrix multiplication. Crucially, our algorithms are parameterized by the output size of the final result, allowing for choosing the best execution strategy. We implement our algorithms as a subroutine and compare the performance with state-of-the-art techniques to show they can be improved upon by as much as 50x. More importantly, our experiments indicate that matrix multiplication is a useful operation that can help speed up join processing owing to highly optimized open source libraries that are also highly parallelizable.
引用
收藏
页码:1213 / 1223
页数:11
相关论文
共 34 条
[1]  
Abdelaziz I, 2015, PROC VLDB ENDOW, V8, P1881
[2]   LevelHeaded: A Unified Engine for Business Intelligence and Linear Algebra Querying [J].
Aberger, Christopher R. ;
Lamb, Andrew ;
Olukotun, Kunle ;
Re, Christopher .
2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, :449-460
[3]   EmptyHeaded: A Relational Engine for Graph Processing [J].
Aberger, Christopher R. ;
Lamb, Andrew ;
Tu, Susan ;
Noetzli, Andres ;
Olukotun, Kunle ;
Re, Christopher .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2017, 42 (04)
[4]  
Afshani P, 2016, ICALP 2016 AUTOMATA
[5]  
Amossen R.R., 2009, ICDT, P121, DOI [10.1145/1514894.1514909, DOI 10.1145/1514894.1514909]
[6]  
[Anonymous], 1975, DESIGN ANAL COMPUTER
[7]   SIZE BOUNDS AND QUERY PLANS FOR RELATIONAL JOINS [J].
Atserias, Albert ;
Grohe, Martin ;
Marx, Daniel .
SIAM JOURNAL ON COMPUTING, 2013, 42 (04) :1737-1767
[8]  
Bagan G, 2007, LECT NOTES COMPUT SC, V4646, P208
[9]  
Bhargava G., 1997, US Patent, Patent No. [5,687,362, 5687362]
[10]   Set containment join revisited [J].
Bouros, Panagiotis ;
Mamoulis, Nikos ;
Ge, Shen ;
Terrovitis, Manolis .
KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 49 (01) :375-402