Revisiting Co-Processing for Hash Joins on the Coupled CPU-GPU Architecture

被引:77
作者
He, Jiong [1 ]
Lu, Mian [2 ]
He, Bingsheng [1 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] A Star Inst HPC, Singapore, Singapore
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2013年 / 6卷 / 10期
关键词
D O I
10.14778/2536206.2536216
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Query co-processing on graphics processors (GPUs) has become an effective means to improve the performance of main memory databases. However, the relatively low bandwidth and high latency of the PCI-e bus are usually bottleneck issues for co-processing. Recently, coupled CPU-GPU architectures have received a lot of attention, e.g. AMD APUs with the CPU and the GPU integrated into a single chip. That opens up new opportunities for optimizing query co-processing. In this paper, we experimentally revisit hash joins, one of the most important join algorithms for main memory databases, on a coupled CPU-GPU architecture. Particularly, we study the fine-grained co-processing mechanisms on hash joins with and without partitioning. The co-processing outlines an interesting design space. We extend existing cost models to automatically guide decisions on the design space. Our experimental results on a recent AMD APU show that (1) the coupled architecture enables fine-grained co-processing and cache reuses, which are inefficient on discrete CPU-GPU architectures; (2) the cost model can automatically guide the design and tuning knobs in the design space; (3) fine-grained co-processing achieves up to 53%, 35% and 28% performance improvement over CPU-only, GPU-only and conventional CPU-GPU co-processing, respectively. We believe that the insights and implications from this study are initial yet important for further research on query co-processing on coupled CPU-GPU architectures.
引用
收藏
页码:889 / 900
页数:12
相关论文
共 35 条
[1]  
Abadi DJ, 2005, PROC 31 INT C VERY L, P553
[2]  
Ailamaki A., 1999, VLDB
[3]  
Balkesen C, 2013, PROC INT CONF DATA, P362, DOI 10.1109/ICDE.2013.6544839
[4]  
Blanas S., 2011, SIGMOD C, P37
[5]  
Boncz P. A., 1999, VLDB
[6]  
Chen L., 2012, SC 12
[7]  
CHEN MS, 1992, PROC INT CONF VERY L, P15
[8]  
Chen S., 2007, TODS
[9]   PARALLEL DATABASE-SYSTEMS - THE FUTURE OF HIGH-PERFORMANCE DATABASE-SYSTEMS [J].
DEWITT, D ;
GRAY, J .
COMMUNICATIONS OF THE ACM, 1992, 35 (06) :85-98
[10]  
DeWitt D. J., 1985, Very Large Data Bases. Proceedings of the Eleventh International Conference, P151