Optimization of GPU-Based Main-Memory Hash Join

被引:0
作者
Li, Guo-hua [1 ]
Ren, Yu-qi [2 ]
Luo, Can [3 ]
Huang, Jin [3 ]
Deng, Yang-dong [3 ]
机构
[1] Southwest Jiaotong Univ, Sch Transportat & Logist, Chengdu 610031, Sichuan, Peoples R China
[2] CRRC Dalian Res Inst Co Ltd, Dalian 116021, Liaoning, Peoples R China
[3] Tsinghua Univ, Sch Software, KLISS, TNList, Beijing 100084, Peoples R China
来源
2017 2ND INTERNATIONAL CONFERENCE ON COMPUTATIONAL MODELING, SIMULATION AND APPLIED MATHEMATICS (CMSAM) | 2017年
关键词
Hash join; Query optimization; GPU; ARCHITECTURE;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the advent of the big data era, highly efficient and scalable join algorithms are becoming increasingly essential for database operations. As a result, recent years witnessed a strong momentum in accelerating join algorithms with multi-and many-core processors. Among various acceleration platforms, GPUs have the advantage in terms of raw computing power and scalability. The hash join problem, however, poses unique challenges for effective GPU implementations. Especially, a complete treatment of the problem by systematically considering various GPU architectural details and input characteristics is still missing. In this work, we built a GPU-based testbed to systematically study the performance tradeoffs of developing highly efficient GPU implementations for hash join. On such a basis, we investigated a set of essential building blocks including data transfer mechanisms between host (CPU) and device (GPU) to take advantage of the PCI-E bandwidth, a streaming scheme to effectively overlap data transfer and kernel execution, and an atomic-free transformation to minimize costly synchronization overhead. By integrating these blocks, we are able to improve the hash join performance to a new level. The experimental results show that our GPU implementation of hash join outperforms the state-of-the-art results by up to 111%. We also proposed a framework to guide the selection of optimization strategies.
引用
收藏
页码:489 / 494
页数:6
相关论文
共 11 条
  • [1] Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited
    Balkesen, Cagri
    Alonso, Gustavo
    Teubner, Jens
    Oezsu, M. Tamer
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 7 (01): : 85 - 96
  • [2] Balkesen C, 2013, PROC INT CONF DATA, P362, DOI 10.1109/ICDE.2013.6544839
  • [3] Bratbergsengen K, 1984, VLDB, P323
  • [4] Electronic Design Automation with Graphic Processors: A Survey
    Deng, Yangdong
    Mu, Shuai
    [J]. FOUNDATIONS AND TRENDS IN ELECTRONIC DESIGN AUTOMATION, 2013, 7 (1-2): : 1 - 176
  • [5] Halstead R., 2015, CIDR
  • [6] He B., 2008, SIGMOD, DOI DOI 10.1145/1376616.1376670
  • [7] Revisiting Co-Processing for Hash Joins on the Coupled CPU-GPU Architecture
    He, Jiong
    Lu, Mian
    He, Bingsheng
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (10): : 889 - 900
  • [8] Kaldewey T., 2012, DaMoN, DOI DOI 10.1145/2236584.2236592
  • [9] APPLICATION OF HASH TO DATA-BASE MACHINE AND ITS ARCHITECTURE
    KITSUREGAWA, M
    TANAKA, H
    MOTOOKA, T
    [J]. NEW GENERATION COMPUTING, 1983, 1 (01) : 63 - 74
  • [10] Negrut D., 2014, TECH REP