TripleID-Q: RDF Query Processing Framework Using GPU

被引:11
作者
Chantrapornchai, Chantana [1 ]
Choksuchat, Chidchanok [2 ,3 ]
机构
[1] Kasetsart Univ, Fac Engn, Dept Comp Engn, Bangkok 10900, Thailand
[2] Silpakorn Univ, Nakhon Pathom 73000, Thailand
[3] Prince Songkla Univ, Fac Sci, Informat & Commun Technol Programme, Hat Yai 90110, Thailand
关键词
Query processing; parallel processing; entailment; TripleID; GPU; RDF;
D O I
10.1109/TPDS.2018.2814567
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Resource Description Framework (RDF) data represents information linkage around the Internet. It uses Internationalized Resources Identifier (IRI) which can be referred to external information. Typically, an RDF data is serialized as a large text file which contains millions of relationships. In this work, we propose a framework based on TripleID-Q, for query processing of large RDF data in a GPU. The key elements of the framework are 1) a compact representation suitable for a Graphics Processing Unit ( GPU) and 2) its simple representation conversion method which optimizes the preprocessing overhead. Together with the framework, we propose parallel algorithms which utilize thousands of GPU threads to look for specific data for a given query as well as to perform basic query operations such as union, join, and filter. The TripleID representation is smaller than the original representation 3-4 times. Querying from TripleID using a GPU is up to 108 times faster than using the traditional RDF tool. The speedup can be more than 1,000 times over the traditional RDF store when processing a complex query with union and join of many subqueries.
引用
收藏
页码:2121 / 2135
页数:15
相关论文
共 49 条
[1]  
Alcantara DanAnthony Feliciano., 2011, Efficient Hash Tables on the GPU
[2]  
[Anonymous], 2010, MENTOK RDF STORAGE Q
[3]  
[Anonymous], 2004, RDF VOCABULARY DESCR
[4]  
Atre Medha., 2010, WWW, P41, DOI DOI 10.1145/1772690.1772696
[5]  
Bassem M., 2013, P 2013 INT C DOCT CO, V1045, P40
[6]  
Baxter Sean., 2013, Modern GPU
[7]   The, design and implementation of the Redland RDF application framework [J].
Beckett, D .
COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2002, 39 (05) :577-588
[8]   Load-aware inter-co-processor parallelism in database query processing [J].
Bress, Sebastian ;
Siegmund, Norbert ;
Heimel, Max ;
Saecker, Michael ;
Lauer, Tobias ;
Bellatreche, Ladjel ;
Saake, Gunter .
DATA & KNOWLEDGE ENGINEERING, 2014, 93 :60-79
[9]   TripleID-C: Low Cost Compressed Representation for RDF Query Processing in GPUs [J].
Chantrapornchai, Chantana ;
Makpaisit, Pisit .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION (HPC ASIA 2018), 2018, :261-270
[10]  
Chirravuri S. K., 2014, THESIS