A Distributed Graph Engine for Web Scale RDF Data

被引:161
作者
Zeng, Kai [1 ]
Yang, Jiacheng [2 ]
Wang, Haixun [3 ]
Shao, Bin [3 ]
Wang, Zhongyuan [3 ,4 ]
机构
[1] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
[2] Columbia Univ, New York, NY USA
[3] Microsoft Res Asia, Beijing, Peoples R China
[4] Renmin Univ China, Beijing, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2013年 / 6卷 / 04期
关键词
D O I
10.14778/2535570.2488333
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Much work has been devoted to supporting RDF data. But state-of-the-art systems and methods still cannot handle web scale RDF data effectively. Furthermore, many useful and general purpose graph-based operations (e.g., random walk, reachability, community discovery) on RDF data are not supported, as most existing systems store and index data in particular ways (e.g., as relational tables or as a bitmap matrix) to maximize one particular operation on RDF data: SPARQL query processing. In this paper, we introduce Trinity.RDF, a distributed, memory-based graph engine for web scale RDF data. Instead of managing the RDF data in triple stores or as bitmap matrices, we store RDF data in its native graph form. It achieves much better (sometimes orders of magnitude better) performance for SPARQL queries than the state-of-the-art approaches. Furthermore, since the data is stored in its native graph form, the system can support other operations (e.g., random walks, reachability) on RDF graphs as well. We conduct comprehensive experimental studies on real life, web scale RDF data to demonstrate the effectiveness of our approach.
引用
收藏
页码:265 / 276
页数:12
相关论文
共 36 条
[1]   SW-Store: a vertically partitioned DBMS for Semantic Web data management [J].
Abadi, Daniel J. ;
Marcus, Adam ;
Madden, Samuel R. ;
Hollenbach, Kate .
VLDB JOURNAL, 2009, 18 (02) :385-406
[2]  
Alexaki S., 2001, SEM WEB
[3]  
Angles R, 2005, LECT NOTES COMPUT SC, V3532, P346
[4]  
Atre M., 2010, P 19 INT C WORLD WID, P41, DOI 10.1145/1772690.1772696
[5]   DBpedia: A nucleus for a web of open data [J].
Auer, Soeren ;
Bizer, Christian ;
Kobilarov, Georgi ;
Lehmann, Jens ;
Cyganiak, Richard ;
Ives, Zachary .
SEMANTIC WEB, PROCEEDINGS, 2007, 4825 :722-+
[6]   USING SEMI-JOINS TO SOLVE RELATIONAL QUERIES [J].
BERNSTEIN, PA ;
CHIU, DMW .
JOURNAL OF THE ACM, 1981, 28 (01) :25-40
[7]   Storing RDF as a graph [J].
Bönström, V ;
Hinze, A ;
Schweppe, H .
FIRST LATIN AMERICAN WEB CONGRESS, PROCEEDINGS, 2003, :27-36
[8]  
Broekstra J., 2002, ISWC
[9]   Fast graph pattern matching [J].
Cheng, Jiefeng ;
Yu, Jeffrey Xu ;
Ding, Bolin ;
Yu, Philip S. ;
Wang, Haixun .
2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, :913-+
[10]  
CHONG EI, 2005, VLDB