SW-Store: a vertically partitioned DBMS for Semantic Web data management

被引:155
作者
Abadi, Daniel J. [1 ]
Marcus, Adam [2 ]
Madden, Samuel R. [2 ]
Hollenbach, Kate [2 ]
机构
[1] Yale Univ, New Haven, CT 06520 USA
[2] MIT, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
Query Time; Property Table; Query Plan; Path Expression; Vertical Partition;
D O I
10.1007/s00778-008-0125-y
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient management of RDF data is an important prerequisite for realizing the Semantic Web vision. Performance and scalability issues are becoming increasingly pressing as Semantic Web technology is applied to real-world applications. In this paper, we examine the reasons why current data management solutions for RDF data scale poorly, and explore the fundamental scalability limitations of these approaches. We review the state of the art for improving performance of RDF databases and consider a recent suggestion, "property tables". We then discuss practically and empirically why this solution has undesirable features. As an improvement, we propose an alternative solution: vertically partitioning the RDF data. We compare the performance of vertical partitioning with prior art on queries generated by a Web-based RDF browser over a large-scale (more than 50 million triples) catalog of library data. Our results show that a vertically partitioned schema achieves similar performance to the property table technique while being much simpler to design. Further, if a column-oriented DBMS (a database architected specially for the vertically partitioned case) is used instead of a row-oriented DBMS, another order of magnitude performance improvement is observed, with query times dropping from minutes to several seconds. Encouraged by these results, we describe the architecture of SW-Store, a new DBMS we are actively building that implements these techniques to achieve high performance RDF data management.
引用
收藏
页码:385 / 406
页数:22
相关论文
共 53 条
[1]  
Abadi D., 2008, SIGMOD
[2]  
Abadi D., 2006, SIGMOD
[3]  
Abadi D.J., 2008, THESIS
[4]  
Abadi D. J., 2007, P ICDE
[5]  
ABADI DJ, 2007, MITCSAILTR2007036
[6]  
ABADI DJ, 2007, CIDR
[7]  
AGRAWAL R, 2001, VLDB
[8]  
Ailamaki A., 2001, Proceedings of the 27th International Conference on Very Large Data Bases, P169
[9]  
Alexaki Sofia., 2001, SemWeb
[10]  
[Anonymous], SIMILE