Merging What's Cracked, Cracking What's Merged: Adaptive Indexing in Main-Memory Column-Stores

被引:53
作者
Idreos, Stratos [1 ]
Manegold, Stefan [1 ]
Kuno, Harumi [2 ]
Graefe, Goetz [2 ]
机构
[1] CWI, Amsterdam, Netherlands
[2] HP Labs, Palo Alto, CA USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2011年 / 4卷 / 09期
关键词
D O I
10.14778/2002938.2002944
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adaptive indexing is characterized by the partial creation and refinement of the index as side effects of query execution. Dynamic or shifting workloads may benefit from preliminary index structures focused on the columns and specific key ranges actually queried - without incurring the cost of full index construction. The costs and benefits of adaptive indexing techniques should therefore be compared in terms of initialization costs, the overhead imposed upon queries, and the rate at which the index converges to a state that is fully-refined for a particular workload component. Based on an examination of database cracking and adaptive merging, which are two techniques for adaptive indexing, we seek a hybrid technique that has a low initialization cost and also converges rapidly. We find the strengths and weaknesses of database cracking and adaptive merging complementary. One has a relatively high initialization cost but converges rapidly. The other has a low initialization cost but converges relatively slowly. We analyze the sources of their respective strengths and explore the space of hybrid techniques. We have designed and implemented a family of hybrid algorithms in the context of a column-store database system. Our experiments compare their behavior against database cracking and adaptive merging, as well as against both traditional full index lookup and scan of unordered data. We show that the new hybrids significantly improve over past methods while at least two of the hybrids come very close to the "ideal performance" in terms of both overhead per query and convergence to a final state.
引用
收藏
页码:586 / 597
页数:12
相关论文
共 15 条
[1]  
Bruno N., 2007, ACM T DATABASE SYST, V32
[2]  
Chaudhuri, 2007, VLDB, P3
[3]   PHYSICAL DATABASE DESIGN FOR RELATIONAL DATABASES [J].
FINKELSTEIN, S ;
SCHKOLNICK, M ;
TIBERIO, P .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 1988, 13 (01) :91-128
[4]  
Graefe G., 2010, EDBT, P371, DOI [DOI 10.1145/1739041.1739087, 10.1145/1739041.1739087]
[5]  
Graefe G, 2011, LECT NOTES COMPUT SC, V6417, P169, DOI 10.1007/978-3-642-18206-8_13
[6]   A Survey of B-Tree Locking Techniques [J].
Graefe, Goetz .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2010, 35 (03)
[7]   Adaptive indexing for relational keys [J].
Graefe, Goetz ;
Kuno, Harumi .
2010 IEEE 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDE 2010), 2010, :69-74
[8]  
Harder T., 1976, ECI Conference 1976. Proceedings of the 1st Conference of the European Cooperation in Informatics, P146
[9]  
Hoare C. A. R., 1961, COMMUN ACM, V4, P321, DOI [DOI 10.1145/366622.366644, 10.1145/366622.366647, DOI 10.1145/366622.366647]
[10]  
Idreos S., 2007, SIGMOD, P413, DOI [10.1145/1247480.1247527, DOI 10.1145/1247480.1247527]