Efficient Dual-Resolution Layer Indexing for Top-k Queries

被引:8
作者
Lee, Jongwuk [1 ]
Cho, Hyunsouk [1 ]
Hwang, Seung-won [1 ]
机构
[1] Pohang Univ Sci & Technol POSTECH, Dept Comp Sci & Engn, Pohang 790784, South Korea
来源
2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE) | 2012年
关键词
SKYLINE; ALGORITHMS;
D O I
10.1109/ICDE.2012.73
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Top-k queries have gained considerable attention as an effective means for narrowing down the overwhelming amount of data. This paper studies the problem of constructing an indexing structure that efficiently supports top-k queries for varying scoring functions and retrieval sizes. The existing work can be categorized into three classes: list-, layer-, and view-based approaches. This paper focuses on the layer-based approach, pre-materializing tuples into consecutive multiple layers. The layer-based index enables us to return top-k answers efficiently by restricting access to tuples in the k layers. However, we observe that the number of tuples accessed in each layer can be reduced further. For this purpose, we propose a dual-resolution layer structure. Specifically, we iteratively build coarse-level layers using skylines, and divide each coarse-level layer into fine-level sublayers using convex skylines. The dual-resolution layer is able to leverage not only the dominance relationship between coarse-level layers, named for all-dominance, but also a relaxed dominance relationship between fine-level sublayers, named there exists-dominance. Our extensive evaluation results demonstrate that our proposed method significantly reduces the number of tuples accessed than the state-of-the-art methods.
引用
收藏
页码:1084 / 1095
页数:12
相关论文
共 32 条
[1]   The Quickhull algorithm for convex hulls [J].
Barber, CB ;
Dobkin, DP ;
Huhdanpaa, H .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1996, 22 (04) :469-483
[2]  
Bast H., 2006, P 32 INT C VERY LARG, P475
[3]   The Skyline operator [J].
Börzsönyi, S ;
Kossmann, D ;
Stocker, K .
17TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2001, :421-430
[4]   Evaluating Top-k queries over web-accessible Databases [J].
Bruno, N ;
Gravano, L ;
Marian, A .
18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, :369-+
[5]  
Carey M. J., 1997, SIGMOD Record, V26, P219, DOI 10.1145/253262.253302
[6]  
Chang KevinChen-Chuan., 2002, P ACM INT C MANAGEME, P346
[7]  
Chang YC, 2000, SIGMOD RECORD, V29, P391, DOI 10.1145/335191.335433
[8]   ON THE CONVEX LAYERS OF A PLANAR SET [J].
CHAZELLE, B .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1985, 31 (04) :509-517
[9]   Skyline with presorting [J].
Chomicki, J ;
Godfrey, P ;
Gryz, J ;
Liang, DM .
19TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2003, :717-719
[10]  
Das G., 2006, Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB'06), P451