Exact and Approximate Generic Multi-criteria Top-k Query Processing

被引:0
作者
Badr, Mehdi [1 ,2 ]
Vodislav, Dan [1 ]
机构
[1] Univ Cergy Pontoise, ENSEA, ETIS, CNRS, Cergy, France
[2] TRIMANE, St Germain En Laye, France
来源
TRANSACTIONS ON LARGE-SCALE DATA- AND KNOWLEDGE-CENTERED SYSTEMS XVIII: SPECIAL ISSUE ON DATABASE- AND EXPERT-SYSTEMS APPLICATIONS | 2015年 / 8980卷
关键词
Top-k query processing; Ranking; Multi-criteria information retrieval;
D O I
10.1007/978-3-662-46485-4_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many algorithms for multi-criteria top-k query processing with ranking predicates have been proposed, but little effort has been directed toward genericity, i.e. supporting any type of access to the lists of predicate scores (sorted and/or random), or any access cost settings. In this paper we propose a general approach to exact and approximate generic top-k processing. To this end, we propose a general framework (GF) for generic top-k processing, able to express any top-k algorithm and present within this framework a first comparison between generic algorithms. In previous work, we proposed BreadthRefine (BR), a generic algorithm that considers the current top-k candidates as a whole instead of focusing on the best candidate for score refinement, then we compared it with specific top-k algorithms. In this paper, we propose two variants of existing generic strategies and experimentally compare them with the BR breadth-first strategy, showing that BR leads to better execution costs. We also extend the notion of theta-approximation to the GF framework and present a first experimental study of the approximation potential of top-k algorithms on early stopping.
引用
收藏
页码:53 / 79
页数:27
相关论文
共 19 条
  • [1] Akbarinia Reza, 2007, INT C VERY LARGE DAT, P495
  • [2] Badr Mehdi, 2011, Database and Expert Systems Applications. Proceedings 22nd International Conference, DEXA 2011, P379, DOI 10.1007/978-3-642-23088-2_28
  • [3] Searching in high-dimensional spaces -: Index structures for improving the performance of multimedia Databases
    Böhm, C
    Berchtold, S
    Keim, D
    [J]. ACM COMPUTING SURVEYS, 2001, 33 (03) : 322 - 373
  • [4] Evaluating Top-k queries over web-accessible Databases
    Bruno, N
    Gravano, L
    Marian, A
    [J]. 18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 369 - +
  • [5] Cao P., 2004, P 23 ANN ACM S PRINC, P206, DOI DOI 10.1145/1011767.1011798
  • [6] Chang KevinChen-Chuan., 2002, P ACM INT C MANAGEME, P346
  • [7] Optimal aggregation algorithms for middleware
    Fagin, R
    Lotem, A
    Naor, M
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2003, 66 (04) : 614 - 656
  • [8] Towards efficient multi-feature queries in heterogeneous environments
    Güntzer, U
    Balke, WT
    Kiessling, W
    [J]. INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, PROCEEDINGS, 2001, : 622 - 628
  • [9] Guntzer Ulrich., 2000, VLDB J, P419
  • [10] Optimizing top-k queries for middleware access: A unified cost-based approach
    Hwang, Seung-Won
    Chang, Kevin Chen-Chuan
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2007, 32 (01):