Evaluating Top-N queries in n-dimensional normed spaces

被引:4
作者
Zhu, Liang [1 ]
Liu, Feifei [1 ]
Meng, Weiyi [2 ]
Ma, Qin [3 ]
Wang, Yu [1 ]
Yuan, Fang [4 ]
机构
[1] Hebei Univ, Sch Comp Sci & Technol, Intelligent Database Lab, Baoding 071002, Hebei, Peoples R China
[2] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA
[3] Hebei Univ, Dept Foreign Language Teaching & Res, Baoding 071002, Hebei, Peoples R China
[4] Hebei Univ, Sch Math & Informat Sci, Baoding 071002, Hebei, Peoples R China
关键词
Top-N query; n-dimensional normed space; Relational database; Ranking function; K QUERIES; AGGREGATION; SKYLINE; JOIN;
D O I
10.1016/j.ins.2016.09.035
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For the problem of processing top-N queries, the threshold algorithm (TA) is an important method in many scenarios. The model of top-N queries with TA has three characteristics: (1) the ranking function is monotone, (2) the query point is fixed, and (3) TA scans the sorted index lists unidirectionally. In many database applications, however, there are opportunities for evaluating top-N queries with arbitrary query points and generic (not necessarily monotone) ranking functions. In this paper, we propose an approach for evaluating top-N queries in n-dimensional normed spaces. Given a query point Q = (q1, ..., qn) in the real vector space R-n and a generic norm distance d as ranking function, our method employs the norm equivalence theorem in Functional Analysis so that the candidate tuples of top-N query Q with d can be obtained using the Maximum distance d(infinity). This method projects each q(i) on its corresponding axis and constructs an interval centered at rh, and then enlarges each interval bidirectionally until the n-dimensional hyperrectangle contains enough candidate tuples so that the top-N tuples are retrieved according to the given norm distance d. Extensive experiments are conducted to measure the performance of our approach for both low-dimensional and high-dimensional data. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:255 / 275
页数:21
相关论文
共 41 条
  • [1] Akbarinia Reza, 2007, INT C VERY LARGE DAT, P495
  • [2] [Anonymous], 2007, P 33 INT C VER LARG
  • [3] [Anonymous], 2012, Theory of computing, DOI DOI 10.4086/TOC.2012.V008A014
  • [4] [Anonymous], 2003, P 2003 ACM SIGMOD IN, DOI DOI 10.1145/872757.872795
  • [5] [Anonymous], 2004, HDB DISCRETE COMPUTA
  • [6] Berchtold S, 2000, LECT NOTES COMPUT SC, V1874, P244
  • [7] Borodin A., 1999, Proceedings of the Thirty-First Annual ACM Symposium on Theory of Computing, P312, DOI 10.1145/301250.301330
  • [8] The Skyline operator
    Börzsönyi, S
    Kossmann, D
    Stocker, K
    [J]. 17TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2001, : 421 - 430
  • [9] Evaluating Top-k queries over web-accessible Databases
    Bruno, N
    Gravano, L
    Marian, A
    [J]. 18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 369 - +
  • [10] Top-k selection queries over relational databases:: Mapping strategies and performance evaluation
    Bruno, N
    Chaudhuri, S
    Gravano, L
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2002, 27 (02): : 153 - 187