Processing top-N relational queries by learning

被引:9
作者
Zhu, Liang [1 ,2 ]
Meng, Weiyi [3 ]
Liu, Chunnian [1 ]
Yang, Wenzhu [2 ]
Liu, Dazhong [2 ]
机构
[1] Beijing Univ Technol, Coll Comp Sci & Technol, Beijing 100124, Peoples R China
[2] Hebei Univ, Sch Math & Comp Sci, Key Lab Machine Learning & Computat Intelligence, Baoding 071002, Hebei, Peoples R China
[3] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA
关键词
Top-N query; Relational database; Learning-based strategies; Time series; K SELECTION QUERIES; ALGORITHMS; DATABASES;
D O I
10.1007/s10844-009-0078-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A top-N selection query against a relation is to find the N tuples that satisfy the query condition the best but not necessarily completely. In this paper, we propose a new method for evaluating top-N queries against a relation. This method employs a learning-based strategy. Initially, this method finds and saves the optimal search spaces for a small number of random top-N queries. The learned knowledge is then used to evaluate new queries. Extensive experiments are carried out to measure the performance of this strategy and the results indicate that it is highly competitive with existing techniques for both low-dimensional and high-dimensional data. Furthermore, the knowledge base can be updated based on new user queries to reflect new query patterns so that frequently submitted queries can be processed most efficiently. The maintenance and stability of the knowledge base are also addressed in the paper.
引用
收藏
页码:21 / 55
页数:35
相关论文
共 43 条
[1]  
Balke WT, 2005, PROC INT CONF DATA, P174
[2]  
Bast H., 2006, P 32 INT C VERY LARG, P475
[3]  
Bowerman B, 1993, Forecasting and Time Series: An Applied Approach
[4]   Top-k selection queries over relational databases:: Mapping strategies and performance evaluation [J].
Bruno, N ;
Chaudhuri, S ;
Gravano, L .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2002, 27 (02) :153-187
[5]  
Bruno N., 2001, SIGMOD RECORD, P211
[6]  
Carey M. J., 1997, SIGMOD Record, V26, P219, DOI 10.1145/253262.253302
[7]  
Carey M. J., 1998, Proceedings of the Twenty-Fourth International Conference on Very-Large Databases, P158
[8]  
Chang Yuan-Chi., 2000, P ACM INT C MANAGEME, P391
[9]   Optimizing top-k selection queries over multimedia repositories [J].
Chaudhuri, S ;
Gravano, L ;
Marian, A .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (08) :992-1009
[10]  
Chaudhuri S, 1999, PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P399