Best Keyword Cover Search

被引:34
作者
Deng, Ke [1 ]
Li, Xin [2 ]
Lu, Jiaheng [2 ]
Zhou, Xiaofang [3 ,4 ]
机构
[1] Huawei Noahs Ark Res Lab, Hong Kong, Hong Kong, Peoples R China
[2] Renmin Univ, Dept Comp Sci, Beijing, Peoples R China
[3] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
[4] Soochow Univ, Sch Comp Sci & Technol, Suzhou, Peoples R China
基金
澳大利亚研究理事会;
关键词
Spatial database; point of interests; keywords; keyword rating; keyword cover;
D O I
10.1109/TKDE.2014.2324897
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is common that the objects in a spatial database (e.g., restaurants/hotels) are associated with keyword(s) to indicate their businesses/services/features. An interesting problem known as Closest Keywords search is to query objects, called keyword cover, which together cover a set of query keywords and have the minimum inter-objects distance. In recent years, we observe the increasing availability and importance of keyword rating in object evaluation for the better decision making. This motivates us to investigate a generic version of Closest Keywords search called Best Keyword Cover which considers inter-objects distance as well as the keyword rating of objects. The baseline algorithm is inspired by the methods of Closest Keywords search which is based on exhaustively combining objects from different query keywords to generate candidate keyword covers. When the number of query keywords increases, the performance of the baseline algorithm drops dramatically as a result of massive candidate keyword covers generated. To attack this drawback, this work proposes a much more scalable algorithm called keyword nearest neighbor expansion (keyword-NNE). Compared to the baseline algorithm, keyword-NNE algorithm significantly reduces the number of candidate keyword covers generated. The in-depth analysis and extensive experiments on real data sets have justified the superiority of our keyword-NNE algorithm.
引用
收藏
页码:61 / 73
页数:13
相关论文
共 18 条
[1]  
Agrawal Rakesh., 1994, P 20 INT C VER LARG, P487
[2]  
[Anonymous], J COMPUT SYST SCI
[3]   Efficient processing of spatial joins using R-trees [J].
Brinkhoff, Thomas ;
Kriegel, Hans-Peter ;
Seeger, Bernhard .
SIGMOD Record, 1993, 22 (02) :237-246
[4]  
Cao X., 2011, P 2011 INT C MAN DAT, P373, DOI DOI 10.1145/1989323.1989363
[5]   Editorial [J].
Cao, Xi-Ren .
DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2010, 20 (01) :1-2
[6]  
Cong G, 2009, PROC VLDB ENDOW, V2
[7]   Keyword search on spatial databases [J].
De Felipe, Ian ;
Hristidis, Vagelis ;
Rishe, Naphtali .
2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, :656-+
[8]  
Hariharan Ramaswamy, 2007, 2007 International Conference on Scientific and Statistical Database Management, DOI 10.1109/SSDBM.2007.22
[9]   Distance browsing in spatial databases [J].
Hjaltason, GR ;
Samet, H .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 1999, 24 (02) :265-318
[10]   IR-Tree: An Efficient Index for Geographic Document Search [J].
Li, Zhisheng ;
Lee, Ken C. K. ;
Zheng, Baihua ;
Lee, Wang-Chien ;
Lee, Dik Lun ;
Wang, Xufa .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (04) :585-599