Efficient Algorithms for Answering the m-Closest Keywords Query

被引:71
作者
Guo, Tao [1 ]
Cao, Xin [2 ]
Cong, Gao [1 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] Queens Univ Belfast, Belfast, Antrim, North Ireland
来源
SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2015年
基金
新加坡国家研究基金会;
关键词
Spatial keyword query; Geo-textual objects; SEARCH;
D O I
10.1145/2723372.2723723
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As an important type of spatial keyword query, the m-closest keywords (mCK) query finds a group of objects such that they cover all query keywords and have the smallest diameter, which is defined as the largest distance between any pair of objects in the group. The query is useful in many applications such as detecting locations of web resources. However, the existing work does not study the intractability of this problem and only provides exact algorithms, which are computationally expensive. In this paper, we prove that the problem of answering mCK queries is NP-hard. We first devise a greedy algorithm that has an approximation ratio of 2. Then, we observe that an mCK query can be approximately answered by finding the circle with the smallest diameter that encloses a group of objects together covering all query keywords. We prove that the group enclosed in the circle can answer the mCK query with an approximation ratio of 2/root 3. Based on this, we develop an algorithm for finding such a circle exactly, which has a high time complexity. To improve efficiency, we propose another two algorithms that find such a circle approximately, with a ratio of (2/root 3+epsilon) Finally, we propose an exact algorithm that utilizes the group found by the (2/root 3+epsilon)-approximation algorithm to obtain the optimal group. We conduct extensive experiments using real-life datasets. The experimental results offer insights into both efficiency and accuracy of the proposed approximation algorithms, and the results also demonstrate that our exact algorithm outperforms the best known algorithm by an order of magnitude.
引用
收藏
页码:405 / 418
页数:14
相关论文
共 24 条
  • [1] Bouros P, 2012, PROC VLDB ENDOW, V6, P1
  • [2] Cao X., 2011, ACM SIGMOD International Conference on Management of Data, SIGMOD 2011, Athens, Greece, June 1216, 2011, P373, DOI [10.1145/1989323.1989363, DOI 10.1145/1989323.1989363]
  • [3] Cao X., 2014, P VLDB ENDOWMENT, V7
  • [4] Cary A, 2010, LECT NOTES COMPUT SC, V6187, P87, DOI 10.1007/978-3-642-13818-8_8
  • [5] Spatial Keyword Query Processing: An Experimental Evaluation
    Chen, Lisi
    Cong, Gao
    Jensen, Christian S.
    Wu, Dingming
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (03): : 217 - 228
  • [6] Christoforaki M., 2011, CIKM, P423, DOI DOI 10.1145/2063576.2063641
  • [7] Cong G., 2009, PROC VLDB ENDOW, V2, P337, DOI DOI 10.14778/1687627.1687666
  • [8] Cong G., 2012, EFFICIENT SPATIAL KE
  • [9] Keyword search on spatial databases
    De Felipe, Ian
    Hristidis, Vagelis
    Rishe, Naphtali
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 656 - +
  • [10] MINIMUM COVERING SPHERE PROBLEM
    ELZINGA, DJ
    HEARN, DW
    [J]. MANAGEMENT SCIENCE SERIES A-THEORY, 1972, 19 (01): : 96 - 104