A framework for efficient spatial web object retrieval

被引:94
作者
Wu, Dingming [1 ]
Cong, Gao [2 ]
Jensen, Christian S. [3 ]
机构
[1] Hong Kong Baptist Univ, Dept Comp Sci, Kowloon Tong, Hong Kong, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
[3] Aarhus Univ, Dept Comp Sci, DK-8000 Aarhus, Denmark
关键词
Spatial web; Keyword query; Spatial query; Top-K query; Inverted file; R-tree; Spatio-textual indexing; SIGNATURE FILES; INVERTED FILES; KEYWORD SEARCH;
D O I
10.1007/s00778-012-0271-0
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The conventional Internet is acquiring a geospatial dimension. Web documents are being geo-tagged and geo-referenced objects such as points of interest are being associated with descriptive text documents. The resulting fusion of geo-location and documents enables new kinds of queries that take into account both location proximity and text relevancy. This paper proposes a new indexing framework for top-k spatial text retrieval. The framework leverages the inverted file for text retrieval and the R-tree for spatial proximity querying. Several indexing approaches are explored within this framework. The framework encompasses algorithms that utilize the proposed indexes for computing location-aware as well as region-aware top-k text retrieval queries, thus taking into account both text relevancy and spatial proximity to prune the search space. Results of empirical studies with an implementation of the framework demonstrate that the paper's proposal is capable of excellent performance.
引用
收藏
页码:797 / 822
页数:26
相关论文
共 41 条
[1]  
Amitay E., 2004, Proceedings of Sheffield SIGIR 2004. The Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P273, DOI 10.1145/1008992.1009040
[2]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theory of NP-Completeness
[3]  
[Anonymous], 1994, TREC
[4]  
[Anonymous], 1998, SIGIR 98 P 21 ANN IN, DOI DOI 10.1145/290941.291008
[5]  
[Anonymous], 1990, P 1990 ACM SIGMOD IN, DOI DOI 10.1145/93597.98741
[6]  
Baeza-Yates R, 1999, MODERN INFORM RETRIE, V463
[7]   Evaluating Top-k queries over web-accessible Databases [J].
Bruno, N ;
Gravano, L ;
Marian, A .
18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, :369-+
[8]   Retrieving Top-k Prestige-Based Relevant Spatial Web Objects [J].
Cao, Xin ;
Cong, Gao ;
Jensen, Christian S. .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01) :373-384
[9]  
Chen Y.-Y., 2006, P ACM SIGMOD INT C M, P277
[10]  
Cong G, 2008, P 31 ANN INT ACM SIG, P467, DOI [DOI 10.1145/1390334.1390415, 10.1145/1390334.1390415]