Batch Processing of Top-k Spatial-Textual Queries

Cited by: 17
Authors
Choudhury, Farhana M. [1]
Culpepper, J. Shane [1]
Bao, Zhifeng [1]
Sellis, Timos [2]
Affiliations
[1] RMIT Univ, Sch Sci, Melbourne, Vic 3000, Australia
[2] Swinburne Univ Technol, Data Sci Res Inst, Hawthorn, Vic 3122, Australia
Funding
Australian Research Council; National Natural Science Foundation of China;
Keywords
Spatial-textual queries; batch queries; spatial-textual indexing; efficient query processing;
DOI
10.1145/3196155
Chinese Library Classification (CLC) number
TP7 [Remote sensing technology];
Subject classification codes
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
Abstract
Since the mid-2000s, several indexing techniques have been proposed to efficiently answer top-k spatial-textual queries. However, all of these approaches focus on answering one query at a time. In contrast, how to design efficient algorithms that can exploit similarities between incoming queries to improve performance has received little attention. In this article, we study a series of efficient approaches to batch process multiple top-k spatial-textual queries concurrently. We carefully design a variety of indexing structures for the problem space by exploring the effect of prioritizing spatial and textual properties on system performance. Specifically, we present an efficient traversal method, SF-Sep, over an existing space-prioritized index structure. Then, we propose a new space-prioritized index structure, the MIR-Tree, to support a filter-and-refine-based technique, SF-Grp. To support the processing of text-intensive data, we propose an augmented inverted indexing structure that can easily be added into existing text search engine architectures, and a novel traversal method for batch processing of the queries. In all of these approaches, the goal is to improve the overall performance by sharing the I/O costs of similar queries. Finally, we demonstrate significant I/O savings in our algorithms over traditional approaches through extensive experiments on three real datasets, and we compare how properties of different datasets affect the performance. Many applications in streaming, micro-batching of continuous queries, and privacy-aware search can benefit from this line of work.
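The idea of sharing I/O across a batch can be illustrated with a minimal sketch. It is not the SF-Sep, SF-Grp, or MIR-Tree method described in the abstract: the grid "pages", the scoring function, the `alpha` weight, and all function names below are assumptions made only for illustration. Each grid cell stands in for one disk page, and the batch version reads each page once for all queries instead of once per query.

```python
import heapq
import math
from collections import defaultdict


def score(obj, query, alpha=0.5, max_dist=math.sqrt(2)):
    """Hypothetical ranking: weighted mix of spatial proximity and keyword overlap."""
    (ox, oy), oterms = obj
    (qx, qy), qterms = query
    proximity = 1.0 - math.hypot(ox - qx, oy - qy) / max_dist
    overlap = len(oterms & qterms) / len(qterms)
    return alpha * proximity + (1 - alpha) * overlap


def build_pages(objects, cells_per_side=2):
    """Group object ids into grid cells; each cell stands in for one disk page."""
    pages = defaultdict(list)
    for oid, ((x, y), _) in enumerate(objects):
        cx = min(int(x * cells_per_side), cells_per_side - 1)
        cy = min(int(y * cells_per_side), cells_per_side - 1)
        pages[(cx, cy)].append(oid)
    return pages


def topk_batch(objects, pages, queries, k):
    """Read every page once and score its objects against all queries in the batch."""
    heaps = [[] for _ in queries]  # one min-heap of (score, oid) per query
    page_reads = 0
    for oids in pages.values():
        page_reads += 1  # one simulated I/O, shared by the whole batch
        for oid in oids:
            for qi, q in enumerate(queries):
                item = (score(objects[oid], q), oid)
                if len(heaps[qi]) < k:
                    heapq.heappush(heaps[qi], item)
                elif item > heaps[qi][0]:
                    heapq.heapreplace(heaps[qi], item)
    results = [[oid for _, oid in sorted(h, reverse=True)] for h in heaps]
    return results, page_reads


def topk_one_at_a_time(objects, pages, queries, k):
    """Baseline: answer each query separately, re-reading every page per query."""
    results, page_reads = [], 0
    for q in queries:
        res, reads = topk_batch(objects, pages, [q], k)
        results.append(res[0])
        page_reads += reads
    return results, page_reads


# Toy data in the unit square: (location, set of terms). Entirely made up.
objects = [
    ((0.1, 0.1), {"coffee", "wifi"}),
    ((0.2, 0.8), {"pizza"}),
    ((0.9, 0.9), {"coffee"}),
    ((0.6, 0.4), {"sushi", "wifi"}),
]
queries = [((0.0, 0.0), {"coffee"}), ((1.0, 1.0), {"coffee", "wifi"})]
pages = build_pages(objects)

batch_results, batch_reads = topk_batch(objects, pages, queries, k=2)
single_results, single_reads = topk_one_at_a_time(objects, pages, queries, k=2)
print(batch_results, batch_reads, single_reads)
```

Both variants return the same rankings; only the number of simulated page reads differs. The sketch scans every page and does no pruning, so it shows the shared-read effect only, not the index traversal or filter-and-refine steps the abstract refers to.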
Pages: 40