共 1 条
Batch Processing of Top-k Spatial-Textual Queries
被引:17
|作者:
Choudhury, Farhana M.
[1
]
Culpepper, J. Shane
[1
]
Bao, Zhifeng
[1
]
Sellis, Timos
[2
]
机构:
[1] RMIT Univ, Sch Sci, Melbourne, Vic 3000, Australia
[2] Swinburne Univ Technol, Data Sci Res Inst, Hawthorn, Vic 3122, Australia
基金:
中国国家自然科学基金;
澳大利亚研究理事会;
关键词:
Spatial-textual queries;
batch queries;
spatial-textual indexing;
efficient query processing;
D O I:
10.1145/3196155
中图分类号:
TP7 [遥感技术];
学科分类号:
081102 ;
0816 ;
081602 ;
083002 ;
1404 ;
摘要:
Since the mid-2000s, everal indexing techniques have been proposed to efficiently answer top-k spatial-textual queries. However, all of these approaches focus on answering one query at a time. In contrast, how to design efficient algorithms that can exploit similarities between incoming queries to improve performance has received little attention. In this article, we study a series of efficient approaches to batch process multiple topk spatial-textual queries concurrently. We carefully design a variety of indexing structures for the problem space by exploring the effect of prioritizing spatial and textual properties on system performance. Specifically, we present an efficient traversal method, SF-Sep, over an existing space-prioritized index structure. Then, we propose a new space-prioritized index structure, the MIR-Tree to support a filter-and-refine based technique, SF-Grp. To support the processing of text-intensive data, we propose an augmented, inverted indexing structure that can easily be added into existing text search engine architectures and a novel traversal method for batch processing of the queries. In all of these approaches, the goal is to improve the overall performance by sharing the I/O costs of similar queries. Finally, we demonstrate significant I/O savings in our algorithms over traditional approaches by extensive experiments on three real datasets and compare how properties of different datasets affect the performance. Many applications in streaming, micro-batching of continuous queries, and privacy-aware search can benefit from this line of work.
引用
收藏
页数:40
相关论文