Batch Processing of Top-k Spatial-Textual Queries

Cited by: 17
Authors
Choudhury, Farhana M. [1]
Culpepper, J. Shane [1]
Bao, Zhifeng [1]
Sellis, Timos [2]
Affiliations
[1] RMIT Univ, Sch Sci, Melbourne, Vic 3000, Australia
[2] Swinburne Univ Technol, Data Sci Res Inst, Hawthorn, Vic 3122, Australia
Funding
National Natural Science Foundation of China; Australian Research Council;
Keywords
Spatial-textual queries; batch queries; spatial-textual indexing; efficient query processing;
DOI
10.1145/3196155
Chinese Library Classification number
TP7 [Remote sensing technology];
Subject classification codes
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
Abstract
Since the mid-2000s, several indexing techniques have been proposed to efficiently answer top-k spatial-textual queries. However, all of these approaches focus on answering one query at a time. In contrast, how to design efficient algorithms that can exploit similarities between incoming queries to improve performance has received little attention. In this article, we study a series of efficient approaches to batch process multiple top-k spatial-textual queries concurrently. We carefully design a variety of indexing structures for the problem space by exploring the effect of prioritizing spatial and textual properties on system performance. Specifically, we present an efficient traversal method, SF-Sep, over an existing space-prioritized index structure. Then, we propose a new space-prioritized index structure, the MIR-Tree, to support a filter-and-refine based technique, SF-Grp. To support the processing of text-intensive data, we propose an augmented, inverted indexing structure that can easily be added into existing text search engine architectures and a novel traversal method for batch processing of the queries. In all of these approaches, the goal is to improve the overall performance by sharing the I/O costs of similar queries. Finally, we demonstrate significant I/O savings in our algorithms over traditional approaches by extensive experiments on three real datasets and compare how properties of different datasets affect the performance. Many applications in streaming, micro-batching of continuous queries, and privacy-aware search can benefit from this line of work.
Pages: 40