Top-k Spatio-textual Similarity Search

被引:0
作者
Liu, Sitong [1 ]
Chu, Yaping [1 ]
Hu, Huiqi [1 ]
Feng, Jianhua [1 ]
Zhu, Xuan [2 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Samsung R&D Inst, Beijing, Peoples R China
来源
WEB-AGE INFORMATION MANAGEMENT, WAIM 2014 | 2014年 / 8485卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Location-based services have attracted significant attention for the ubiquitous smartphones equipped with GPS systems. These services (e.g., Twitter, Foursquare) generate large amounts of spatio-textual data which contain both geographical location and textual description. In this paper, we study a prevalent top-k spatio-textual similarity search problem: Given a set of objects and a user query, find k most relevant objects considering both spatial location and textual description. We make the following contributions: (1) We propose a TA-based framework and devise efficient algorithms to incrementally visit the objects with current highest spatial or textual similarity. (2) We explore a hybrid partition pattern by integrating spatial and textual pruning power. We further propose a partition-based algorithm which can significantly improve the performance. (3) We have conducted extensive experiments on real and synthetic datasets. Experimental results show that our methods outperform state-of-the-art algorithms and achieve high performance.
引用
收藏
页码:602 / 614
页数:13
相关论文
共 11 条
  • [1] Arasu A., 2006, Proceedings of VLDB Endowment, P918
  • [2] Bayardo R.J., 2007, WWW INT C WORLD WID, P131, DOI [DOI 10.1145/1242572.1242591, 10.1145/1242572.1242591]
  • [3] Cong Gao., 2009, PVLDB
  • [4] Optimal aggregation algorithms for middleware
    Fagin, R
    Lotem, A
    Naor, M
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2003, 66 (04) : 614 - 656
  • [5] Guttman Antonin., 1984, P 1984 ACM SIGMOD C, P47
  • [6] A Survey of Top-k Query Processing Techniques in Relational Database Systems
    Ilyas, Ihab F.
    Beskales, George
    Soliman, Mohamed A.
    [J]. ACM COMPUTING SURVEYS, 2008, 40 (04)
  • [7] Katayama N., 1997, SIGMOD Record, V26, P369, DOI 10.1145/253262.253347
  • [8] Roussopoulos N., 1995, ACM SIGMOD RECORD, V24, P7179
  • [9] Xiao C., 2008, WWW, P131
  • [10] Xiao CA, 2009, PROC INT CONF DATA, P916, DOI 10.1109/ICDE.2009.111