Scalable multi-dimensional RNN query processing

被引:6
作者
Ji, Changqing [1 ,2 ]
Qu, Wenyu [1 ]
Li, Zhiyang [1 ]
Xu, Yujie [1 ]
Li, Yuanyuan [1 ,3 ]
Wu, Junfeng [1 ,4 ,5 ]
机构
[1] Dalian Maritime Univ, Sch Informat Sci & Technol, Dalian, Peoples R China
[2] Dalian Univ, Sch Phys Sci & Technol, Dalian 116012, Peoples R China
[3] Dalian Jiaotong Univ, Sch Software, Dalian, Peoples R China
[4] Dalian Ocean Univ, Sch Educ Technol, Dalian, Peoples R China
[5] Dalian Ocean Univ, Ctr Comp, Dalian, Peoples R China
基金
美国国家科学基金会;
关键词
reverse nearest neighbor; big data; MapReduce; cloud computing; REVERSE NEAREST NEIGHBORS; MAPREDUCE;
D O I
10.1002/cpe.3513
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Reverse nearest neighbor (RNN) queries are the complimentary problem and particular interest in the past few years, such as location-based services, profile-based marketing, resource allocation, and traffic monitoring system. The one major drawback for the existing RNN is that it has inherent sequential nature and uses in-memory algorithm, which limits its applicability to large-scale spatial data queries. This paper proposes scalable algorithms for RNN queries in a distributed environment. Firstly, we investigate the Basic-scalable reverse nearest neighbor (SRNN) initialization query method based on the inverted grid index. Secondly, two optimization methods Lazy-SRNN and Eager-SRNN are proposed to effectively process scalable multi-dimensional RNN queries. Among them, Lazy-SRNN prunes the search space when all RNN objects are discovered in one pass; Eager-SRNN attempts to prune spatial objects incrementally as soon as they are visited. In addition, the SRNN algorithm is proved to be the first attempt for the exact scalable RNN algorithms in a distributed environment on multi-dimensional data sets. We show in an extensive experimental evaluation on real-world and synthetic data the scalability and the performance of our novel approach. Copyright (c) 2015 John Wiley & Sons, Ltd.
引用
收藏
页码:4156 / 4171
页数:16
相关论文
共 31 条
[11]   Efficient Multi-dimensional Spatial RkNN Query Processing with MapReduce [J].
Ji, Changqing ;
Hu, Hongbin ;
Xu, Yujie ;
Li, Yuanyuan ;
Qu, Wenyu .
2013 8TH CHINAGRID ANNUAL CONFERENCE (CHINAGRID), 2013, :63-68
[12]   BIG DATA PROCESSING: BIG CHALLENGES AND OPPORTUNITIES [J].
Ji, Changqing ;
Li, Yu ;
Qiu, Wenming ;
Jin, Yingwei ;
Xu, Yujie ;
Awada, Uchechukwu ;
Li, Keqiu ;
Qu, Wenyu .
JOURNAL OF INTERCONNECTION NETWORKS, 2012, 13 (3-4)
[13]  
Jiao YC, 2009, PGRID PARALLEL GRID
[14]  
Kang JamesM., 2007, ICDE, P806
[15]  
Korn F, 2000, SIGMOD REC, V29, P201, DOI 10.1145/335191.335415
[16]  
Li J, 2011, P 20 ACM INT C INF K, P2389
[17]  
Liu Ting, 2007, 2007 IEEE WORKSHOP A, DOI DOI 10.1109/WACV.2007.18
[18]  
Liu X, 2015, P IEEE INT C COMP CO
[19]   Fast Counting the Key Tags in Anonymous RFID Systems [J].
Liu, Xiulong ;
Li, Keqiu ;
Qi, Heng ;
Xiao, Bin ;
Xie, Xin .
2014 IEEE 22ND INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP), 2014, :59-70
[20]  
Neumeyer L., 2010, Proceedings 2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010), P170, DOI 10.1109/ICDMW.2010.172