Scalable multi-dimensional RNN query processing

被引:6
作者
Ji, Changqing [1 ,2 ]
Qu, Wenyu [1 ]
Li, Zhiyang [1 ]
Xu, Yujie [1 ]
Li, Yuanyuan [1 ,3 ]
Wu, Junfeng [1 ,4 ,5 ]
机构
[1] Dalian Maritime Univ, Sch Informat Sci & Technol, Dalian, Peoples R China
[2] Dalian Univ, Sch Phys Sci & Technol, Dalian 116012, Peoples R China
[3] Dalian Jiaotong Univ, Sch Software, Dalian, Peoples R China
[4] Dalian Ocean Univ, Sch Educ Technol, Dalian, Peoples R China
[5] Dalian Ocean Univ, Ctr Comp, Dalian, Peoples R China
基金
美国国家科学基金会;
关键词
reverse nearest neighbor; big data; MapReduce; cloud computing; REVERSE NEAREST NEIGHBORS; MAPREDUCE;
D O I
10.1002/cpe.3513
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Reverse nearest neighbor (RNN) queries are the complimentary problem and particular interest in the past few years, such as location-based services, profile-based marketing, resource allocation, and traffic monitoring system. The one major drawback for the existing RNN is that it has inherent sequential nature and uses in-memory algorithm, which limits its applicability to large-scale spatial data queries. This paper proposes scalable algorithms for RNN queries in a distributed environment. Firstly, we investigate the Basic-scalable reverse nearest neighbor (SRNN) initialization query method based on the inverted grid index. Secondly, two optimization methods Lazy-SRNN and Eager-SRNN are proposed to effectively process scalable multi-dimensional RNN queries. Among them, Lazy-SRNN prunes the search space when all RNN objects are discovered in one pass; Eager-SRNN attempts to prune spatial objects incrementally as soon as they are visited. In addition, the SRNN algorithm is proved to be the first attempt for the exact scalable RNN algorithms in a distributed environment on multi-dimensional data sets. We show in an extensive experimental evaluation on real-world and synthetic data the scalability and the performance of our novel approach. Copyright (c) 2015 John Wiley & Sons, Ltd.
引用
收藏
页码:4156 / 4171
页数:16
相关论文
共 31 条
[1]  
Akdogan A., 2010, Proceedings of the 2010 IEEE 2nd International Conference on Cloud Computing Technology and Science (CloudCom 2010), P9, DOI 10.1109/CloudCom.2010.92
[2]  
Bu YY, 2010, PROC VLDB ENDOW, V3, P285
[3]  
Changqing Ji, 2012, 2012 Seventh ChinaGrid Annual Conference (ChinaGrid 2012), P25, DOI 10.1109/ChinaGrid.2012.19
[4]  
Chen YX, 2007, KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P133
[5]  
Comap, 2010, CONS MATH ITS APPL
[6]  
Condie Tyson, 2010, SIGMOD, P1115, DOI DOI 10.1145/1807167.1807295
[7]  
Condie Tyson, 2010, P 7 USENIX C NETWORK, P21
[8]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[9]   ComMapReduce: An improvement of MapReduce with lightweight communication mechanisms [J].
Ding, Linlin ;
Wang, Guoren ;
Xin, Junchang ;
Wang, Xiaoyang ;
Huang, Shan ;
Zhang, Rui .
DATA & KNOWLEDGE ENGINEERING, 2013, 88 :224-247
[10]   Scalable nearest neighbor query processing based on Inverted Grid Index [J].
Ji, Changqing ;
Li, Zhiyang ;
Qu, Wenyu ;
Xu, Yujie ;
Li, Yuanyuan .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2014, 44 :172-182