Large-Scale Spatial Join Query Processing in Cloud

Times cited: 0
Authors
You, Simin [1]
Zhang, Jianting [2]
Gruenwald, Le [3]
Affiliations
[1] CUNY, Grad Ctr, Dept Comp Sci, New York, NY 10021 USA
[2] CUNY, Dept Comp Sci, New York, NY 10021 USA
[3] Univ Oklahoma, Dept Comp Sci, Norman, OK 73019 USA
Source
2015 31ST IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW) | 2015
Keywords
Spatial Join; Spark; Impala; Cloud Computing; MapReduce
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
The rapidly increasing amount of location data available in many applications has made it desirable to process their large-scale spatial queries in Cloud for performance and scalability. We report our designs and implementations of two prototype systems that are ready for Cloud deployments: SpatialSpark based on Apache Spark and ISP-MC based on Cloudera Impala. Both systems support indexed spatial joins based on point-in-polygon test and point-to-polyline distance computation. Experiments on the pickup locations of ~170 million taxi trips in New York City and ~10 million global species occurrence records have demonstrated both efficiency and scalability using Amazon EC2 clusters.
Pages: 34-41
Number of pages: 8