Spatial Join Query Processing in Cloud: Analyzing Design Choices and Performance Comparisons

Cited by: 14
Authors
You, Simin [1]
Zhang, Jianting [2]
Gruenwald, Le [3]
Affiliations
[1] CUNY Grad Ctr, Dept Comp Sci, New York, NY 10016 USA
[2] CUNY City Coll, Dept Comp Sci, New York, NY 10031 USA
[3] Univ Oklahoma, Dept Comp Sci, Norman, OK 73019 USA
Source
2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS | 2015
Keywords
Spatial Join; Query Processing; Cloud Computing; Design; Performance
DOI
10.1109/ICPPW.2015.41
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The volume of GPS-recorded locations and many other types of geospatial data is increasing rapidly, and processing large-scale spatial joins in the Cloud for performance and scalability is becoming increasingly popular. In this study, we compare three leading Cloud-based spatial data management systems, namely HadoopGIS, SpatialHadoop and SpatialSpark, both conceptually, through an analysis of their design choices, and empirically, through experiments on real-world datasets. Experiments on both a workstation serving as a single-node cluster and Amazon EC2 clusters of up to 10 nodes show that a combination of factors, including the Cloud platform, the data access model and the underlying geometry library, has a significant impact on realized performance. While SpatialHadoop generally wins on robustness, SpatialSpark is the clear winner on efficiency owing to its in-memory processing.
Pages: 90-97
Page count: 8