Big Spatial Data Processing With Apache Spark

被引:0
作者
Boyi Shangguan [1 ]
Peng Yue [1 ]
Wu, Zhaoyan [1 ]
Jiang, Liangcun [2 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, 129 Luoyu Rd, Wuhan 430079, Hubei, Peoples R China
[2] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, 129 Luoyu Rd, Wuhan 430079, Hubei, Peoples R China
来源
2017 6TH INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS | 2017年
基金
中国国家自然科学基金;
关键词
Big Spatial Data; Apache Spark; SpatialRDD; SparkSpatialSDK; MAPREDUCE; SYSTEM;
D O I
暂无
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Big data technologies have shown great promise for managing geospatial data in recent years. In order to deal with the growing spatial data, a high performance spatial data processing system layered on big data technologies is needed. In this paper, we present an approach to process big spatial data with Apache Spark, a fast and generic engine for large-scale data processing. We developed a software development kit named SparkSpatialSDK, which takes spatial characteristics of geospatial data into consideration and provides a Spark-enabled spatial data structure and API to allow users easily perform spatial analyses with big spatial data. The spatial data structure couples geometric data structure (point, line, and polygon) with Resilient Distributed Datasets (RDD). An interface, called SpatialRDD, is provided to access big spatial data stored in distributed database systems like HBase and load the data in Spark processing engine. We illustrates the applications of the API using some example processing functions such as the spatial range and spatial k-nearest neighbor queries. The results demonstrate the applicability of using SparkSpatialSDK for big geospatial data processing.
引用
收藏
页码:239 / 242
页数:4
相关论文
共 14 条
[11]  
Xie M., 2000, Geographic Information Sciences, V6, P170, DOI [10.1080/10824000009480547, DOI 10.1080/10824000009480547]
[12]  
You SM, 2015, 2015 13TH IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), P34, DOI 10.1109/ICDEW.2015.7129541
[13]  
Yu J., 2015, Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems, P70
[14]   HBaseSpatial: a Scalable Spatial Data Storage Based on HBase [J].
Zhang, Ningyu ;
Zheng, Guozhou ;
Chen, Huajun ;
Chen, Jiaoyan ;
Chen, Xi .
2014 IEEE 13TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM), 2014, :644-651