GISQF: An Efficient Spatial Query Processing System

被引:17
作者
Al-Naami, Khaled Mohammed [1 ]
Seker, Sadi [2 ]
Khan, Latifur [1 ]
机构
[1] Univ Texas Dallas, Dept Comp Sci, Dallas, TX 75083 USA
[2] Istanbul Medeniyet Univ, Dept Business, Istanbul, Turkey
来源
2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD) | 2014年
基金
美国国家科学基金会;
关键词
MAPREDUCE;
D O I
10.1109/CLOUD.2014.96
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Collecting observations from all international news coverage and using TABARI software to code events, the Global Database of Event, Language, and Tone (GDELT) is the only global political georeferenced event dataset with 250+ million observations covering all countries in the world from January 1, 1979 to the present with daily updates. The purpose of this widely used dataset is to help understand and uncover spatial, temporal and perceptual trends and behaviors of the social and international system. To query such big geospatial data, traditional RDBMS can no longer be used and the need for parallel distributed solutions has become a necessity. MapReduce paradigm has proved to be a scalable platform to process and analyze Big Data in the cloud. Hadoop as an implementation of MapReduce is an open source application that has been widely used and accepted in academia and industry. However, when dealing with Spatial Data, Hadoop is not equipped well and falls short as it doesnt perform efficiently in terms of running time. SpatialHadoop is an extension of Hadoop with the support of spatial data. In this paper, we present Geographic Information System Querying Framework (GISQF) to process Massive Spatial Data. This framework has been built on top of the open source SpatialHadoop system which exploits two-layer spatial indexing techniques to speed up query processing. We show how this solution outperforms Hadoop query processing by orders of magnitude when applying queries on GDELT dataset with a size of 60 GB. We show the results for three types of queries, Longitude Latitude Point queries, Circle-Area queries, and Aggregation queries.
引用
收藏
页码:681 / 688
页数:8
相关论文
共 50 条
  • [11] Large-Scale Spatial Join Query Processing in Cloud
    You, Simin
    Zhang, Jianting
    Gruenwald, Le
    2015 13TH IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2015, : 34 - 41
  • [12] A Survey of Traditional and MapReduce-Based Spatial Query Processing Approaches
    Singh, Hari
    Bawa, Seema
    SIGMOD RECORD, 2017, 46 (02) : 18 - 29
  • [13] A Boundary Filtering Based Spatial Join Query Processing Optimization Algorithm
    Qiao, Baiyou
    Zhu, Junhai
    Shen, Muchuan
    Chen, Yang
    2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 1764 - 1769
  • [14] Efficient Distributed Query Processing on Large Scale RDF Graph Data
    Wang X.
    Xu Q.
    Chai L.-L.
    Yang Y.-J.
    Chai Y.-P.
    Ruan Jian Xue Bao/Journal of Software, 2019, 30 (03): : 498 - 514
  • [15] Efficient Graph Query Processing over Geo-Distributed Datacenters
    Yuan, Ye
    Ma, Delong
    Wen, Zhenyu
    Ma, Yuliang
    Wang, Guoren
    Chen, Lei
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 619 - 628
  • [16] LSShare: an efficient multiple query optimization system in the cloud
    Ge, Xing
    Yao, Bin
    Guo, Minyi
    Xu, Changliang
    Zhou, Jingyu
    Wu, Chentao
    Xue, Guangtao
    DISTRIBUTED AND PARALLEL DATABASES, 2014, 32 (04) : 583 - 605
  • [17] Efficient Indexing and Query Processing of Model-View Sensor Data in the Cloud
    Guo, Tian
    Papaioannou, Thanasis G.
    Aberer, Karl
    BIG DATA RESEARCH, 2014, 1 : 52 - 65
  • [18] Efficient Skyline query processing of massive data based on Map-Reduce
    Ding L.-L.
    Xin J.-C.
    Wang G.-R.
    Huang S.
    Jisuanji Xuebao/Chinese Journal of Computers, 2011, 34 (10): : 1785 - 1796
  • [19] What-If Query Processing Policy for Big Data in OLAP System
    Xu, Huan
    Luo, Hao
    He, Jieyue
    2013 INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2013, : 110 - 116
  • [20] Efficient ELM-Based Two Stages Query Processing Optimization for Big Data
    Ding, Linlin
    Liu, Yu
    Song, Baoyan
    Xin, Junchang
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015