GISQF: An Efficient Spatial Query Processing System

被引:17
作者
Al-Naami, Khaled Mohammed [1 ]
Seker, Sadi [2 ]
Khan, Latifur [1 ]
机构
[1] Univ Texas Dallas, Dept Comp Sci, Dallas, TX 75083 USA
[2] Istanbul Medeniyet Univ, Dept Business, Istanbul, Turkey
来源
2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD) | 2014年
基金
美国国家科学基金会;
关键词
MAPREDUCE;
D O I
10.1109/CLOUD.2014.96
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Collecting observations from all international news coverage and using TABARI software to code events, the Global Database of Event, Language, and Tone (GDELT) is the only global political georeferenced event dataset with 250+ million observations covering all countries in the world from January 1, 1979 to the present with daily updates. The purpose of this widely used dataset is to help understand and uncover spatial, temporal and perceptual trends and behaviors of the social and international system. To query such big geospatial data, traditional RDBMS can no longer be used and the need for parallel distributed solutions has become a necessity. MapReduce paradigm has proved to be a scalable platform to process and analyze Big Data in the cloud. Hadoop as an implementation of MapReduce is an open source application that has been widely used and accepted in academia and industry. However, when dealing with Spatial Data, Hadoop is not equipped well and falls short as it doesnt perform efficiently in terms of running time. SpatialHadoop is an extension of Hadoop with the support of spatial data. In this paper, we present Geographic Information System Querying Framework (GISQF) to process Massive Spatial Data. This framework has been built on top of the open source SpatialHadoop system which exploits two-layer spatial indexing techniques to speed up query processing. We show how this solution outperforms Hadoop query processing by orders of magnitude when applying queries on GDELT dataset with a size of 60 GB. We show the results for three types of queries, Longitude Latitude Point queries, Circle-Area queries, and Aggregation queries.
引用
收藏
页码:681 / 688
页数:8
相关论文
共 50 条
  • [21] Efficient Spark-Based Framework for Big Geospatial Data Query Processing and Analysis
    Aljawarneh, Isam Mashhour
    Bellavista, Paolo
    Corradi, Antonio
    Montanari, Rebecca
    Foschini, Luca
    Zanotti, Andrea
    2017 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2017, : 851 - 856
  • [22] Query Processing for RDF Databases
    Kaoudi, Zoi
    Kementsietsidis, Anastasios
    REASONING WEB: REASONING ON THE WEB IN THE BIG DATA ERA, 2014, 8714 : 141 - +
  • [23] An Efficient Two-Table Join Query Processing Based on Extended Bloom Filter in MapReduce
    Wang, Junlu
    Pang, Jun
    Li, Xiaoyan
    Han, Baishuo
    Huang, Lei
    Ding, Linlin
    WEB-AGE INFORMATION MANAGEMENT, 2016, 9998 : 249 - 258
  • [24] RDF Data Storage Techniques for Efficient SPARQL Query Processing using Distributed Computation Engines
    Hassan, Mahmudul
    Bansal, Srividya K.
    2018 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2018, : 323 - 330
  • [25] (A)kNN Query Processing on the Cloud: A Survey
    Nodarakis, Nikolaos
    Rapti, Angeliki
    Sioutas, Spyros
    Tsakalidis, Athanasios K.
    Tsolis, Dimitrios
    Tzimas, Giannis
    Panagis, Yannis
    ALGORITHMIC ASPECTS OF CLOUD COMPUTING, ALGOCLOUD 2016, 2017, 10230 : 26 - 40
  • [26] SEIP: System for Efficient Image Processing on Distributed Platform
    Liu, Tao
    Liu, Yi
    Li, Qin
    Wang, Xiang-Rong
    Gao, Fei
    Zhu, Yan-Chao
    Qian, De-Pei
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2015, 30 (06) : 1215 - 1232
  • [27] Skyline and reverse skyline query processing in SpatialHadoop
    Kalyvas, Christos
    Maragoudakis, Manolis
    DATA & KNOWLEDGE ENGINEERING, 2019, 122 : 55 - 80
  • [28] Join Query Processing in Data Quality Management
    Yue, Mingliang
    Gao, Hong
    Shi, Shengfei
    Wang, Hongzhi
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2016, 2016, 9645 : 329 - 342
  • [29] Scatter-Gather-Merge: An efficient star-join query processing algorithm for data-parallel frameworks
    Han, Hyuck
    Jung, Hyungsoo
    Eom, Hyeonsang
    Yeom, Heon Y.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2011, 14 (02): : 183 - 197
  • [30] Scatter-Gather-Merge: An efficient star-join query processing algorithm for data-parallel frameworks
    Hyuck Han
    Hyungsoo Jung
    Hyeonsang Eom
    Heon Y. Yeom
    Cluster Computing, 2011, 14 : 183 - 197