GISQAF: MapReduce guided spatial query processing and analytics system

被引:3
|
作者
Al-Naami, Khaled Mohammed [1 ]
Seker, Sadi Evren [2 ]
Khan, Latifur [1 ]
机构
[1] Univ Texas Dallas, Dept Comp Sci, Dallas, TX USA
[2] Istanbul Medeniyet Univ, Dept Business, Istanbul, Turkey
基金
美国国家科学基金会;
关键词
big data; MapReduce; Hadoop; spatial query processing; data analytics; spatial co-occurring events;
D O I
10.1002/spe.2383
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The Global Database of Event, Language, and Tone (GDELT) is the only global political georeferenced event dataset with more than 250 million observations covering all countries in the world since January 1, 1979. TABARI and CAMEO are the tools that are used to collect and code events from all international news coverage. To query such big geospatial data, traditional RDBMS can no longer be used, and the need for parallel distributed solutions has become a necessity. MapReduce paradigm has proven to be a scalable platform to process and analyze Big Data in the cloud. Hadoop, as an implementation of MapReduce, is an open-source application that has been widely used and accepted in academia and industry. However, when dealing with Spatial Data, Hadoop is not equipped well and does not perform efficiently. SpatialHadoop is an extension of Hadoop with the support of spatial data. In this paper, we present Geographic Information System Query and Analytics Framework (GISQAF), which has been built on top of SpatialHadoop. GISQAF focuses on two parts: query processing and data analytics. For the query processing part, we show how this solution outperforms Hadoop query processing by orders of magnitude when applying queries on the GDELT dataset with a size of 60 GB. We show the results for various types of queries. For the data analytics part, we present an approach for finding Spatial co-occurring events. We show how GISQAF is suitable and efficient to handle data analytics techniques. Copyright (c) 2015 John Wiley & Sons, Ltd.
引用
收藏
页码:1329 / 1349
页数:21
相关论文
共 50 条
  • [21] MapReduce Based Scalable Range Query Architecture for Big Spatial Data
    Kizgindere, Umut
    Eken, Suleyman
    Sayar, Ahmet
    2015 IEEE/ACS 12TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2015,
  • [22] A Density-Aware Similarity Join Query Processing Algorithm on MapReduce
    Jang, Miyoung
    Song, Youngho
    Chang, Jae-Woo
    ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING: FUTURETECH & MUE, 2016, 393 : 469 - 475
  • [23] MapReduce Based Scalable Range Query Architecture for Big Spatial Data
    Eken, Suleyman
    Kizgindere, Umut
    Sayar, Ahmet
    RISE OF BIG SPATIAL DATA, 2017, : 263 - 272
  • [24] A Solution to Query Processing Challenges Through Smart Query Processor for Big Data Analytics
    Vaidya G.M.
    Kshirsagar M.M.
    SN Computer Science, 4 (2)
  • [25] A Flexible Query Answering System for Movie Analytics
    Leung, Carson K.
    Eckhardt, Lucas B.
    Sainbhi, Amanjyot Singh
    Tran, Cong Thanh Kevin
    Wen, Qi
    Lee, Wookey
    FLEXIBLE QUERY ANSWERING SYSTEMS, 2019, 11529 : 250 - 261
  • [26] SigMR: MapReduce-based SPARQL query processing by signature encoding and multi-way join
    Ahn, Jinhyun
    Im, Dong-Hyuk
    Kim, Hong-Gee
    JOURNAL OF SUPERCOMPUTING, 2015, 71 (10) : 3695 - 3725
  • [27] FunDa: scalable serverless data analytics and in situ query processing
    Elyes Lounissi
    Suvam Kumar Das
    Ronnit Peter
    Xiaozheng Zhang
    Suprio Ray
    Lianyin Jia
    Journal of Big Data, 12 (1)
  • [28] SigMR: MapReduce-based SPARQL query processing by signature encoding and multi-way join
    Jinhyun Ahn
    Dong-Hyuk Im
    Hong-Gee Kim
    The Journal of Supercomputing, 2015, 71 : 3695 - 3725
  • [29] LShape Partitioning: Parallel Skyline Query Processing using MapReduce (Extended Abstract)
    Wijayanto, Heri
    Wang, Wenlu
    Ku, Wei-Shinn
    Chen, Arbee L. P.
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 2340 - 2341
  • [30] MapReduce-based Image Processing System with Automated Parallelization
    Sozykin, A. V.
    Goldshtein, M. L.
    BULLETIN OF THE SOUTH URAL STATE UNIVERSITY SERIES-MATHEMATICAL MODELLING PROGRAMMING & COMPUTER SOFTWARE, 2012, (13): : 109 - 118