GISQAF: MapReduce guided spatial query processing and analytics system

被引:3
|
作者
Al-Naami, Khaled Mohammed [1 ]
Seker, Sadi Evren [2 ]
Khan, Latifur [1 ]
机构
[1] Univ Texas Dallas, Dept Comp Sci, Dallas, TX USA
[2] Istanbul Medeniyet Univ, Dept Business, Istanbul, Turkey
基金
美国国家科学基金会;
关键词
big data; MapReduce; Hadoop; spatial query processing; data analytics; spatial co-occurring events;
D O I
10.1002/spe.2383
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The Global Database of Event, Language, and Tone (GDELT) is the only global political georeferenced event dataset with more than 250 million observations covering all countries in the world since January 1, 1979. TABARI and CAMEO are the tools that are used to collect and code events from all international news coverage. To query such big geospatial data, traditional RDBMS can no longer be used, and the need for parallel distributed solutions has become a necessity. MapReduce paradigm has proven to be a scalable platform to process and analyze Big Data in the cloud. Hadoop, as an implementation of MapReduce, is an open-source application that has been widely used and accepted in academia and industry. However, when dealing with Spatial Data, Hadoop is not equipped well and does not perform efficiently. SpatialHadoop is an extension of Hadoop with the support of spatial data. In this paper, we present Geographic Information System Query and Analytics Framework (GISQAF), which has been built on top of SpatialHadoop. GISQAF focuses on two parts: query processing and data analytics. For the query processing part, we show how this solution outperforms Hadoop query processing by orders of magnitude when applying queries on the GDELT dataset with a size of 60 GB. We show the results for various types of queries. For the data analytics part, we present an approach for finding Spatial co-occurring events. We show how GISQAF is suitable and efficient to handle data analytics techniques. Copyright (c) 2015 John Wiley & Sons, Ltd.
引用
收藏
页码:1329 / 1349
页数:21
相关论文
共 50 条
  • [1] Spatial Data Processing with MapReduce
    Gunawardena, Tilani
    Vicari, Annamaria
    Mecca, Giansalvatore
    2015 IEEE 10TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2015, : 485 - 490
  • [2] Scalable Spatial Analytics and In Situ Query Processing in DaskDB
    Das, Suvam Kumar
    Peter, Ronnit
    Ray, Suprio
    PROCEEDINGS OF 2023 18TH INTERNATIONAL SYMPOSIUM ON SPATIAL AND TEMPORAL DATA, SSTD 2023, 2023, : 189 - 193
  • [3] A Survey of Traditional and MapReduce-Based Spatial Query Processing Approaches
    Singh, Hari
    Bawa, Seema
    SIGMOD RECORD, 2017, 46 (02) : 18 - 29
  • [4] Efficient Multi-dimensional Spatial RkNN Query Processing with MapReduce
    Ji, Changqing
    Hu, Hongbin
    Xu, Yujie
    Li, Yuanyuan
    Qu, Wenyu
    2013 8TH CHINAGRID ANNUAL CONFERENCE (CHINAGRID), 2013, : 63 - 68
  • [5] Reducing I/O Cost in OLAP Query Processing with MapReduce
    Kang, Woo-Lam
    Kim, Hyeon-Gyu
    Lee, Yoon-Joon
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (02): : 444 - 447
  • [6] A survey of large-scale analytical query processing in MapReduce
    Doulkeridis, Christos
    Norvag, Kjetil
    VLDB JOURNAL, 2014, 23 (03) : 355 - 380
  • [7] A survey of large-scale analytical query processing in MapReduce
    Christos Doulkeridis
    Kjetil Nørvåg
    The VLDB Journal, 2014, 23 : 355 - 380
  • [8] Augmented Dynamic Skyline Query Processing Method Based on MapReduce
    Ding L.-L.
    Cui Z.-Q.
    Yin X.-K.
    Wang J.-L.
    Song B.-Y.
    Song, Bao-Yan (bysong@lnu.edu.cn), 2018, Chinese Institute of Electronics (46): : 1062 - 1070
  • [9] GISQF: An Efficient Spatial Query Processing System
    Al-Naami, Khaled Mohammed
    Seker, Sadi
    Khan, Latifur
    2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2014, : 681 - 688
  • [10] Efficient Probabilistic Skyline Query Processing in MapReduce
    Ding, Linlin
    Wang, Guoren
    Xin, Junchang
    Yuan, Ye
    2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA, 2013, : 203 - 210