HadoopTrajectory: a Hadoop spatiotemporal data processing extension

被引:20
作者
Bakli, Mohamed [1 ]
Sakr, Mahmoud [2 ,3 ]
Soliman, Taysir Hassan A. [1 ]
机构
[1] Assiut Univ, Asyut, Egypt
[2] Ain Shams Univ, Cairo, Egypt
[3] Univ Libre Bruxelles, Brussels, Belgium
关键词
Spatiotemporal; Hadoop; 3DR-tree; Trajectory data management; Big data; MOVING-OBJECTS; SYSTEM;
D O I
10.1007/s10109-019-00292-4
中图分类号
P9 [自然地理学]; K9 [地理];
学科分类号
0705 ; 070501 ;
摘要
The recent advances in location tracking technologies and the widespread use of location-aware applications have resulted in big datasets of moving object trajectories. While there exists a couple of research prototypes for moving object databases, there is a lack of systems that can process big spatiotemporal data. This work proposes HadoopTrajectory, a Hadoop extension for spatiotemporal data processing. The extension adds spatiotemporal types and operators to the Hadoop core. These types and operators can be directly used in MapReduce programs, which gives the Hadoop user the possibility to write spatiotemporal data analytics programs. The storage layer of Hadoop, the HDFS, is extended by types to represent trajectory data and their corresponding input and output functions. It is also extended by file splitters and record readers. This enables Hadoop to read big files of moving object trajectories such as vehicle GPS tracks and split them over worker nodes for distributed processing. The storage layer is also extended by spatiotemporal indexes that help filtering the data before splitting it over the worker nodes. Several data access functions are provided so that the MapReduce layer can deal with this data. The MapReduce layer is extended with trajectory processing operators, to compute for instance the length of a trajectory in meters. This paper describes the extension and evaluates it using a synthetic dataset and a real dataset. Comparisons with non-Hadoop systems and with standard Hadoop are given. The extension accounts for about 11,601 lines of Java code.
引用
收藏
页码:211 / 235
页数:25
相关论文
共 19 条
  • [1] Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce
    Aji, Ablimit
    Wang, Fusheng
    Vo, Hoang
    Lee, Rubao
    Liu, Qiaoling
    Zhang, Xiaodong
    Saltz, Joel
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (11): : 1009 - 1020
  • [2] A Demonstration of ST-Hadoop: A MapReduce Framework for Big Spatio-temporal Data
    Alarabi, Louai
    Mokbel, Mohamed F.
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (12): : 1961 - 1964
  • [3] [Anonymous], HERMES FRAMEWORK LOC
  • [4] A spatiotemporal algebra in Hadoop for moving objects
    Bakli, Mohamed S.
    Sakrb, Mahmoud A.
    Soliman, Taysir Hassan A.
    [J]. GEO-SPATIAL INFORMATION SCIENCE, 2018, 21 (02) : 102 - 114
  • [5] Indexing the trajectories of moving objects in networks
    de Almeida, V
    Güting, RH
    [J]. GEOINFORMATICA, 2005, 9 (01) : 33 - 60
  • [6] BerlinMOD: a benchmark for moving object databases
    Duntgen, Christian
    Behr, Thomas
    Gueting, Ralf Hartmut
    [J]. VLDB JOURNAL, 2009, 18 (06) : 1335 - 1368
  • [7] Eldawy A, 2015, PROC INT CONF DATA, P1352
  • [8] Forlizzi L, 2000, SIGMOD RECORD, V29, P319, DOI 10.1145/335191.335426
  • [9] Fox A, 2013, IEEE INT CONF BIG DA
  • [10] Indexing objects moving on fixed networks
    Frentzos, E
    [J]. ADVANCES IN SPATIAL AND TEMPORAL DATABASES, PROCEEDINGS, 2003, 2750 : 289 - 305