UlTraMan: A Unified Platform for Big Trajectory Data Management and Analytics

被引:72
作者
Ding, Xin [1 ,2 ]
Chen, Lu [3 ]
Gao, Yunjun [1 ,2 ]
Jensen, Christian S. [3 ]
Bao, Hujun [1 ,2 ]
机构
[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci, Hangzhou, Zhejiang, Peoples R China
[3] Aalborg Univ, Dept Comp Sci, Aalborg, Denmark
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2018年 / 11卷 / 07期
关键词
D O I
10.14778/3192965.3192970
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Massive trajectory data is being generated by GPS-equipped devices, such as cars and mobile phones, which is used increasingly in transportation, location-based services, and urban computing. As a result, a variety of methods have been proposed for trajectory data management and analytics. However, traditional systems and methods are usually designed for very specific data management or analytics needs, which forces users to stitch together heterogeneous systems to analyze trajectory data in an inefficient manner. Targeting the overall data pipeline of big trajectory data management and analytics, we present a unified platform, termed as UlTraMan. In order to achieve scalability, efficiency, persistence, and flexibility, (i) we extend Apache Spark with respect to both data storage and computing by seamlessly integrating a key-value store, and (ii) we enhance the MapReduce paradigm to allow flexible optimizations based on random data access. We study the resulting systems flexibility using case studies on data retrieval, aggregation analyses, and pattern mining. Extensive experiments on real and synthetic trajectory data are reported to offer insight into the scalability and performance of UlTraMan.
引用
收藏
页码:787 / 799
页数:13
相关论文
共 35 条
[1]  
[Anonymous], 1973, Cartographica: the international journal for geographic information and geovisualization, DOI [DOI 10.3138/FM57-6770-U75U-7727, 10.3138/FM57-6770-U75U-7727]
[2]   Spark SQL: Relational Data Processing in Spark [J].
Armbrust, Michael ;
Xin, Reynold S. ;
Lian, Cheng ;
Huai, Yin ;
Liu, Davies ;
Bradley, Joseph K. ;
Meng, Xiangrui ;
Kaftan, Tomer ;
Franklint, Michael J. ;
Ghodsi, Ali ;
Zaharia, Matei .
SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, :1383-1394
[3]   PIST: An efficient and practical indexing technique for historical spatio-temporal point data [J].
Botea, Viorica ;
Mallett, Daniel ;
Nascimento, Mario A. ;
Sander, Joerg .
GEOINFORMATICA, 2008, 12 (02) :143-168
[4]   A framework for generating network-based moving objects [J].
Brinkhoff, T .
GEOINFORMATICA, 2002, 6 (02) :153-180
[5]  
Chen Z., 2010, P ACM SIGMOD INT C M, P255
[6]   TrajS']jStore: An Adaptive Storage System for Very Large Trajectory Data Sets [J].
Cudre-Mauroux, Philippe ;
Wu, Eugene ;
Madden, Samuel .
26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, :109-120
[7]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[8]   BerlinMOD: a benchmark for moving object databases [J].
Duntgen, Christian ;
Behr, Thomas ;
Gueting, Ralf Hartmut .
VLDB JOURNAL, 2009, 18 (06) :1335-1368
[9]  
Eldawy Ahmed, 2015, 2015 IEEE 31st International Conference on Data Engineering (ICDE), P1352, DOI 10.1109/ICDE.2015.7113382
[10]  
Ester M., 1996, KDD-96 Proceedings. Second International Conference on Knowledge Discovery and Data Mining, P226