Mining spatiotemporal co-occurrence patterns in non-relational databases

被引:11
作者
Aydin, Berkay [1 ]
Akkineni, Vijay [1 ]
Angryk, Rafal [1 ]
机构
[1] Georgia State Univ, Dept Comp Sci, 25 Pk Pl,Suite 700, Atlanta, GA 30303 USA
基金
美国国家科学基金会;
关键词
Spatiotemporal Pattern mining; Non-relational databases; DATA SETS;
D O I
10.1007/s10707-016-0255-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Spatiotemporal co-occurrence patterns (STCOPs) represent the subsets of feature types whose instances are frequently co-occurring both in space and time. Spatiotemporal co-occurrences reflect the spatiotemporal overlap relationships among two or more spatiotemporal instances both in spatial and temporal dimensions. STCOPs can be potentially used to predict and understand the generation and evolution of different types of interacting phenomena in various scientific fields such as astronomy, meteorology, biology, geosciences. Meaningful and statistically significant data analysis for these scientific fields requires processing sufficiently large datasets. Due to the computationally expensive nature of spatiotemporal operations required for mining spatiotemporal co-occurrences, it is increasingly difficult to identify spatiotemporal co-occurrences and discover STCOPs in centralized system settings. As a solution, we developed a cloud-based distributed mining system for discovering STCOPs. Our system uses Accumulo, a column-oriented non-relational database management system as its backbone. In order to efficiently mine the STCOPs, we propose three data models for managing trajectory-based spatiotemporal data in Accumulo. We introduce an in-memory join-index structure and a join algorithm for effectively performing spatiotemporal join operations on spatiotemporal trajectories in non-relational databases. Lastly, with the experiments with artificial and real life datasets, we evaluate the performance of the proposed models for STCOP mining.
引用
收藏
页码:801 / 828
页数:28
相关论文
共 34 条
[1]  
Agouris P, 2012, TECH REP
[2]  
Agrawal R., P 20 INT C VERY LARG
[3]  
Andrienko N., 2007, CARTOGRAPHICA V42 2, P117, DOI DOI 10.3138/CART0.42.2.117
[4]  
[Anonymous], 2008, Introduction to information retrieval
[5]  
[Anonymous], 2005, USING CLIMATE PREDIC
[6]   A View of Cloud Computing [J].
Armbrust, Michael ;
Fox, Armando ;
Griffith, Rean ;
Joseph, Anthony D. ;
Katz, Randy ;
Konwinski, Andy ;
Lee, Gunho ;
Patterson, David ;
Rabkin, Ariel ;
Stoica, Ion ;
Zaharia, Matei .
COMMUNICATIONS OF THE ACM, 2010, 53 (04) :50-58
[7]   Mining spatiotemporal co-occurrence patterns in solar datasets [J].
Aydin, B. ;
Kempton, D. ;
Akkineni, V. ;
Angryk, R. ;
Pillai, K. G. .
ASTRONOMY AND COMPUTING, 2015, 13 :136-144
[8]  
Aydin Berkay, 2014, 2014 IEEE International Conference on Big Data (Big Data), P1, DOI 10.1109/BigData.2014.7004398
[9]  
Aydin B., 2014, P 27 INT FLOR ART IN
[10]  
Burrows M, 2006, USENIX ASSOCIATION 7TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P335