Multi-dimensional geospatial data mining in a distributed environment using MapReduce

被引:17
作者
Alkathiri, Mazin [1 ]
Jhummarwala, Abdul [2 ]
Potdar, M. B. [2 ]
机构
[1] Univ Sci & Technol, Adm Sci Coll Hadhramout, Hadhramout, Yemen
[2] Bhaskaracharya Inst Space Applicat & Geoinformat, Gandhinagar 382007, India
关键词
Multiband raster processing; Multi-dimensional data processing; Geospatial processing; Spectral to geometrical space; K-means clustering; HADOOP;
D O I
10.1186/s40537-019-0245-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data mining and machine learning techniques for processing raster data consider a single spectral band of data at a time. The individual results are combined to obtain the final output. The essence of related multi-spectral information is lost when the bands are considered independently. The proposed platform is based on Apache Hadoop ecosystem and supports performing analysis on large amounts of multispectral raster data using MapReduce. A novel technique of transforming the spectral space to the geometrical space is also proposed. The technique allows to consider multiple bands coherently. The results of clustering 10(6) pixels for multiband imagery with widely used GIS software have been tested and other machine learning methods are planned to be incorporated in the platform. The platform is scalable to support tens of spectral bands. The results from our platform were found to be better and are also available faster due to application of distributed processing.
引用
收藏
页数:34
相关论文
共 69 条
[1]  
Abdul J, 2016, 2016 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION, & AUTOMATION (ICACCA) (FALL), P22
[2]   A k-mean clustering algorithm for mixed numeric and categorical data [J].
Ahmad, Amir ;
Dey, Lipika .
DATA & KNOWLEDGE ENGINEERING, 2007, 63 (02) :503-527
[3]   ST-Hadoop: a MapReduce framework for spatio-temporal data [J].
Alarabi, Louai ;
Mokbel, Mohamed F. ;
Musleh, Mashaal .
GEOINFORMATICA, 2018, 22 (04) :785-813
[4]  
Alkathiri M., 2016, INT J COMPUTER APPL, V135, P28
[5]  
[Anonymous], 1990, TECHNICAL REPORT
[6]  
BEDARD Y., 2001, Geographic Data Mining and knowledge discovery, P53, DOI DOI 10.4324/9780203468029_CHAPTER_3
[7]  
Bennett J., 2010, OpenStreetMap
[8]   Ontop of Geospatial Databases [J].
Bereta, Konstantina ;
Koubarakis, Manolis .
SEMANTIC WEB - ISWC 2016, PT I, 2016, 9981 :37-52
[9]   Clustering of Maxima: Spatial Dependencies among Heavy Rainfall in France [J].
Bernard, Elsa ;
Naveau, Philippe ;
Vrac, Mathieu ;
Mestre, Olivier .
JOURNAL OF CLIMATE, 2013, 26 (20) :7929-7937
[10]  
Bhosale H.S., 2014, Int J Sci Res, V4, P1