Distributed Clustering Algorithm for Spatial Data Mining

被引:0
作者
Bendechache, Malika [1 ]
Kechadi, M-Tahar [1 ]
机构
[1] Univ Coll Dublin, Sch Comp Sci & Informat, Dublin 04, Ireland
来源
PROCEEDINGS 2015 SECOND IEEE INTERNATIONAL CONFERENCE ON SPATIAL DATA MINING AND GEOGRAPHICAL KNOWLEDGE SERVICES (ICSDM 2015) | 2015年
关键词
Spatial data; Clustering; Distributed mining; Data analysis; K-means; SHAPE; POINTS; SET;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed data mining techniques and mainly distributed clustering are widely used in the last decade because they deal with very large and heterogeneous datasets which cannot be gathered centrally. Current distributed clustering approaches are normally generating global models by aggregating local results that are obtained on each site. While this approach mines the datasets on their locations the aggregation phase is complex, which may produce incorrect and ambiguous global clusters and therefore incorrect knowledge. In this paper we propose a new clustering approach for very large spatial datasets that are heterogeneous and distributed. The approach is based on K-means Algorithm but it generates the number of global clusters dynamically. Moreover, this approach uses an elaborated aggregation phase. The aggregation phase is designed in such a way that the overall process is efficient in time and memory allocation. Preliminary results show that the proposed approach produces high quality results and scales up well. We also compared it to two popular clustering algorithms and show that this approach is much more efficient.
引用
收藏
页码:60 / 65
页数:6
相关论文
共 34 条
[1]   Parallel mining of association rules [J].
Agrawal, R ;
Shafer, JC .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (06) :962-969
[2]  
[Anonymous], P P INT C COMP GRAPH
[3]  
[Anonymous], MINING VERY LARGE DA
[4]  
[Anonymous], ADV DATABASE TECHNOL
[5]  
[Anonymous], ALGORITHMS COMPUTATI
[6]  
[Anonymous], LARG SCAL PAR DAT MI
[7]  
[Anonymous], P 12 INT C MACH LEAR
[8]  
[Anonymous], LNCS LNAI
[9]  
[Anonymous], KNOWLEDGE INFORM SYS
[10]  
[Anonymous], 2 INT C KD DM PORTL