Clustering Algorithm for Multi-density Datasets

被引:0
作者
Fahim, Ahmed [1 ,2 ]
机构
[1] Prince Sattam Bin Abdulaziz Univ, Fac Sci & Humanitarian Study, Al Aflaj, Saudi Arabia
[2] Suez Univ, Fac Comp & Informat, Suez, Egypt
来源
ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY | 2019年 / 22卷 / 3-4期
关键词
Clustering methods; Data analysis; Data mining; Knowledge discovery; Un-supervised learning; EFFICIENT ALGORITHM;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
DBSCAN (Density-Based Spatial Clustering of Applications with Noise) algorithm is density-based clustering method. It discovers clusters with varied shapes, sizes and handles noise. But it fails to discover clusters of varied density. This problem arises due to its dependency on global parameters especially Eps (represents neighborhood radius for each point in dataset). This paper introduces very simple idea to deal with this problem. The idea is steamed from density-based methods especially DENCLUE (DENsity-based CLUstEring), DBSCAN algorithm and k-nearest neighbors. The proposed method estimates local density -for each point in dataset- as the sum of distances to the k-nearest neighbor, arranges points in ascending order based on local density. The algorithm starts the clustering process from the highest density point by adding un-clustered points that have similar density as first point in cluster. Similar means there is small variance in density between the current point and the first point in cluster. Also, the point is assigned to current cluster if the sum of distances to its Minpts-nearest neighbors is less than or equal to the density of first point (core point condition in DBSCAN). Experimental results show the efficiency of the proposed method in discovering varied density clusters from data.
引用
收藏
页码:244 / 258
页数:15
相关论文
共 24 条
[1]  
Ankerst M, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P49
[2]  
[Anonymous], 1994, P 20 INT C VER LARG
[3]  
[Anonymous], 2014 WORLD C COMP AP
[4]  
[Anonymous], 2007, P ICSSSM 07 2007 INT
[5]  
[Anonymous], 2012, INT J ADV RES COMPUT
[6]  
Ashour W, 2011, LECT NOTES COMPUT SC, V6936, P446, DOI 10.1007/978-3-642-23878-9_53
[7]   A k-Deviation Density Based Clustering Algorithm [J].
Chen Jungan ;
Chen Jinyin ;
Yang Dongyong ;
Li Jun .
MATHEMATICAL PROBLEMS IN ENGINEERING, 2018, 2018
[8]   GMDBSCAN: Multi-Density DBSCAN Cluster Based on Grid [J].
Chen Xiaoyun ;
Min Yufang ;
Zhao Yan ;
Wang Ping .
PROCEEDINGS OF THE ICEBE 2008: IEEE INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING, 2008, :780-783
[9]  
Debnath Madhuri, 2015, 2015 International Workshop on Data Mining with Industrial Applications (DMIA). Proceedings, P51, DOI 10.1109/DMIA.2015.14
[10]   EFFICIENT ALGORITHM FOR A COMPLETE LINK METHOD [J].
DEFAYS, D .
COMPUTER JOURNAL, 1977, 20 (04) :364-366