An Efficient Method for subjectively choosing parameter 'k' automatically in VDBSCAN (Varied Density Based Spatial Clustering of Applications with Noise) Algorithm

被引:10
作者
Chowdhury, A. K. M. Rasheduzzaman [1 ]
Mollah, Md. Elias [1 ]
Rahman, Md. Asikur [2 ]
机构
[1] Green Univ Bangladesh, Comp Sci & Engn, Dhaka 1206, Bangladesh
[2] Bangladesh Univ Business & Technology, Comp Sci & Engn, Dhaka 1206, Bangladesh
来源
2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 1 | 2010年
关键词
Density based clustering; DBSCAN; VDBSCAN; data mining; center based approach;
D O I
10.1109/ICCAE.2010.5452004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Density based clustering algorithms are one of the primary method for data mining. The clusters which are formed using density clustering are easy to understand and it does limit itself to shapes of clusters. Existing density based algorithms have trouble because they are not capable of finding out all meaningful clusters whenever the density is so much varied. VDBSCAN is introduced to compensate this problem. It is same as DBSCAN (Density Based Spatial Clustering of Applications with Noise) but only the difference is VDBSCAN selects several values of parameter Eps for different densities according to k-dist plot. The problem is the value of parameter k in k-dist plot is user defined. This paper introduces a new method to find out the value of parameter k automatically based on the characteristics of the datasets. In this method we consider spatial distance from a point to all others points in the datasets. The proposed method has potential to find out optimal value for parameter k. In this paper a synthetic database with two dimensional data is used for demonstration.
引用
收藏
页码:38 / 41
页数:4
相关论文
共 7 条
  • [1] AIN AK, 1988, ALGORITHMS CLUSTERIN
  • [2] Ankerst M, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P49
  • [3] [Anonymous], 2006, Introduction to Data Mining
  • [4] [Anonymous], P 4 INT C KNOWL DISC
  • [5] Liu Peng, 2007, VARIED DENSITY BASED
  • [6] Sheikholeslami G., 1998, Proceedings of the Twenty-Fourth International Conference on Very-Large Databases, P428
  • [7] Xue-gang S., 2003, J CHINESE INFORM PRO, V17, P21