DBSCAN-like clustering method for various data densities

被引:40
作者
Scitovski, Rudolf [1 ]
Sabo, Kristian [1 ]
机构
[1] Univ Osijek, Dept Math, Trg Ljudevita Gaja 6, Osijek 31000, Croatia
关键词
Clustering; DBSCAN; Incremental algorithm; Various data densities; Clusters merging; Least Squares distance-like function; FAST PARTITIONING ALGORITHM; K-MEANS; OPTIMIZATION; TIME;
D O I
10.1007/s10044-019-00809-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a modification of the well-known DBSCAN algorithm, which recognizes clusters with various data densities in a given set of data points A = {a(i) epsilon R-n : i = 1,..., m}. First, we define the parameter MinPts = vertical bar ln vertical bar A vertical bar vertical bar and after that, by using a standard procedure from DBSCAN algorithm, for each a epsilon A we determine radius epsilon(a) of the circle containing MinPts elements from the set.. We group the set of all these radii into the most appropriate number (t) of clusters by using Least Squares distance-like function applying SymDIRECT or SepDIRECT algorithm. In that way, we obtain parameters epsilon(1) >... > epsilon(t). Furthermore, for parameters {MinPts, epsilon(1)} we construct a partition starting with one cluster and then add new clusters for as long as the isolated groups of at least MinPts data points in some circle with radius epsilon(1) exist. We follow a similar procedure for other parameters epsilon(2),..., epsilon(t). After the implementation of the algorithm, a larger number of clusters appear than can be expected in the optimal partition. Along with defined criteria, some of them are merged by applying a merging process for which a detailed algorithm has been written. Compared to the standard DBSCAN algorithm, we show an obvious advantage for the case of data with various densities.
引用
收藏
页码:541 / 554
页数:14
相关论文
共 50 条
[1]   EDCircles: A real-time circle detector with a false detection control [J].
Akinlar, Cuneyt ;
Topal, Cihan .
PATTERN RECOGNITION, 2013, 46 (03) :725-740
[2]   An incremental method combining density clustering and support vector machines for voice pathology detection [J].
Amami, Rimah ;
Smiti, Abir .
COMPUTERS & ELECTRICAL ENGINEERING, 2017, 57 :257-265
[3]   G-DBSCAN: A GPU Accelerated Algorithm for Density-based Clustering [J].
Andrade, Guilherme ;
Ramos, Gabriel ;
Madeira, Daniel ;
Sachetto, Rafael ;
Ferreira, Renato ;
Rocha, Leonardo .
2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 :369-378
[4]  
Ankerst M, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P49
[5]  
[Anonymous], DATA CLUSTERING ALGO
[6]  
[Anonymous], MIN SCI DAT WORKSH 2
[7]   Fast modified global k-means algorithm for incremental cluster construction [J].
Bagirov, Adil M. ;
Ugon, Julien ;
Webb, Dean .
PATTERN RECOGNITION, 2011, 44 (04) :866-876
[8]   Efficient incremental density-based algorithm for clustering large datasets [J].
Bakr, Ahmad M. ;
Ghanem, Nagia M. ;
Ismail, Mohamed A. .
ALEXANDRIA ENGINEERING JOURNAL, 2015, 54 (04) :1147-1154
[9]  
Bezdek J., 2005, Fuzzy Models and Algorithms for Pattern Recognition and Image Processing
[10]   ST-DBSCAN: An algorithm for clustering spatial-temp oral data [J].
Birant, Derya ;
Kut, Alp .
DATA & KNOWLEDGE ENGINEERING, 2007, 60 (01) :208-221