Data clustering with stochastic cellular automata

被引:5
作者
Dundar, Enes Burak [1 ]
Korkmaz, Emin Erkan [1 ]
机构
[1] Yeditepe Univ, Dept Comp Engn, Istanbul, Turkey
关键词
Cellular automata; clustering; big data; GENETIC ALGORITHM; DIFFUSION;
D O I
10.3233/IDA-173488
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data clustering is a well studied problem, where the aim is to partition a group of data instances into a number of clusters. Various methods have been proposed for the problem. K-means and its variants are the most well known examples. A common characteristic shared by the clustering algorithms is that they are all based on distance calculations between data points, or between data points and centroids. Hence, the efficiency of the proposed methods decline when big data is clustered. Clustering algorithms based on cellular automata have also been proposed in the literature. However, these methods are based on distance calculations, too. In this study, a new approach is proposed for the clustering problem. The method is based on the formation of clusters in a cellular automata by the interaction of neighborhood cells. The data points are mapped to fixed cellular automata cells, and the clusters are formed in a parallel fashion. The initial clusters formed spread in the cellular automata by uniting neighborhood cells in the same cluster. The rules utilized to compose clusters in the automata are inspired by the heat transfer process in nature. No distance calculation is used during the procedure. Therefore, it is possible to cluster huge datasets within a reasonable amount of time with the method proposed.
引用
收藏
页码:735 / 750
页数:16
相关论文
共 26 条
[1]  
Adwan O., 2013, INT REV COMPUTERS SO
[2]  
[Anonymous], 1996, PROC 2 INT C KNOWLED, DOI DOI 10.5555/3001460.3001507
[3]  
[Anonymous], 1991, SFI STUD SCI COMPLEX
[4]  
[Anonymous], 2007, ACM Transactions on Knowledge Discovery from Data, DOI DOI 10.1145/1217299.1217303
[5]   Looking for natural patterns in data - Part 1. Density-based approach [J].
Daszykowski, M ;
Walczak, B ;
Massart, DL .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2001, 56 (02) :83-92
[6]   Data clustering using a linear cellular automata-based algorithm [J].
de Lope, Javier ;
Maravall, Dario .
NEUROCOMPUTING, 2013, 114 :86-91
[7]   CELLULAR AUTOMATA APPROACHES TO BIOLOGICAL MODELING [J].
ERMENTROUT, GB ;
EDELSTEINKESHET, L .
JOURNAL OF THEORETICAL BIOLOGY, 1993, 160 (01) :97-133
[8]   FANTASTIC COMBINATIONS OF JOHN CONWAYS NEW SOLITAIRE GAME LIFE [J].
GARDNER, M .
SCIENTIFIC AMERICAN, 1970, 223 (04) :120-&
[9]  
Handl J., 2004, Tech. Rep.
[10]  
Hartigan J., 1975, CLUSTERING ALGORITHM