A new semi-supervised clustering technique using multi-objective optimization

被引:0
作者
Abhay Kumar Alok
Sriparna Saha
Asif Ekbal
机构
[1] Indian Institute of Technology,Department of Computer Science and Engineering
来源
Applied Intelligence | 2015年 / 43卷
关键词
Semi-supervised clustering; Multiobjective optimization; Cluster validity index; AMOSA;
D O I
暂无
中图分类号
学科分类号
摘要
Semi-supervised clustering techniques have been proposed in the literature to overcome the problems associated with unsupervised and supervised classification. It considers a small amount of labeled data and the whole data distribution during the process of clustering a data. In this paper, a new approach towards semi-supervised clustering is implemented using multiobjective optimization (MOO) framework. Four objective functions are optimized using the search capability of a multiobjective simulated annealing based technique, AMOSA. These objective functions are based on some unsupervised and supervised information. First three objective functions represent, respectively, the goodness of the partitioning in terms of Euclidean distance, total symmetry present in the clusters and the cluster connectedness. For the last objective function, we have considered different external cluster validity indices, including adjusted rand index, rand index, a newly developed min-max distance based MMI index, NMMI index and Minkowski Score. Results show that the proposed semi-supervised clustering technique can effectively detect the appropriate number of clusters as well as the appropriate partitioning from the data sets having either well-separated clusters of any shape or symmetrical clusters with or without overlaps. Twenty four artificial and five real-life data sets have been used in the evaluation. We develop five different versions of Semi-GenClustMOO clustering technique by varying the external cluster validity indices. Obtained partitioning results are compared with another recently developed multiobjective semi-supervised clustering technique, Mock-Semi. At the end of the paper the effectiveness of the proposed Semi-GenClustMOO clustering technique is shown in segmenting one remote sensing satellite image on the part from the city of Kolkata.
引用
收藏
页码:633 / 661
页数:28
相关论文
共 41 条
[1]  
Alok AK(2014)Development of an external cluster validity index using probabilistic approach and min-max distance IJCISIM 6 494-504
[2]  
Saha S(2011)Multiobjective simulated annealing for fuzzy clustering with stability and validity. Systems, Man, and Cybernetics, Part C: Applications and Reviews IEEE Trans 41 682-691
[3]  
Ekbal A(2002)Genetic clustering for automatic evolution of clusters and application to image classification Pattern Recog 35 1197-1208
[4]  
Bandyopadhyay S(2001)Pixel classification using variable string genetic algorithms with chromosome differentiation. Geoscience and Remote Sensing IEEE Trans 39 303-308
[5]  
Bandyopadhyay S(2007)Gaps: A clustering method using a new point symmetry-based distance measure Pattern Recog 40 3430-3451
[6]  
Maulik U(2008)A point symmetry-based clustering technique for automatic evolution of clusters. Knowledge and Data Engineering IEEE Trans 20 1441-1457
[7]  
Bandyopadhyay S(2008)A simulated annealing-based multiobjective optimization algorithm: Amosa. Evolutionary Computation IEEE Trans 12 269-283
[8]  
Pal SK(2006)Data clustering with partial supervision Data Min Knowl Discov 12 47-78
[9]  
Bandyopadhyay S(2011)Genetic algorithm-tuned entropy-based fuzzy c-means algorithm for obtaining distinct and compact clusters Fuzzy Optim Decis Making 10 153-166
[10]  
Saha S(1936)The use of multiple measurements in taxonomic problems Annals of Eugenics 7 179-188