Evolutionary multi-objective optimization based overlapping subspace clustering

Cited by: 2
Authors
Paul, Dipanjyoti [1]
Saha, Sriparna [1]
Kumar, Abhishek [1]
Mathew, Jimson [1]
Affiliations
[1] Indian Institute of Technology Patna, Patna, Bihar, India
Keywords
Subspace clustering; Multi-objective optimization; ICC-index; PSM-index; MNR-index; Genetic algorithm; Selection
DOI
10.1016/j.patrec.2021.02.012
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Subspace clustering techniques divide a data set into groups, where each group is represented by a subset of features, known as its subspace feature set, that is relevant to the objects in the group. The grouping is performed so that similar objects are placed in the same group and dissimilar objects in different groups. Most previous subspace clustering methods do not allow an object to belong to more than one cluster, although in many real-life situations an object may do so. Moreover, subspace clustering algorithms developed in the past are based on a single-objective optimization framework, which limits them to optimizing only one particular shape or property of the clusters. To address both issues, we have developed an evolutionary overlapping subspace clustering method using a multi-objective optimization framework. Various mutation operators are used to explore the search space effectively. The objectives optimized simultaneously in this algorithm are the ICC-index, the MNR-index and the PSM-index. The developed algorithm is evaluated on 7 real-life and 16 synthetic data sets. In addition, to check the efficiency of using multiple objectives, the proposed algorithm is tested on 3 big data sets. An application of the proposed method to bi-clustering of gene expression data is also shown. The results obtained on these 23 data sets and 3 big data sets are compared with those of many state-of-the-art algorithms. The comparative study illustrates the efficacy of the proposed algorithm over these state-of-the-art algorithms. © 2021 Elsevier B.V. All rights reserved.
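To illustrate what optimizing several objectives simultaneously means in practice, the sketch below filters a set of candidate solutions down to its non-dominated (Pareto) set. It is not the authors' algorithm: the function names `dominates` and `pareto_front` are hypothetical, the three numbers per candidate are made-up placeholders rather than real ICC-, MNR- or PSM-index values, and treating every objective as "lower is better" is an assumption made only for this example.

```python
from typing import List, Sequence


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if `a` Pareto-dominates `b`.

    Assumption for this sketch: every objective is minimized.
    `a` dominates `b` when it is no worse on all objectives
    and strictly better on at least one.
    """
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def pareto_front(scores: List[Sequence[float]]) -> List[int]:
    """Return the indices of candidates that no other candidate dominates."""
    front = []
    for i, s in enumerate(scores):
        dominated = any(
            dominates(t, s) for j, t in enumerate(scores) if j != i
        )
        if not dominated:
            front.append(i)
    return front


# Hypothetical objective vectors for four candidate clusterings
# (placeholder numbers, three objectives each, all minimized).
candidates = [
    (0.2, 0.5, 0.3),
    (0.3, 0.6, 0.4),  # worse than the first candidate on every objective
    (0.1, 0.9, 0.2),
    (0.4, 0.4, 0.5),
]

front = pareto_front(candidates)
print(front)
```

In this toy input the second candidate is dominated by the first, while the other three each trade off one objective against another and therefore all remain on the front. A multi-objective evolutionary method returns such a set of trade-off solutions rather than a single optimum.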
Pages: 208-215
Page count: 8