Cluster ensemble selection and consensus clustering: A multi-objective optimization approach

被引:2
作者
Aktas, Dilay [1 ]
Lokman, Banu [2 ]
Inkaya, Tulin [3 ]
Dejaegere, Gilles [4 ]
机构
[1] Ctr Ind Management, KU Leuven, Celestijnenlaan 300, B-3001 Leuven, Belgium
[2] Univ Portsmouth, Ctr Operat Res & Logist, Sch Org Syst & People, Portsmouth PO1 3DE, England
[3] Bursa Uludag Univ, Dept Ind Engn, TR-16240 Nilufer, Bursa, Turkiye
[4] Univ Libre Bruxelles, Serv Math Gest, Blvd Triomphe CP 210-01, B-1050 Brussels, Belgium
关键词
Multiple objective programming; Cluster ensembles; Ensemble selection; Consensus clustering; QUALITY; DIVERSITY; MODEL;
D O I
10.1016/j.ejor.2023.10.029
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Cluster ensembles have emerged as a powerful tool to obtain clusters of data points by combining a library of clustering solutions into a consensus solution. In this paper, we address the cluster ensemble selection problem and design a multi -objective optimization -based solution framework to produce consensus solutions. Given a library of clustering solutions, we first design a preprocessing procedure that measures the agreement of each clustering solution with the other solutions and eliminates the ones that may mislead the process. We then develop a multi -objective optimization algorithm that selects representative clustering solutions from the preprocessed library with respect to size, coverage, and diversity criteria and combines them into a single consensus solution, for which the true number of clusters is assumed to be unknown. We conduct experiments on different benchmark data sets. The results show that our approach yields more accurate consensus solutions compared to full -ensemble and the existing approaches for most data sets. We also present an application on the customer segmentation problem, where our approach is used to segment customers and to find a consensus solution for each
引用
收藏
页码:1065 / 1077
页数:13
相关论文
共 54 条
[1]   Clustering ensemble selection considering quality and diversity [J].
Abbasi, Sadr-olah ;
Nejatian, Samad ;
Parvin, Hamid ;
Rezaie, Vahideh ;
Bagherifard, Karamolah .
ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (02) :1311-1340
[2]   Hierarchical cluster ensemble selection [J].
Akbari, Ebrahim ;
Dahlan, Halina Mohamed ;
Ibrahim, Roliana ;
Alizadeh, Hosein .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 39 :146-156
[3]   To improve the quality of cluster ensembles by selecting a subset of base clusters [J].
Alizadeh, Hosein ;
Minaei-Bidgoli, Behrouz ;
Parvin, Hamid .
JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2014, 26 (01) :127-150
[4]  
[Anonymous], 2004, P 21 INT C MACH LEAR
[5]  
Ayad H, 2003, LECT NOTES COMPUT SC, V2709, P166
[6]   On voting-based consensus of cluster ensembles [J].
Ayad, Hanan G. ;
Kamel, Mohamed S. .
PATTERN RECOGNITION, 2010, 43 (05) :1943-1953
[7]  
Azimi J, 2009, 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, P992
[8]  
Berkhin P, 2006, GROUPING MULTIDIMENSIONAL DATA: RECENT ADVANCES IN CLUSTERING, P25
[9]   Cluster ensembles: A survey of approaches with recent extensions and applications [J].
Boongoen, Tossapon ;
Iam-On, Natthakan .
COMPUTER SCIENCE REVIEW, 2018, 28 :1-25
[10]   CLUSTER SEPARATION MEASURE [J].
DAVIES, DL ;
BOULDIN, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (02) :224-227