Multi-Objective Cluster Ensemble based on Filter Refinement Scheme

Times cited: 3
Authors
Dai, Dan [1, 2]
Yu, Zhiwen [1, 3]
Huang, Weijie [1]
Hu, Yang [4]
Chen, C. L. Philip [1]
Affiliations
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510650, Guangdong, Peoples R China
[2] Univ Warwick, Coventry CV4 7AL, England
[3] Pengcheng Lab, Shenzhen 518066, Guangdong, Peoples R China
[4] Univ Oxford, Oxford OX3 7LF, England
Keywords
Consensus clustering; cluster ensemble selection; multi-objective optimization; evolutionary algorithm; CLASSIFICATION; EVOLUTIONARY; CLASSIFIERS; ALGORITHMS; PREDICTION; STABILITY; MODELS
DOI
10.1109/TKDE.2022.3207141
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Cluster ensembles improve the robustness and stability of clustering performance by combining multiple solutions. Although traditional cluster ensemble methods have achieved promising performance, they are not adaptive enough to cope with data sets that have multiple levels of complexity. In addition, their ensembles may contain noisy and redundant members that have negative effects. To mitigate these issues, we propose a multi-objective filter refinement scheme (MOFRS). First, we apply various clustering methods to different representations of the data to generate diverse solutions. Second, we propose a solution filter that selects a proper method and reduces the number of initial partitions for a given data set. Third, we design four stability indices to split instances into stable and unstable groups. Fourth, we use objective functions based on diversity and quality to quantify the goodness of base clustering solutions. Finally, we design an improvement-oriented multi-objective evolutionary algorithm to optimize these objective functions. Extensive experiments on 27 real-world data sets show that MOFRS outperforms most cluster ensemble selection methods and achieves statistically significant improvements over full ensemble methods.
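The idea of scoring ensemble members by quality and diversity and keeping only the non-dominated selections can be illustrated with a small sketch. This is not the MOFRS implementation: the abstract does not define the objective functions, so the sketch assumes NMI-based ones (quality as mean agreement with the full ensemble, diversity as one minus mean pairwise agreement inside the selection), and it replaces the evolutionary search with exhaustive enumeration, which is only feasible for tiny ensembles. All function names are hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import log


def nmi(a, b):
    """Normalized mutual information of two labelings (geometric-mean normalization)."""
    n = len(a)
    ca, cb, cab = Counter(a), Counter(b), Counter(zip(a, b))
    ha = -sum(c / n * log(c / n) for c in ca.values())
    hb = -sum(c / n * log(c / n) for c in cb.values())
    if ha == 0.0 or hb == 0.0:
        # A single-cluster labeling carries no information.
        return 1.0 if ha == hb else 0.0
    mi = sum(c / n * log(c * n / (ca[x] * cb[y])) for (x, y), c in cab.items())
    return mi / (ha * hb) ** 0.5


def quality(subset, ensemble):
    """Assumed objective: mean NMI of selected partitions against the full ensemble."""
    total = sum(nmi(ensemble[i], p) for i in subset for p in ensemble)
    return total / (len(subset) * len(ensemble))


def diversity(subset, ensemble):
    """Assumed objective: one minus the mean pairwise NMI inside the selection."""
    pairs = list(combinations(subset, 2))
    return 1.0 - sum(nmi(ensemble[i], ensemble[j]) for i, j in pairs) / len(pairs)


def dominates(u, v):
    """True if u is at least as good as v on every objective and better on one."""
    return all(x >= y for x, y in zip(u, v)) and any(x > y for x, y in zip(u, v))


def pareto_front(ensemble, min_size=2):
    """Enumerate all selections (stand-in for an evolutionary search) and keep the non-dominated ones."""
    scored = {
        s: (quality(s, ensemble), diversity(s, ensemble))
        for k in range(min_size, len(ensemble) + 1)
        for s in combinations(range(len(ensemble)), k)
    }
    return {
        s: f for s, f in scored.items()
        if not any(dominates(g, f) for g in scored.values())
    }


# Toy ensemble of four base partitions over six instances (illustrative data only).
ensemble = [
    [0, 0, 0, 1, 1, 1],
    [1, 1, 1, 0, 0, 0],  # same partition as the first, labels swapped
    [0, 0, 1, 1, 2, 2],
    [0, 1, 0, 1, 0, 1],  # disagrees with most of the others
]
front = pareto_front(ensemble)
for members, (q, d) in sorted(front.items()):
    print(members, round(q, 3), round(d, 3))
```

Each printed row is one selection of base partitions with its (quality, diversity) pair. In practice the number of selections grows exponentially with ensemble size, which is why a heuristic search such as a multi-objective evolutionary algorithm is used instead of enumeration.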
Pages: 8257-8269
Number of pages: 13