Multi-Objective Cluster Ensemble based on Filter Refinement Scheme

被引:3
作者
Dai, Dan [1 ,2 ]
Yu, Zhiwen [1 ,3 ]
Huang, Weijie [1 ]
Hu, Yang [4 ]
Chen, C. L. Philip [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510650, Guangdong, Peoples R China
[2] Univ Warwick, Coventry CV4 7AL, England
[3] Pengcheng Lab, Shenzhen 518066, Guangdong, Peoples R China
[4] Univ Oxford, Oxford OX3 7LF, England
关键词
Consensus clustering; cluster ensemble selection; multi-objective optimization; evolutionary algorithm; CLASSIFICATION; EVOLUTIONARY; CLASSIFIERS; ALGORITHMS; PREDICTION; STABILITY; MODELS;
D O I
10.1109/TKDE.2022.3207141
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cluster ensemble improves the robustness and stability of clustering performances by utilizing multiple solutions. Although traditional cluster ensemble methods have achieved promising performances, they are not adaptive enough to cope with data sets that have multiple levels of complexities. Besides, these methods may contain noisy and redundancy members which have negative effects. To mitigate the above issues, in this paper, we propose a multi-objective filter refinement scheme (MOFRS). First, we perform various clustering methods on different representations of data to generate diverse solutions. Second, we propose a solution filter to select a proper method and reduce the number of initial partitions for a given data set. Third, four stability indices are designed to split instances into stable and unstable groups. Fourth, objective functions based on diversity and quality are utilized to quantify the goodness of base clustering solutions. Finally, we design an improvement oriented multi-objective evolutionary algorithm to optimize these objective functions. Extensive experimental results conducted on 27 real-world data sets show that MOFRS outperforms most cluster ensemble selection methods, and achieves statistically significant improvements, compared with full ensemble methods.
引用
收藏
页码:8257 / 8269
页数:13
相关论文
共 64 条
  • [22] García S, 2008, J MACH LEARN RES, V9, P2677
  • [23] From few to many: Illumination cone models for face recognition under variable lighting and pose
    Georghiades, AS
    Belhumeur, PN
    Kriegman, DJ
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (06) : 643 - 660
  • [24] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
    Golub, TR
    Slonim, DK
    Tamayo, P
    Huard, C
    Gaasenbeek, M
    Mesirov, JP
    Coller, H
    Loh, ML
    Downing, JR
    Caligiuri, MA
    Bloomfield, CD
    Lander, ES
    [J]. SCIENCE, 1999, 286 (5439) : 531 - 537
  • [25] Moderate diversity for better cluster ensembles
    Hadjitodorov, Stefan T.
    Kuncheva, Ludmila I.
    Todorova, Ludmila P.
    [J]. INFORMATION FUSION, 2006, 7 (03) : 264 - 275
  • [26] Handl J., 2005, Dimension, V2, P20
  • [27] An evolutionary approach to multiobjective clustering
    Handl, Julia
    Knowles, Joshua
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2007, 11 (01) : 56 - 76
  • [28] Soft Subspace Based Ensemble Clustering for Multivariate Time Series Data
    He, Guoliang
    Jiang, Wenjun
    Peng, Rong
    Yin, Ming
    Han, Min
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7761 - 7774
  • [29] A multi-act sequential game-based multi-objective clustering approach for categorical data
    Heloulou, Imen
    Radjef, Mohammed Said
    Kechadi, Mohand Tahar
    [J]. NEUROCOMPUTING, 2017, 267 : 320 - 332
  • [30] Ho TK, 1998, IEEE T PATTERN ANAL, V20, P832, DOI 10.1109/34.709601