A comprehensive study of clustering ensemble weighting based on cluster quality and diversity

Cited by: 60
Authors
Nazari, Ahmad [1]
Dehghan, Ayob [1]
Nejatian, Samad [2, 3]
Rezaie, Vahideh [3, 4]
Parvin, Hamid [1, 5]
Affiliations
[1] Islamic Azad Univ, Yasooj Branch, Dept Comp Engn, Yasuj, Iran
[2] Islamic Azad Univ, Yasooj Branch, Dept Elect Engn, Yasuj, Iran
[3] Islamic Azad Univ, Yasooj Branch, Young Researchers & Elite Club, Yasuj, Iran
[4] Islamic Azad Univ, Yasooj Branch, Dept Math, Yasuj, Iran
[5] Islamic Azad Univ, Young Researchers & Elite Club, Nourabad Mamasani Branch, Nourabad, Mamasani, Iran
Keywords
Data clustering; Clustering ensemble; Consensus function; Weighting; COMBINING MULTIPLE CLUSTERINGS; TRANSFER DISTANCE; SELECTION; CONSENSUS; PARTITIONS;
DOI
10.1007/s10044-017-0676-x
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Clustering, a major task in data mining, is responsible for discovering hidden patterns in unlabeled datasets. Finding the best clustering is also considered one of the most challenging problems in data mining. Because of the complexity of the problem and the weaknesses of primary clustering algorithms, a large part of the research has been directed toward ensemble clustering methods. Ensemble clustering aggregates a pool of base clusterings and produces an output clustering, also called the consensus clustering. The consensus clustering is usually better than the output clusterings of the basic clustering algorithms. However, a lack of quality in the base clusterings makes their consensus clustering weak. Although some research has addressed selecting a subset of high-quality base clusterings based on a clustering assessment metric, cluster-level selection has always been ignored. In this paper, a new clustering ensemble framework based on cluster-level weighting is proposed. The reliability of a cluster is taken to be the amount of certainty that the given ensemble has about that cluster, which is computed from the accretion amount of that cluster by the ensemble. The final ensemble is then created by selecting the best clusters and assigning a weight to each selected cluster based on its reliability. Next, the paper proposes a cluster-level weighting co-association matrix in place of the traditional co-association matrix. Finally, two consensus functions are introduced and used to produce the consensus partition. In experiments, the proposed framework outperforms the state-of-the-art clustering ensemble methods.
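To make the idea of a cluster-level weighted co-association matrix concrete, here is a minimal Python sketch. It is an illustration under assumptions, not the paper's implementation: the function name `weighted_co_association`, the averaging over base clusterings, and the per-cluster weights are all hypothetical, and the reliability scores are taken as given inputs because the abstract does not state how the accretion amount is computed.

```python
def weighted_co_association(labelings, cluster_weights):
    """Build a cluster-level weighted co-association matrix (illustrative sketch).

    labelings: list of label sequences, one per base clustering, all of length n.
    cluster_weights: list of dicts (one per base clustering) mapping a cluster
        label to a hypothetical reliability weight in [0, 1]. A cluster that is
        absent from its dict is treated as "not selected" and contributes 0.
    """
    n = len(labelings[0])
    m = len(labelings)
    co = [[0.0] * n for _ in range(n)]
    for labels, weights in zip(labelings, cluster_weights):
        for i in range(n):
            for j in range(n):
                # Two points co-occur when they share a cluster; the vote is
                # scaled by that cluster's weight instead of a constant 1.
                if labels[i] == labels[j]:
                    co[i][j] += weights.get(labels[i], 0.0) / m
    return co


# Toy ensemble: two base clusterings of three points, with assumed weights.
labelings = [[0, 0, 1], [0, 1, 1]]
cluster_weights = [{0: 1.0, 1: 0.5}, {0: 0.5, 1: 1.0}]
co = weighted_co_association(labelings, cluster_weights)
```

With every weight set to 1.0 this reduces to the traditional co-association matrix, that is, the fraction of base clusterings that place two points in the same cluster. A consensus partition could then be derived from `co`, for example by hierarchical clustering on `1 - co[i][j]`; that choice is also an assumption, since the abstract does not name the two consensus functions.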
Pages: 133-145
Page count: 13
Related papers
50 in total
  • [31] A comparative study of clustering ensemble algorithms
    Wu, Xiuge
    Ma, Tinghuai
    Cao, Jie
    Tian, Yuan
    Alabdulkarim, Alia
    COMPUTERS & ELECTRICAL ENGINEERING, 2018, 68 : 603 - 615
  • [32] A novel member enhancement-based clustering ensemble algorithm
    He, Yulin
    Yang, Jin
    Cheng, Yingchao
    Du, Xueqin
    Huang, Joshua Zhexue
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (10)
  • [33] Adaptive Ensemble Clustering With Boosting BLS-Based Autoencoder
    Shi, Yifan
    Yang, Kaixiang
    Yu, Zhiwen
    Chen, C. L. Philip
    Zeng, Huanqiang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (12) : 12369 - 12383
  • [34] Fuzzy Clustering Ensemble Considering Cluster Dependability
    Chen, Zhong
    Bagherinia, Ali
    Minaei-Bidgoli, Behrooz
    Parvin, Hamid
    Pho, Kim-Hung
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2021, 30 (02)
  • [35] STABLE CLUSTERING ENSEMBLE BASED ON EVIDENCE THEORY
    Fu, Haijie
    Yue, Xiaodong
    Liu, Wei
    Denoeux, Thierry
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2046 - 2050
  • [36] A Clustering-Ensemble Approach Based on Voting
    Meng, Fanrong
    Tong, Xuejiao
    Wang, Zhixiao
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT I, 2011, 7002 : 421 - 427
  • [37] Hybrid Sampling-Based Clustering Ensemble With Global and Local Constitutions
    Yang, Yun
    Jiang, Jianmin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (05) : 952 - 965
  • [38] An ensemble density-based clustering method
    Xia, Luning
    Jing, Jiwu
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (ISKE 2007), 2007,
  • [39] Clustering ensemble based on sample's stability
    Li, Feijiang
    Qian, Yuhua
    Wang, Jieting
    Dang, Chuangyin
    Jing, Liping
    ARTIFICIAL INTELLIGENCE, 2019, 273 : 37 - 55
  • [40] A Multicriteria Clustering Approach Based on Similarity Indices and Clustering Ensemble Techniques
    Rouba, Baroudi
    Bahloul, Safia Nait
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2014, 13 (04) : 811 - 837