Semi-supervised hierarchical ensemble clustering based on an innovative distance metric and constraint information

被引:6
|
作者
Shen, Baohua [1 ]
Jiang, Juan [1 ]
Qian, Feng [1 ]
Li, Daoguo [1 ]
Ye, Yanming [1 ]
Ahmadi, Gholamreza [2 ]
机构
[1] Hangzhou Dianzi Univ Informat Engn Coll, Sch Management, Hangzhou 311035, Zhejiang, Peoples R China
[2] Persian Gulf Univ, Dept Comp Engn, Bushehr, Iran
关键词
Ensemble clustering; AHC; Semi-supervised clustering; Distance metric; Information constraints; SCHEME;
D O I
10.1016/j.engappai.2023.106571
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Agglomerative Hierarchical Clustering (AHC) is a bottom-up clustering strategy in which each object is originally a cluster, and more pairs of clusters are formed by traversing the hierarchy. It has been proven that there is no individual AHC clustering algorithm that can be efficient in all situations. In order to address this problem, ensemble clustering techniques have been introduced. These techniques combine the results of several output partitions to achieve a consensus with higher accuracy compared to an individual clustering algorithm. This paper proposes an AHC-based ensemble semi-supervised clustering algorithm to improve performance. In semi-supervised clustering, class membership information is used in some objects. Here, we introduce the Semi-Supervised Ensemble Hierarchical Clustering based on Constraints Information (SSEHCCI) algorithm. SSEHCCI is developed using several individual clustering algorithms based on AHC. SSEHCCI includes a flexible weighting policy to generate base partitions and uses the constraints information to configure the semi-supervised clustering. In addition, SSEHCCI uses an innovative distance measure to calculate the distance between each pair of objects. Experimental results show that SSEHCCI performs better than existing semi -supervised algorithms on some University of California Irvine (UCI) datasets. Specifically, we observed an average accuracy of SSEHCCI compared to SSDC and RSSC of 2.6% and 1.8%, respectively.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Semi-supervised Selective Clustering Ensemble based on constraint information
    Ma, Tinghuai
    Zhang, Zheng
    Guo, Lei
    Wang, Xin
    Qian, Yurong
    Al-Nabhan, Najla
    NEUROCOMPUTING, 2021, 462 : 412 - 425
  • [2] Combined constraint-based with metric-based in semi-supervised clustering ensemble
    Siting Wei
    Zhixin Li
    Canlong Zhang
    International Journal of Machine Learning and Cybernetics, 2018, 9 : 1085 - 1100
  • [3] Combined constraint-based with metric-based in semi-supervised clustering ensemble
    Wei, Siting
    Li, Zhixin
    Zhang, Canlong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (07) : 1085 - 1100
  • [4] Semi-Supervised Ensemble Clustering Based on Selected Constraint Projection
    Yu, Zhiwen
    Luo, Peinan
    Liu, Jiming
    Wong, Hau-San
    You, Jane
    Han, Guoqiang
    Zhang, Jun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (12) : 2394 - 2407
  • [5] Constraint projections for semi-supervised spectral clustering ensemble
    Yang, Jingya
    Sun, Linfu
    Wu, Qishi
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (20):
  • [6] Semi-supervised hierarchical clustering ensemble and its application
    Xiao, Wenchao
    Yang, Yan
    Wang, Hongjun
    Li, Tianrui
    Xing, Huanlai
    NEUROCOMPUTING, 2016, 173 : 1362 - 1376
  • [7] A semi-supervised hierarchical ensemble clustering framework based on a novel similarity metric and stratified feature sampling
    Shi, Hui
    Peng, Qiang
    Xie, Zhiming
    Wang, Jian
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (08)
  • [8] A Comparison of Distance Metrics in Semi-supervised Hierarchical Clustering
    Aljohani, Abeer
    Lai, Daphne Teck Ching
    Bell, Paul C.
    Edirisinghe, Eran A.
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2017, PT III, 2017, 10363 : 719 - 731
  • [9] Kernelized evolutionary distance metric learning for semi-supervised clustering
    Kalintha, Wasin
    Ono, Satoshi
    Numao, Masayuki
    Fukui, Ken-ichi
    INTELLIGENT DATA ANALYSIS, 2019, 23 (06) : 1271 - 1297
  • [10] Kernelized Evolutionary Distance Metric Learning for Semi-Supervised Clustering
    Kalintha, Wasin
    Ono, Satoshi
    Numao, Masayuki
    Fukui, Ken-ichi
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4945 - 4946