Automatic extraction of clusters from hierarchical clustering representations

被引:0
|
作者
Sander, J [1 ]
Qin, XJ [1 ]
Lu, ZY [1 ]
Niu, N [1 ]
Kovarsky, A [1 ]
机构
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING | 2003年 / 2637卷
关键词
hierarchical clustering; OPTICS; single-link method; dendrogram; reachability-plot;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical clustering algorithms are typically more effective in detecting the true clustering structure of a data set than partitioning algorithms. However, hierarchical clustering algorithms do not actually create clusters, but compute only a hierarchical representation, of the data set. This makes them unsuitable as an automatic pre-processing step for other algorithms that operate on detected clusters. This is true for both dendrograms and reachability plots, which have been proposed as hierarchical clustering representations, and which have different advantages and disadvantages. In this paper we first investigate the relation between dendrograms and reachability plots and introduce methods to convert them into each other showing that they essentially contain the same information. Based on reachability plots, we then introduce a technique that automatically determines the significant clusters in a hierarchical cluster representation. This makes it for the first time possible to use hierarchical clustering as an automatic pre-processing step that requires no user interaction to select clusters from a hierarchical cluster representation.
引用
收藏
页码:75 / 87
页数:13
相关论文
共 50 条
  • [1] Automatic identification of the number of clusters in hierarchical clustering
    Karna, Ashutosh
    Gibert, Karina
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (01) : 119 - 134
  • [2] Automatic identification of the number of clusters in hierarchical clustering
    Ashutosh Karna
    Karina Gibert
    Neural Computing and Applications, 2022, 34 : 119 - 134
  • [3] A robust hierarchical clustering algorithm for automatic identification of clusters
    Long, Jianwu
    Wang, Qiang
    Liu, Luping
    APPLIED INTELLIGENCE, 2025, 55 (06)
  • [4] An Ensemble Clustering Framework Based on Hierarchical Clustering Ensemble Selection and Clusters Clustering
    Li, Wenjun
    Wang, Zikang
    Sun, Wei
    Bahrami, Sara
    CYBERNETICS AND SYSTEMS, 2023, 54 (05) : 741 - 766
  • [5] Indexes to Find the Optimal Number of Clusters in a Hierarchical Clustering
    David Martin-Fernandez, Jose
    Maria Luna-Romera, Jose
    Pontes, Beatriz
    Riquelme-Santos, Jose C.
    14TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS (SOCO 2019), 2020, 950 : 3 - 13
  • [6] Web Data Extraction with Hierarchical Clustering and Rich Features
    Dong, Yongquan
    Zhao, Xiangjun
    Zhang, Gongjie
    RECENT TRENDS IN MATERIALS AND MECHANICAL ENGINEERING MATERIALS, MECHATRONICS AND AUTOMATION, PTS 1-3, 2011, 55-57 : 1003 - 1008
  • [7] FedHC: Learning Imbalanced Clusters via Federated Hierarchical Clustering
    Zhang, Yue
    Liao, Xinfa
    Chen, Qingsheng
    Wu, Haotian
    Zhang, Yiqun
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT 1, 2025, 15031 : 505 - 521
  • [8] An ensemble agglomerative hierarchical clustering algorithm based on clusters clustering technique and the novel similarity measurement
    Li, Teng
    Rezaeipanah, Amin
    El Din, ElSayed M. Tag
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (06) : 3828 - 3842
  • [9] Hierarchical clustering in astronomy
    Yu, Heng
    Hou, Xiaolan
    ASTRONOMY AND COMPUTING, 2022, 41
  • [10] Taxonomy Extraction Using Knowledge Graph Embeddings and Hierarchical Clustering
    Martel, Felix
    Zouaq, Amal
    36TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2021, 2021, : 836 - 844