Optimized leaf ordering with class labels for hierarchical clustering

被引:3
|
作者
Novoselova, Natalia [1 ]
Wang, Junxi [2 ]
Klawonn, Frank [2 ,3 ]
机构
[1] United Inst Informat Problems, Dept Bioinformat, Surganova Str 6, Minsk 220012, BELARUS
[2] Helmholtz Ctr Infect Res, Biostat, D-38124 Braunschweig, Germany
[3] Ostfalia Univ Appl Sci, Dept Comp Sci, D-38302 Wolfenbuttel, Germany
关键词
Hierarchical clustering; dendrogram; leaf ordering; dynamic programming; biomedical data;
D O I
10.1142/S0219720015500122
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Hierarchical clustering is extensively used in the bioinformatics community to analyze biomedical data. These data are often tagged with class labels, as e.g. disease subtypes or gene ontology (GO) terms. Heatmaps in connection with dendrograms are the common standard to visualize results of hierarchical clustering. The heatmap can be enriched by an additional color bar at the side, indicating for each instance in the data set to which class it belongs. In the ideal case, when the clustering matches perfectly with the classes, one would expect that instances from the same class cluster together and the color bar consists of well-separated color blocks without frequent alteration of colors (classes). But even in the case when instances from the same class cluster perfectly together, the dendrogram might not reflect this important aspect due to the fact that its representation is not unique. In this paper, we propose a leaf ordering algorithm for the dendrogram that preserving the hierarchical clustering result tries to group instances from the same class together. It is based on the concept of dynamic programming which can efficiently compute the optimal or nearly optimal order, consistent with the structure of the tree.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Enhanced Adjacency-Constrained Hierarchical Clustering Using Fine-Grained Pseudo Labels
    Yang, Jie
    Lin, Chin-Teng
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (03): : 2481 - 2492
  • [22] OPTIMIZED SIMILARITY BASED HIERARCHICAL CLUSTERING APPROACH FOR BRAIN MRI IMAGE SEGMENTATION
    Dayana, Roslin K.
    Sakthi, D. Vishnu
    SherinBeevi, L.
    Manisha, G.
    Rani, Shobha P.
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (05) : 3156 - 3160
  • [23] GCFI plus plus : Embedding and Frequent Itemset based Incremental Hierarchical Clustering with Labels and Outliers
    Dixit, Vini
    PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 135 - 143
  • [24] A PRELIMINARY ORDERING OF POLICY NETWORK LABELS
    JORDAN, G
    SCHUBERT, K
    EUROPEAN JOURNAL OF POLITICAL RESEARCH, 1992, 21 (1-2) : 7 - 27
  • [25] Hierarchical Clustering Using One-Class Support Vector Machines
    Lee, Gyemin
    SYMMETRY-BASEL, 2015, 7 (03): : 1164 - 1175
  • [26] Hierarchical Clustering via Sketches and Hierarchical Correlation Clustering
    Vainstein, Danny
    Chatziafratis, Vaggos
    Citovsky, Gui
    Rajagopalan, Anand
    Mahdian, Mohammad
    Azar, Yossi
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130 : 559 - +
  • [27] Hierarchical vertex ordering
    Woo, SH
    Yang, SB
    GRAPH TRANSFORMATIONS, PROCEEDINGS, 2002, 2505 : 393 - 401
  • [28] Incremental Clustering for Hierarchical Clustering
    Narita, Kakeru
    Hochin, Teruhisa
    Nomiya, Hiroki
    2018 5TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE/ INTELLIGENCE AND APPLIED INFORMATICS (CSII 2018), 2018, : 102 - 107
  • [29] Class Ordering Tool - A Tool for Class Ordering in Integration Testing
    Jaroenpiboonkit, Jutarat
    Suwannasart, Taratip
    2008 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING, 2008, : 724 - 728
  • [30] Feature selection for binary classification based on class labeling, SOM, and hierarchical clustering
    Zhao Zhengtian
    Rui Zhiyuan
    Duan Xiaoyan
    MEASUREMENT & CONTROL, 2023, 56 (9-10): : 1649 - 1669