Evaluation of clustering algorithms for gene expression data using gene ontology annotations

被引:3
|
作者
Ma Ning [1 ]
Zhang Zheng-guo [1 ]
机构
[1] Chinese Acad Med Sci, Peking Union Med Coll, Inst Basic Med Sci, Dept Biomed Engn,Sch Basic Med, Beijing 100005, Peoples R China
关键词
microarray; gene expression; clustering; gene ontology; TOOL;
D O I
10.3760/cma.j.issn.0366-6999.2012.17.015
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background Clustering is a useful exploratory technique for interpreting gene expression data to reveal groups of genes sharing common functional attributes. Biologists frequently face the problem of choosing an appropriate algorithm. We aimed to provide a standalone, easily accessible and biologically oriented criterion for expression data clustering evaluation. Methods An external criterion utilizing annotation based similarities between genes is proposed in this work. Gene ontology information is employed as the annotation source. Comparisons among six widely used clustering algorithms over various types of gene expression data sets were carried out based on the criterion proposed. Results The rank of these algorithms given by the criterion coincides with our common knowledge. Single-linkage has significantly poorer performance, even worse than the random algorithm. Ward's method archives the best performance in most cases. Conclusions The criterion proposed has a strong ability to distinguish among different clustering algorithms with different distance measurements. It is also demonstrated that analyzing main contributors of the criterion may offer some guidelines in finding local compact clusters. As an addition, we suggest using Ward's algorithm for gene expression data analysis. Chin Med J 2012;125(17):3048-3052
引用
收藏
页码:3048 / 3052
页数:5
相关论文
共 50 条
  • [1] Evaluation of clustering algorithms for gene expression data using gene ontology annotations
    MA Ning
    ZHANG Zheng-guo
    中华医学杂志(英文版), 2012, (17) : 3048 - 3052
  • [2] Evaluation of clustering algorithms for gene expression data
    Datta, Susmita
    Datta, Somnath
    BMC BIOINFORMATICS, 2006, 7 (Suppl 4)
  • [3] Evaluation of clustering algorithms for gene expression data
    Susmita Datta
    Somnath Datta
    BMC Bioinformatics, 7
  • [4] Incorporating gene ontology in clustering gene expression data
    Kustra, Rafal
    Zagdanski, Adam
    19TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, PROCEEDINGS, 2006, : 555 - +
  • [5] Gene-Ontology-based clustering of gene expression data
    Adryan, B
    Schuh, R
    BIOINFORMATICS, 2004, 20 (16) : 2851 - 2852
  • [6] Computational algorithms to predict Gene Ontology annotations
    Pinoli, Pietro
    Chicco, Davide
    Masseroli, Marco
    BMC BIOINFORMATICS, 2015, 16
  • [7] Computational algorithms to predict Gene Ontology annotations
    Pietro Pinoli
    Davide Chicco
    Marco Masseroli
    BMC Bioinformatics, 16
  • [8] Clustering Algorithms: Their Application to Gene Expression Data
    Oyelade, Jelili
    Isewon, Itunuoluwa
    Oladipupo, Funke
    Aromolaran, Olufemi
    Uwoghiren, Efosa
    Ameh, Faridah
    Achas, Moses
    Adebiyi, Ezekiel
    BIOINFORMATICS AND BIOLOGY INSIGHTS, 2016, 10 : 237 - 253
  • [9] Using Gene Ontology annotations in exploratory microarray clustering to understand cancer etiology
    Macintyre, Geoff
    Bailey, James
    Gustafsson, Daniel
    Haviv, Izhak
    Kowalczyk, Adam
    PATTERN RECOGNITION LETTERS, 2010, 31 (14) : 2138 - 2146
  • [10] Incorporating gene ontology into fuzzy relational clustering of microarray gene expression data
    Paul, Animesh Kumar
    Shill, Pintu Chandra
    BIOSYSTEMS, 2018, 163 : 1 - 10