Empirical Evidence of the Applicability of Functional Clustering through Gene Expression Classification

Cited by: 8
Authors
Krejnik, Milos [1 ]
Klema, Jiri [1 ]
Affiliations
[1] Czech Tech Univ, Dept Cybernet, Fac Elect Engn, Prague 16627 6, Czech Republic
Keywords
Biological prior knowledge; gene expression; gene set analysis; clustering; feature extraction; classification; MICROARRAY DATA; CANCER; TOOLS; PREDICTION; EPITHELIUM; SELECTION; QUALITY;
DOI
10.1109/TCBB.2012.23
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
The availability of a great range of prior biological knowledge about the roles and functions of genes and gene-gene interactions allows us to simplify the analysis of gene expression data to make it more robust, compact, and interpretable. Here, we objectively analyze the applicability of functional clustering for the identification of groups of functionally related genes. The analysis is performed in terms of gene expression classification and uses predictive accuracy as an unbiased performance measure. Features of biological samples that originally corresponded to genes are replaced by features that correspond to the centroids of the gene clusters and are then used for classifier learning. Using 10 benchmark data sets, we demonstrate that functional clustering significantly outperforms random clustering without biological relevance. We also show that functional clustering performs comparably to gene expression clustering, which groups genes according to the similarity of their expression profiles. Finally, the suitability of functional clustering as a feature extraction technique is evaluated and discussed.
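The feature replacement described in the abstract can be illustrated with a minimal sketch. It assumes that a cluster centroid is the per-sample mean expression of the genes in that cluster, which the abstract does not spell out. The names `centroid_features` and `random_clusters`, the toy matrix `X`, and the cluster memberships are hypothetical and are not taken from the paper.

```python
import numpy as np


def centroid_features(expression, gene_clusters):
    """Replace gene-level features with one centroid feature per gene cluster.

    expression    : array of shape (n_samples, n_genes)
    gene_clusters : list of lists of gene column indices (hypothetical input)
    Assumption: the centroid is the mean expression over the cluster's genes.
    """
    expression = np.asarray(expression, dtype=float)
    return np.column_stack(
        [expression[:, idx].mean(axis=1) for idx in gene_clusters]
    )


def random_clusters(n_genes, sizes, seed=0):
    """Baseline: clusters of the given sizes drawn without biological relevance."""
    rng = np.random.default_rng(seed)
    perm = rng.permutation(n_genes)
    bounds = np.cumsum(sizes)[:-1]  # split points between consecutive clusters
    return [part.tolist() for part in np.split(perm[: sum(sizes)], bounds)]


# Toy data: 2 samples x 5 genes (values are made up for illustration).
X = np.array([
    [1.0, 3.0, 10.0, 20.0, 30.0],
    [2.0, 4.0, 0.0, 0.0, 6.0],
])
functional = [[0, 1], [2, 3, 4]]       # hypothetical functional gene groups
Z = centroid_features(X, functional)   # shape (2, 2): one column per cluster
Z_random = centroid_features(X, random_clusters(5, [2, 3]))
```

The reduced matrices `Z` and `Z_random` would then be passed to the same classifier, so that predictive accuracy can be compared between functional and random groupings of equal sizes.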
Pages: 788-798
Number of pages: 11