Empirical Evidence of the Applicability of Functional Clustering through Gene Expression Classification

被引:8
|
作者
Krejnik, Milos [1 ]
Klema, Jiri [1 ]
机构
[1] Czech Tech Univ, Dept Cybernet, Fac Elect Engn, Prague 16627 6, Czech Republic
关键词
Biological prior knowledge; gene expression; gene set analysis; clustering; feature extraction; classification; MICROARRAY DATA; CANCER; TOOLS; PREDICTION; EPITHELIUM; SELECTION; QUALITY;
D O I
10.1109/TCBB.2012.23
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The availability of a great range of prior biological knowledge about the roles and functions of genes and gene-gene interactions allows us to simplify the analysis of gene expression data to make it more robust, compact, and interpretable. Here, we objectively analyze the applicability of functional clustering for the identification of groups of functionally related genes. The analysis is performed in terms of gene expression classification and uses predictive accuracy as an unbiased performance measure. Features of biological samples that originally corresponded to genes are replaced by features that correspond to the centroids of the gene clusters and are then used for classifier learning. Using 10 benchmark data sets, we demonstrate that functional clustering significantly outperforms random clustering without biological relevance. We also show that functional clustering performs comparably to gene expression clustering, which groups genes according to the similarity of their expression profiles. Finally, the suitability of functional clustering as a feature extraction technique is evaluated and discussed.
引用
收藏
页码:788 / 798
页数:11
相关论文
共 50 条
  • [21] Clustering gene expression data using a diffraction-inspired framework
    Dinger, Steven C.
    Van Wyk, Michael A.
    Carmona, Sergio
    Rubin, David M.
    BIOMEDICAL ENGINEERING ONLINE, 2012, 11
  • [22] Clustering-based gene-subnetwork biomarker identification using gene expression data
    Doungpan, Narumol
    Engchuan, Worrawat
    Meechai, Asawin
    Chan, Jonathan H.
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [23] Double Selection Based Semi-Supervised Clustering Ensemble for Tumor Clustering from Gene Expression Profiles
    Yu, Zhiwen
    Chen, Hongsheng
    You, Jane
    Wong, Hau-San
    Liu, Jiming
    Li, Le
    Han, Guoqiang
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (04) : 727 - 740
  • [24] Gene Expression Data Classification by VVRKFA
    Ghorai, Santanu
    Mukherjee, Anirban
    Dutta, Pranab K.
    2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION, CONTROL AND INFORMATION TECHNOLOGY (C3IT-2012), 2012, 4 : 330 - 335
  • [25] A Cooperative Feature Gene Extraction Algorithm that Combines Classification and Clustering
    Chow, Chi Kin
    Zhu, Hailong
    Lacy, Jessica
    Lingen, Mark W.
    Kuo, Winston Patrick
    Chan, Keith
    BIBMW: 2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOP, 2009, : 194 - +
  • [26] Multi-class cancer classification through gene expression profiles: microRNA versus mRNA
    Peng, Sihua
    Zeng, Xiaomin
    Li, Xiaobo
    Peng, Xiaoning
    Chen, Liangbiao
    JOURNAL OF GENETICS AND GENOMICS, 2009, 36 (07) : 409 - 416
  • [27] Technique of Gene Expression Profiles Extraction Based on the Complex Use of Clustering and Classification Methods
    Babichev, Sergii
    Skvor, Jiri
    DIAGNOSTICS, 2020, 10 (08)
  • [28] Gene expression studies with DGL global optimization for the molecular classification of cancer
    Li, Dongguang
    SOFT COMPUTING, 2011, 15 (01) : 111 - 129
  • [29] Gene expression data classification using locally linear discriminant embedding
    Li, Bo
    Zheng, Chun-Hou
    Huang, De-Shuang
    Zhang, Lei
    Han, Kyungsook
    COMPUTERS IN BIOLOGY AND MEDICINE, 2010, 40 (10) : 802 - 810
  • [30] Clustering High Dimensional Gene Expression Data via Two Step Feature Filtering
    Chen, Jianjiao
    Song, Anping
    Zhang, Wu
    2011 6TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY (ICCIT), 2012, : 299 - 303