Empirical Evidence of the Applicability of Functional Clustering through Gene Expression Classification

被引:8
|
作者
Krejnik, Milos [1 ]
Klema, Jiri [1 ]
机构
[1] Czech Tech Univ, Dept Cybernet, Fac Elect Engn, Prague 16627 6, Czech Republic
关键词
Biological prior knowledge; gene expression; gene set analysis; clustering; feature extraction; classification; MICROARRAY DATA; CANCER; TOOLS; PREDICTION; EPITHELIUM; SELECTION; QUALITY;
D O I
10.1109/TCBB.2012.23
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The availability of a great range of prior biological knowledge about the roles and functions of genes and gene-gene interactions allows us to simplify the analysis of gene expression data to make it more robust, compact, and interpretable. Here, we objectively analyze the applicability of functional clustering for the identification of groups of functionally related genes. The analysis is performed in terms of gene expression classification and uses predictive accuracy as an unbiased performance measure. Features of biological samples that originally corresponded to genes are replaced by features that correspond to the centroids of the gene clusters and are then used for classifier learning. Using 10 benchmark data sets, we demonstrate that functional clustering significantly outperforms random clustering without biological relevance. We also show that functional clustering performs comparably to gene expression clustering, which groups genes according to the similarity of their expression profiles. Finally, the suitability of functional clustering as a feature extraction technique is evaluated and discussed.
引用
收藏
页码:788 / 798
页数:11
相关论文
共 50 条
  • [41] Discriminant Projection Shared Dictionary Learning for Classification of Tumors Using Gene Expression Data
    Peng, Shaoliang
    Yang, Yaning
    Liu, Wei
    Li, Fei
    Liao, Xiangke
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (04) : 1464 - 1473
  • [42] FACTOR ANALYSIS FOR CROSS-PLATFORM TUMOR CLASSIFICATION BASED ON GENE EXPRESSION PROFILES
    Wang, Shu-Lin
    Gui, Jie
    Li, Xueling
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2010, 19 (01) : 243 - 258
  • [43] Computational analysis of microarray gene expression profiles: clustering, classification, and beyond
    Liang, J
    Kachalo, S
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2002, 62 (02) : 199 - 216
  • [44] A Bayesian network classification methodology for gene expression data
    Helman, P
    Veroff, R
    Atlas, SR
    Willman, C
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2004, 11 (04) : 581 - 615
  • [45] Gene expression profile based classification models of psoriasis
    Guo, Pi
    Luo, Youxi
    Mai, Guoqin
    Zhang, Ming
    Wang, Guoqing
    Zhao, Miaomiao
    Gao, Liming
    Li, Fan
    Zhou, Fengfeng
    GENOMICS, 2014, 103 (01) : 48 - 55
  • [46] Selecting Few Genes for Microarray Gene Expression Classification
    Alonso-Gonzalez, Carlos J.
    Isaac Moro, Q.
    Prieto, Oscar J.
    Aranzazu Simon, M.
    CURRENT TOPICS IN ARTIFICIAL INTELLIGENCE, 2010, 5988 : 111 - 120
  • [47] Classification of Cancer Types based on Gene Expression Data
    He, Yinchao
    Bockmon, Ryan
    Modey, Miracle
    Roscoe, Sarah
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2175 - 2182
  • [48] Model-based clustering and classification of functional data
    Chamroukhi, Faicel
    Nguyen, Hien D.
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 9 (04)
  • [49] A network-assisted co-clustering algorithm to discover cancer subtypes based on gene expression
    Liu, Yiyi
    Gu, Quanquan
    Hou, Jack P.
    Han, Jiawei
    Ma, Jian
    BMC BIOINFORMATICS, 2014, 15
  • [50] Clustering gene expression data with a penalized graph-based metric
    Baya, Ariel E.
    Granitto, Pablo M.
    BMC BIOINFORMATICS, 2011, 12