Gene ontology driven feature selection from microarray gene expression data

被引:0
|
作者
Qi, Jianlong [1 ]
Tang, Jian [1 ]
机构
[1] Mem Univ Newfoundland, Dept Comp Sci, St John, NF A1B 3X5, Canada
关键词
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
One of the main challenges in the classification of microarray gene expression data is the small sample size compared with the large number of genes, so feature selection is an essential step to remove genes not relevant to class label. Traditional gene selection methods often select the top-ranked genes based on their individual discriminative powers. The problem with these simple ranking models is that they evaluate genes in isolation and this may introduce redundancy among the selected feature subset. Most redundancy based methods solely evaluate gene expression levels. This may decrease the effectiveness of feature selection since some values may not be accurately measured. In this paper, we propose a gene ontology based method for feature selection. The novelty of this model is to detect redundancy between a pair of genes by the convex combination of their expression similarity and semantic similarity in gene ontology. The effectiveness of our method is demonstrated by the experiment in two widely used datasets.
引用
收藏
页码:428 / +
页数:2
相关论文
共 50 条
  • [41] A Survey on Filter Techniques for Feature Selection in Gene Expression Microarray Analysis
    Lazar, Cosmin
    Taminau, Jonatan
    Meganck, Stijn
    Steenhoff, David
    Coletta, Alain
    Molter, Colin
    de Schaetzen, Virginie
    Duque, Robin
    Bersini, Hugues
    Nowe, Ann
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (04) : 1106 - 1119
  • [42] Microarray data mining using gene ontology
    Li, SH
    Becich, MJ
    Gilbertson, J
    MEDINFO 2004: PROCEEDINGS OF THE 11TH WORLD CONGRESS ON MEDICAL INFORMATICS, PT 1 AND 2, 2004, 107 : 778 - 782
  • [43] Optimized gene selection and classification of cancer from microarray gene expression data using deep learning
    Shah, Shamveel Hussain
    Iqbal, Muhammad Javed
    Ahmad, Iftikhar
    Khan, Suleman
    Rodrigues, Joel J. P. C.
    NEURAL COMPUTING & APPLICATIONS, 2020,
  • [44] Microarray data mining using gene ontology
    Li, Songhui
    Becich, Michael J.
    Studies in Health Technology and Informatics, 2004, 107 : 778 - 782
  • [45] Data mining for feature selection in gene expression autism data
    Latkowski, Tomasz
    Osowski, Stanislaw
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (02) : 864 - 872
  • [46] Gene Selection for Microarray Expression Data with Imbalanced Sample Distributions
    Kamal, Abu H. M.
    Zhu, Xingquan
    Narayanan, Ramaswamy
    2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, : 3 - +
  • [47] Unsupervised selection of informative genes in microarray gene expression data
    Liaghat, Samaneh
    Mansoori, Eghbal G.
    INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2016, 3 (04) : 351 - 367
  • [48] Parsimonious Selection of Useful Genes in Microarray Gene Expression Data
    Gonzalez-Navarro, Felix F.
    Belanche-Munoz, Lluis A.
    SOFTWARE TOOLS AND ALGORITHMS FOR BIOLOGICAL SYSTEMS, 2011, 696 : 45 - 55
  • [49] Bi-dimensional principal gene feature selection from big gene expression data
    Hou, Xiaoqian
    Hou, Jingyu
    Huang, Guangyan
    PLOS ONE, 2022, 17 (12):
  • [50] Informative Feature Clustering and Selection for Gene Expression Data
    Yang, Yuqi
    Yin, Pengshuai
    Luo, Zhihang
    Gu, Wenwen
    Chen, Renjie
    Wu, Qingyao
    IEEE ACCESS, 2019, 7 : 169174 - 169184