LEARNING WITH GENE ONTOLOGY ANNOTATION USING FEATURE SELECTION AND CONSTRUCTION

被引:1
作者
Akand, Elma [1 ]
Bain, Michael [1 ]
Temple, Mark [2 ]
机构
[1] Univ New S Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
[2] Univ Western Sydney, Sch Biomed & Hlth Sci, Penrith, Australia
关键词
SACCHAROMYCES-CEREVISIAE; EXPRESSION DATA; GENOME; CLASSIFICATION; DELETION; BIOLOGY; STRESS; TOOLS;
D O I
10.1080/08839510903448627
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A key role for ontologies in bioinformatics is their use as a standardized, structured terminology, particularly to annotate the genes in a genome with functional and other properties. Since the output of many genome-scale experiments results in gene sets it is natural to ask if they share a common function. A standard approach is to apply a statistical test for overrepresentation of functional annotation, often within the gene ontology. In this article we propose an alternative to the standard approach that avoids problems in overrepresentation analysis due to statistical dependencies between ontology categories. We apply methods of feature construction and selection to preprocess gene ontology terms used for the annotation of gene sets and incorporate these features as input to a standard supervised machine-learning algorithm. Our approach is shown to allow the straightforward use of an ontology in the context of data sourced from multiple experiments to learn classifiers predicting gene function as part of a cellular response to environmental stress.
引用
收藏
页码:5 / 38
页数:34
相关论文
共 50 条
  • [31] Feature Selection and Clustering of Gene Expression Profiles Using Biological Knowledge
    Mitra, Sushmita
    Ghosh, Sampreeti
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06): : 1590 - 1599
  • [32] Feature Selection and Classification for Gene Expression Data using Evolutionary Computation
    Banka, Haider
    Dara, Suresh
    2012 23RD INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2012, : 185 - 189
  • [33] Feature Selection for Gene Expression Using Model-Based Entropy
    Zhu, Shenghuo
    Wang, Dingding
    Yu, Kai
    Li, Tao
    Gong, Yihong
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2010, 7 (01) : 25 - 36
  • [34] Feature Selection Using Information Distance Measure for Gene Expression Data
    Cai, Jie
    Liang, Cheng
    Luo, Jiawei
    CURRENT PROTEOMICS, 2018, 15 (05) : 352 - 362
  • [35] Robust Feature Selection Using Ensemble Feature Selection Techniques
    Saeys, Yvan
    Abeel, Thomas
    Van de Peer, Yves
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART II, PROCEEDINGS, 2008, 5212 : 313 - +
  • [36] Ensemble Method of Feature Selection and Reverse Construction of Gene Logical Network Based on Information Entropy
    Zhao, Qingfeng
    Zhang, Yulin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 34 (02)
  • [37] A Study on the Effect of Feature Selection on Malware Analysis using Machine Learning
    Babaagba, Kehinde Oluwatoyin
    Adesanya, Samuel Olumide
    PROCEEDINGS OF 2019 8TH INTERNATIONAL CONFERENCE ON EDUCATIONAL AND INFORMATION TECHNOLOGY (ICEIT 2019), 2019, : 51 - 55
  • [38] An Efficient Feature Selection Algorithm for Gene Families Using NMF and ReliefF
    Liu, Kai
    Chen, Qi
    Huang, Guo-Hua
    GENES, 2023, 14 (02)
  • [39] Advancing Cardiac Disease Detection Using Feature Extraction, Feature Selection, and Ensemble Learning Approaches
    Tripathy, S. R.
    Rath, A.
    Sharma, R.
    Panda, G.
    Sharma, Meenakshi
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2025, 84 (02): : 207 - 218
  • [40] Learning Proximity Relations for Feature Selection
    Zhang, Taiping
    Ren, Pengfei
    Ge, Yao
    Zheng, Yali
    Tang, Yuan Yan
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (05) : 1231 - 1244