LEARNING WITH GENE ONTOLOGY ANNOTATION USING FEATURE SELECTION AND CONSTRUCTION

Cited by: 1
Authors
Akand, Elma [1 ]
Bain, Michael [1 ]
Temple, Mark [2 ]
Affiliations
[1] Univ New S Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
[2] Univ Western Sydney, Sch Biomed & Hlth Sci, Penrith, Australia
Keywords
SACCHAROMYCES-CEREVISIAE; EXPRESSION DATA; GENOME; CLASSIFICATION; DELETION; BIOLOGY; STRESS; TOOLS;
DOI
10.1080/08839510903448627
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A key role for ontologies in bioinformatics is their use as a standardized, structured terminology, particularly to annotate the genes in a genome with functional and other properties. Since the output of many genome-scale experiments results in gene sets it is natural to ask if they share a common function. A standard approach is to apply a statistical test for overrepresentation of functional annotation, often within the gene ontology. In this article we propose an alternative to the standard approach that avoids problems in overrepresentation analysis due to statistical dependencies between ontology categories. We apply methods of feature construction and selection to preprocess gene ontology terms used for the annotation of gene sets and incorporate these features as input to a standard supervised machine-learning algorithm. Our approach is shown to allow the straightforward use of an ontology in the context of data sourced from multiple experiments to learn classifiers predicting gene function as part of a cellular response to environmental stress.
Pages: 5-38
Number of pages: 34
Related papers
50 in total
  • [21] Feature Selection Using Tabu Search with Learning Memory: Learning Tabu Search
    Mousin, Lucien
    Jourdan, Laetitia
    Marmion, Marie-Eleonore Kessaci
    Dhaenens, Clarisse
    LEARNING AND INTELLIGENT OPTIMIZATION (LION 10), 2016, 10079 : 141 - 156
  • [22] Predicting Novel Human Gene Ontology Annotations Using Semantic Analysis
    Done, Bogdan
    Khatri, Purvesh
    Done, Arina
    Draghici, Sorin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2010, 7 (01) : 91 - 99
  • [23] Manual Gene Ontology annotation workflow at the Mouse Genome Informatics Database
    Drabkin, Harold J.
    Blake, Judith A.
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2012,
  • [24] Identifying Effective Feature Selection Methods for Alzheimer's Disease Biomarker Gene Detection Using Machine Learning
    Alshamlan, Hala
    Omar, Samar
    Aljurayyad, Rehab
    Alabduljabbar, Reham
    DIAGNOSTICS, 2023, 13 (10)
  • [25] Designing genetic programming classifiers with feature selection and feature construction
    Ma, Jianbin
    Gao, Xiaoying
    APPLIED SOFT COMPUTING, 2020, 97
  • [26] Using a genetic algorithm and a perceptron for feature selection and supervised class learning in DNA microarray data
    Karzynski, M
    Mateos, A
    Herrero, J
    Dopazo, J
    ARTIFICIAL INTELLIGENCE REVIEW, 2003, 20 (1-2) : 39 - 51
  • [27] Dual regularized subspace learning using adaptive graph learning and rank constraint: Unsupervised feature selection on gene expression microarray datasets
    Moslemi, Amir
    Ahmadian, Arash
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 167
  • [28] An empirical evaluation of hierarchical feature selection methods for classification in bioinformatics datasets with gene ontology-based features
    Wan, Cen
    Freitas, Alex A.
    ARTIFICIAL INTELLIGENCE REVIEW, 2018, 50 : 201 - 240
  • [29] Image annotation techniques based on feature selection for class-pairs
    Lu, Jianjiang
    Li, Ran
    Zhang, Yafei
    Zhao, Tianzhong
    Lu, Zining
    KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 24 (02) : 325 - 337
  • [30] Feature Selection and Classification for Gene Expression Data using Evolutionary Computation
    Banka, Haider
    Dara, Suresh
    2012 23RD INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2012, : 185 - 189