LEARNING WITH GENE ONTOLOGY ANNOTATION USING FEATURE SELECTION AND CONSTRUCTION

被引:1
作者
Akand, Elma [1 ]
Bain, Michael [1 ]
Temple, Mark [2 ]
机构
[1] Univ New S Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
[2] Univ Western Sydney, Sch Biomed & Hlth Sci, Penrith, Australia
关键词
SACCHAROMYCES-CEREVISIAE; EXPRESSION DATA; GENOME; CLASSIFICATION; DELETION; BIOLOGY; STRESS; TOOLS;
D O I
10.1080/08839510903448627
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A key role for ontologies in bioinformatics is their use as a standardized, structured terminology, particularly to annotate the genes in a genome with functional and other properties. Since the output of many genome-scale experiments results in gene sets it is natural to ask if they share a common function. A standard approach is to apply a statistical test for overrepresentation of functional annotation, often within the gene ontology. In this article we propose an alternative to the standard approach that avoids problems in overrepresentation analysis due to statistical dependencies between ontology categories. We apply methods of feature construction and selection to preprocess gene ontology terms used for the annotation of gene sets and incorporate these features as input to a standard supervised machine-learning algorithm. Our approach is shown to allow the straightforward use of an ontology in the context of data sourced from multiple experiments to learn classifiers predicting gene function as part of a cellular response to environmental stress.
引用
收藏
页码:5 / 38
页数:34
相关论文
共 50 条
  • [41] Learning Proximity Relations for Feature Selection
    Zhang, Taiping
    Ren, Pengfei
    Ge, Yao
    Zheng, Yali
    Tang, Yuan Yan
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (05) : 1231 - 1244
  • [42] Unsupervised feature selection with ensemble learning
    Elghazel, Haytham
    Aussem, Alex
    MACHINE LEARNING, 2015, 98 (1-2) : 157 - 180
  • [43] A comprehensive learning based swarm optimization approach for feature selection in gene expression data
    Easwaran, Subha
    Venugopal, Jothi Prakash
    Subramanian, Arul Antran Vijay
    Sundaram, Gopikrishnan
    Naseeba, Beebi
    HELIYON, 2024, 10 (17)
  • [44] Unsupervised Feature Selection for Microarray Gene Expression Data Based on Discriminative Structure Learning
    Ye, Xiucai
    Sakurai, Tetsuya
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2018, 24 (06) : 725 - 741
  • [45] Feature selection with ensemble learning for prostate cancer diagnosis from microarray gene expression
    Gumaei, Abdu
    Sammouda, Rachid
    Al-Rakhami, Mabrook
    AlSalman, Hussain
    El-Zaart, Ali
    HEALTH INFORMATICS JOURNAL, 2021, 27 (01)
  • [46] Feature selection for unsupervised learning through local learning
    Yao, Jin
    Mao, Qi
    Goodison, Steve
    Mai, Volker
    Sun, Yijun
    PATTERN RECOGNITION LETTERS, 2015, 53 : 100 - 107
  • [47] Efficient n-gram construction for text categorization using feature selection techniques
    Garcia, Maximiliano
    Maldonado, Sebastian
    Vairetti, Carla
    INTELLIGENT DATA ANALYSIS, 2021, 25 (03) : 509 - 525
  • [48] SoFoCles: Feature filtering for microarray classification based on Gene Ontology
    Papachristoudis, Georgios
    Diplaris, Sotiris
    Mitkas, Pericles A.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2010, 43 (01) : 1 - 14
  • [49] Deepdefense: annotation of immune systems in prokaryotes using deep learning
    Hauns, Sven
    Alkhnbashi, Omer S.
    Backofen, Rolf
    GIGASCIENCE, 2024, 13
  • [50] Gene Selection Using Iterative Feature Elimination Random Forests for Survival Outcomes
    Pang, Herbert
    George, Stephen L.
    Hui, Ken
    Tong, Tiejun
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (05) : 1422 - 1431