Interestingness measures and strategies for mining multi-ontology multi-level association rules from gene ontology annotations for the discovery of new GO relationships

被引:21
作者
Manda, Prashanti [1 ]
McCarthy, Fiona [2 ]
Bridges, Susan M. [1 ]
机构
[1] Mississippi State Univ, Dept Comp Sci & Engn, Mississippi State, MS USA
[2] Univ Arizona, Dept Vet Sci & Microbiol, Tucson, AZ USA
关键词
Gene ontology; Association rule mining; Data mining; Interestingness measures; Gene ontology relationships; Interpro relationships; TOOL;
D O I
10.1016/j.jbi.2013.06.012
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Gene Ontology (GO), a set of three sub-ontologies, is one of the most popular bio-ontologies used for describing gene product characteristics. GO annotation data containing terms from multiple sub-ontologies and at different levels in the ontologies is an important source of implicit relationships between terms from the three sub-ontologies. Data mining techniques such as association rule mining that are tailored to mine from multiple ontologies at multiple levels of abstraction are required for effective knowledge discovery from GO annotation data. We present a data mining approach, Multi-ontology data mining at All Levels (MOAL) that uses the structure and relationships of the GO to mine multi-ontology multi-level association rules. We introduce two interestingness measures: Multi-ontology Support (MOSupport) and Multi-ontology Confidence (MOConfidence) customized to evaluate multi-ontology multi-level association rules. We also describe a variety of post-processing strategies for pruning uninteresting rules. We use publicly available GO annotation data to demonstrate our methods with respect to two applications (I) the discovery of co-annotation suggestions and (2) the discovery of new cross-ontology relationships. (C) 2013 The Authors. Published by Elsevier Inc. All rights reserved.
引用
收藏
页码:849 / 856
页数:8
相关论文
共 22 条
  • [1] Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
  • [2] GO PaD: The gene ontology partition database
    Alterovitz, Gil
    Xiang, Michael
    Mohan, Mamta
    Ramoni, Marco F.
    [J]. NUCLEIC ACIDS RESEARCH, 2007, 35 : D322 - D327
  • [3] [Anonymous], DATABASE OXFORD
  • [4] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [5] QuickGO: a web-based tool for Gene Ontology searching
    Binns, David
    Dimmer, Emily
    Huntley, Rachael
    Barrell, Daniel
    O'Donovan, Claire
    Apweiler, Rolf
    [J]. BIOINFORMATICS, 2009, 25 (22) : 3045 - 3046
  • [6] Bodenreider O, 2005, PACIFIC SYMPOSIUM ON BIOCOMPUTING 2005, P91
  • [7] Borgelt C, 2002, P 15 C COMP STAT
  • [8] Burgun A., 2004, P WORKSH FORM ARCH G, P28
  • [9] Integrated analysis of gene expression by association rules discovery
    Carmona-Saez, P
    Chagoyen, M
    Rodriguez, A
    Trelles, O
    Carazo, JM
    Pascual-Montano, A
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [10] Mining GO Annotations for Improving Annotation Consistency
    Faria, Daniel
    Schlicker, Andreas
    Pesquita, Catia
    Bastos, Hugo
    Ferreira, Antonio E. N.
    Albrecht, Mario
    Falcao, Andre O.
    [J]. PLOS ONE, 2012, 7 (07):