Information Content-Based Gene Ontology Functional Similarity Measures: Which One to Use for a Given Biological Data Type?

被引:27
|
作者
Mazandu, Gaston K. [1 ,2 ,3 ]
Mulder, Nicola J. [1 ]
机构
[1] Univ Cape Town, Computat Biol Grp, Dept Clin Lab Sci, IDM,Fac Hlth Sci, ZA-7925 Cape Town, South Africa
[2] African Inst Math Sci, Cape Town, South Africa
[3] African Inst Math Sci, Cape Coast, Ghana
来源
PLOS ONE | 2014年 / 9卷 / 12期
关键词
SEMANTIC SIMILARITY; TOOL;
D O I
10.1371/journal.pone.0113859
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The current increase in Gene Ontology (GO) annotations of proteins in the existing genome databases and their use in different analyses have fostered the improvement of several biomedical and biological applications. To integrate this functional data into different analyses, several protein functional similarity measures based on GO term information content (IC) have been proposed and evaluated, especially in the context of annotation-based measures. In the case of topology-based measures, each approach was set with a specific functional similarity measure depending on its conception and applications for which it was designed. However, it is not clear whether a specific functional similarity measure associated with a given approach is the most appropriate, given a biological data set or an application, i.e., achieving the best performance compared to other functional similarity measures for the biological application under consideration. We show that, in general, a specific functional similarity measure often used with a given term IC or term semantic similarity approach is not always the best for different biological data and applications. We have conducted a performance evaluation of a number of different functional similarity measures using different types of biological data in order to infer the best functional similarity measure for each different term IC and semantic similarity approach. The comparisons of different protein functional similarity measures should help researchers choose the most appropriate measure for the biological application under consideration.
引用
收藏
页数:20
相关论文
共 4 条
  • [1] Information Content-Based Gene Ontology Semantic Similarity Approaches: Toward a Unified Framework Theory
    Mazandu, Gaston K.
    Mulder, Nicola J.
    BIOMED RESEARCH INTERNATIONAL, 2013, 2013
  • [2] Applying semantic similarity measures based on information content in the evaluation of a domain ontology
    Hernandez Garcia, Aimee Cecilia
    Tovar Vidal, Mireya
    Lavalle Martinez, Jose de Jesus
    2018 17TH MEXICAN INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (MICAI 2018), 2018, : 8 - 12
  • [3] A semantic similarity measure for linked data: An information content-based approach
    Meymandpour, Rouzbeh
    Davis, Joseph G.
    KNOWLEDGE-BASED SYSTEMS, 2016, 109 : 276 - 293
  • [4] DaGO-Fun: tool for Gene Ontology-based functional analysis using term information content measures
    Gaston K Mazandu
    Nicola J Mulder
    BMC Bioinformatics, 14