Evaluating the Significance of Protein Functional Similarity Based on Gene Ontology

被引:3
|
作者
Konopka, Bogumil M. [1 ]
Golda, Tomasz [1 ]
Kotulska, Malgorzata [1 ]
机构
[1] Wroclaw Univ Technol, Inst Biomed Engn & Instrumentat, PL-50370 Wroclaw, Poland
关键词
gene ontology; protein function; semantic similarity; SEMANTIC SIMILARITY; PREDICTION; TOOL;
D O I
10.1089/cmb.2014.0181
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Gene ontology is among the most successful ontologies in the biomedical domain. It is used to describe, unambiguously, protein molecular functions, cellular localizations, and processes in which proteins participate. The hierarchical structure of gene ontology allows quantifying protein functional similarity by application of algorithms that calculate semantic similarities. The scores, however, are meaningless without a given context. Here, we propose how to evaluate the significance of protein function semantic similarity scores by comparing them to reference distributions calculated for randomly chosen proteins. In the study, thresholds for significant functional semantic similarity, in four representative annotation corpuses, were estimated. We also show that the score significance is influenced by the number and specificity of gene ontology terms that are annotated to compared proteins. While proteins with a greater number of terms tend to yield higher similarity scores, proteins with more specific terms produce lower scores. The estimated significance thresholds were validated using protein sequence-function and structure-function relationships. Taking into account the term number and term specificity improves the distinction between significant and insignificant semantic similarity comparisons.
引用
收藏
页码:809 / 822
页数:14
相关论文
共 50 条
  • [21] Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks
    Jian Wang
    Dong Xie
    Hongfei Lin
    Zhihao Yang
    Yijia Zhang
    Proteome Science, 10
  • [22] Characterisation of semantic similarity on gene ontology based on a shortest path approach
    Shen, Ying
    Zhang, Shaohong
    Wong, Hau-San
    Zhang, Lin
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2014, 10 (01) : 33 - 48
  • [23] An Integrated Information-Based Similarity Measurement of Gene Ontology Terms
    Zhang, Shu-Bo
    Lai, Jian-Huang
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2015, 12 (04) : 1235 - 1253
  • [24] Improving the measurement of semantic similarity by combining gene ontology and co-functional network: a random walk based approach
    Peng, Jiajie
    Zhang, Xuanshuo
    Hui, Weiwei
    Lu, Junya
    Li, Qianqian
    Liu, Shuhui
    Shang, Xuequn
    BMC SYSTEMS BIOLOGY, 2018, 12
  • [25] An improved method for scoring protein-protein interactions using semantic similarity within the gene ontology
    Shobhit Jain
    Gary D Bader
    BMC Bioinformatics, 11
  • [26] Assessing Human Disease Phenotype Similarity Based on Ontology
    Le, Duc-Hau
    Pham, Ba-Su
    Dao, Anh-Minh
    2016 IEEE RIVF INTERNATIONAL CONFERENCE ON COMPUTING & COMMUNICATION TECHNOLOGIES, RESEARCH, INNOVATION, AND VISION FOR THE FUTURE (RIVF), 2016, : 211 - 216
  • [27] Information Content-Based Gene Ontology Functional Similarity Measures: Which One to Use for a Given Biological Data Type?
    Mazandu, Gaston K.
    Mulder, Nicola J.
    PLOS ONE, 2014, 9 (12):
  • [28] Implications of functional similarity for gene regulatory interactions
    Glass, Kimberly
    Ott, Edward
    Losert, Wolfgang
    Girvan, Michelle
    JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2012, 9 (72) : 1625 - 1636
  • [29] Ontology-Based Prediction and Prioritization of Gene Functional Annotations
    Chicco, Davide
    Masseroli, Marco
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (02) : 248 - 260
  • [30] Evaluating Topology-based Metrics for GO Term Similarity Measures
    Jeong, Jong Cheol
    Chen, Xue-wen
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,