Evaluating the Significance of Protein Functional Similarity Based on Gene Ontology

被引:3
|
作者
Konopka, Bogumil M. [1 ]
Golda, Tomasz [1 ]
Kotulska, Malgorzata [1 ]
机构
[1] Wroclaw Univ Technol, Inst Biomed Engn & Instrumentat, PL-50370 Wroclaw, Poland
关键词
gene ontology; protein function; semantic similarity; SEMANTIC SIMILARITY; PREDICTION; TOOL;
D O I
10.1089/cmb.2014.0181
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Gene ontology is among the most successful ontologies in the biomedical domain. It is used to describe, unambiguously, protein molecular functions, cellular localizations, and processes in which proteins participate. The hierarchical structure of gene ontology allows quantifying protein functional similarity by application of algorithms that calculate semantic similarities. The scores, however, are meaningless without a given context. Here, we propose how to evaluate the significance of protein function semantic similarity scores by comparing them to reference distributions calculated for randomly chosen proteins. In the study, thresholds for significant functional semantic similarity, in four representative annotation corpuses, were estimated. We also show that the score significance is influenced by the number and specificity of gene ontology terms that are annotated to compared proteins. While proteins with a greater number of terms tend to yield higher similarity scores, proteins with more specific terms produce lower scores. The estimated significance thresholds were validated using protein sequence-function and structure-function relationships. Taking into account the term number and term specificity improves the distinction between significant and insignificant semantic similarity comparisons.
引用
收藏
页码:809 / 822
页数:14
相关论文
共 50 条
  • [41] Determining protein-protein functional associations by functional rules based on gene ontology and KEGG pathway
    Zhang, Yu-Hang
    Zeng, Tao
    Chen, Lei
    Huang, Tao
    Cai, Yu-Dong
    BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2021, 1869 (06):
  • [42] Exploring information from the topology beneath the Gene Ontology terms to improve semantic similarity measures
    Zhang, Shu-Bo
    Lai, Jian-Huang
    GENE, 2016, 586 (01) : 148 - 157
  • [43] An improved method for scoring protein-protein interactions using semantic similarity within the gene ontology
    Jain, Shobhit
    Bader, Gary D.
    BMC BIOINFORMATICS, 2010, 11
  • [44] Defining functional distances over Gene Ontology
    Angela del Pozo
    Florencio Pazos
    Alfonso Valencia
    BMC Bioinformatics, 9
  • [45] A New Family of Similarity Measures for Scoring Confidence of Protein Interactions Using Gene Ontology
    Paul, Madhusudan
    Anand, Ashish
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (01) : 19 - 30
  • [46] Gene function prediction based on Gene Ontology Hierarchy Preserving Hashing
    Zhao, Yingwen
    Fu, Guangyuan
    Wang, Jun
    Guo, Maozu
    Yu, Guoxian
    GENOMICS, 2019, 111 (03) : 334 - 342
  • [47] Evaluating disease similarity based on gene network reconstruction and representation
    Yang Li
    Wang Keqi
    Wang, Guohua
    BIOINFORMATICS, 2021, 37 (20) : 3579 - 3587
  • [48] Methods of Gene Ontology Term Similarity Analysis in Graph Database Environment
    Stypka, Lukasz
    Kozielski, Michal
    BEYOND DATABASES, ARCHITECTURES AND STRUCTURES, BDAS 2014, 2014, 424 : 345 - 354
  • [49] seGOsa: Software Environment for Gene Ontology-driven Similarity Assessment
    Zheng, Huiru
    Azuaje, Francisco
    Wang, Haiying
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 539 - 542
  • [50] Functional Interpretation of Gene Sets: Semantic-Based Clustering of Gene Ontology Terms on the BioTest Platform
    Gruca, Aleksandra
    Jaksik, Roman
    Psiuk-Maksymowicz, Krzysztof
    MAN-MACHINE INTERACTIONS 5, ICMMI 2017, 2018, 659 : 125 - 136