PFP-GO: Integrating protein sequence, domain and protein-protein interaction information for protein function prediction using ranked GO terms

被引:5
|
作者
Sengupta, Kaustav [1 ,2 ,3 ]
Saha, Sovan [4 ]
Halder, Anup Kumar [1 ,3 ]
Chatterjee, Piyali [5 ]
Nasipuri, Mita [2 ]
Basu, Subhadip [2 ]
Plewczynski, Dariusz [1 ,3 ]
机构
[1] Univ Warsaw, Ctr New Technol, Lab Funct & Struct Genom, Warsaw, Poland
[2] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
[3] Warsaw Univ Technol, Fac Math & Informat Sci, Lab Bioinformat & Computat Genom, Warsaw, Poland
[4] Inst Engn & Management, Dept Comp Sci & Engn, Kolkata, W Bengal, India
[5] Netaji Subhash Engn Coll, Dept Comp Sci & Engn, Kolkata, India
基金
欧盟地平线“2020”; 美国国家卫生研究院;
关键词
protein sequence; protein domain; protein-protein interaction network; 3D gene-gene association; ranked GO; protein function prediction; GENE ONTOLOGY; SUBCELLULAR-LOCALIZATION; INTERACTION NETWORKS; FUNCTION ANNOTATION; DATABASE; ORDER; FAMILIES; TOOLS;
D O I
10.3389/fgene.2022.969915
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Protein function prediction is gradually emerging as an essential field in biological and computational studies. Though the latter has clinched a significant footprint, it has been observed that the application of computational information gathered from multiple sources has more significant influence than the one derived from a single source. Considering this fact, a methodology, PFP-GO, is proposed where heterogeneous sources like Protein Sequence, Protein Domain, and Protein-Protein Interaction Network have been processed separately for ranking each individual functional GO term. Based on this ranking, GO terms are propagated to the target proteins. While Protein sequence enriches the sequence-based information, Protein Domain and Protein-Protein Interaction Networks embed structural/functional and topological based information, respectively, during the phase of GO ranking. Performance analysis of PFP-GO is also based on Precision, Recall, and F-Score. The same was found to perform reasonably better when compared to the other existing state-of-art. PFP-GO has achieved an overall Precision, Recall, and F-Score of 0.67, 0.58, and 0.62, respectively. Furthermore, we check some of the top-ranked GO terms predicted by PFP-GO through multilayer network propagation that affect the 3D structure of the genome.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Protein Function Prediction Using Function Associations in Protein-Protein Interaction Network
    Sun, Pingping
    Tan, Xian
    Guo, Sijia
    Zhang, Jingbo
    Sun, Bojian
    Du, Ning
    Wang, Han
    Sun, Hui
    IEEE ACCESS, 2018, 6 : 30892 - 30902
  • [2] Active learning for protein function prediction in protein-protein interaction networks
    Xiong, Wei
    Xie, Luyu
    Zhou, Shuigeng
    Guan, Jihong
    NEUROCOMPUTING, 2014, 145 : 44 - 52
  • [3] Protein function prediction using neighbor relativity in protein-protein interaction network
    Moosavi, Sobhan
    Rahgozar, Masoud
    Rahimi, Amir
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2013, 43 : 11 - 16
  • [4] Detection of Locally Over-Represented GO Terms in Protein-Protein Interaction Networks
    Lavallee-Adam, Mathieu
    Coulombe, Benoit
    Blanchette, Mathieu
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, PROCEEDINGS, 2009, 5541 : 302 - +
  • [5] Identifying Protein Complexes From Protein-Protein Interaction Networks Based on Fuzzy Clustering and GO Semantic Information
    Pan, Xiangyu
    Hu, Lun
    Hu, Pengwei
    You, Zhu-Hong
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (05) : 2882 - 2893
  • [6] COFACTOR: improved protein function prediction by combining structure, sequence and protein-protein interaction information
    Zhang, Chengxin
    Freddolino, Peter L.
    Zhang, Yang
    NUCLEIC ACIDS RESEARCH, 2017, 45 (W1) : W291 - W299
  • [7] Domain Content Based Protein Function Prediction Using Incomplete GO Annotation Information
    Tan, Lirong
    Yu, Zhiwen
    Wong, Hau-San
    BIBMW: 2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOP, 2009, : 49 - +
  • [8] Prediction of membrane protein types by fusing protein-protein interaction and protein sequence information
    Zhang, Xiaolin
    Chen, Lei
    BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2020, 1868 (12):
  • [9] Improving protein function prediction using protein sequence and GO-term similarities
    Makrodimitris, Stavros
    van Ham, Roeland C. H. J.
    Reinders, Marcel J. T.
    BIOINFORMATICS, 2019, 35 (07) : 1116 - 1124
  • [10] InfersentPPI: Prediction of Protein-Protein Interaction Using Protein Sentence Embedding With Gene Ontology Information
    Li, Meijing
    Jiang, Yingying
    Ryu, Keun Ho
    FRONTIERS IN GENETICS, 2022, 13