PFP-GO: Integrating protein sequence, domain and protein-protein interaction information for protein function prediction using ranked GO terms

被引:5
|
作者
Sengupta, Kaustav [1 ,2 ,3 ]
Saha, Sovan [4 ]
Halder, Anup Kumar [1 ,3 ]
Chatterjee, Piyali [5 ]
Nasipuri, Mita [2 ]
Basu, Subhadip [2 ]
Plewczynski, Dariusz [1 ,3 ]
机构
[1] Univ Warsaw, Ctr New Technol, Lab Funct & Struct Genom, Warsaw, Poland
[2] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
[3] Warsaw Univ Technol, Fac Math & Informat Sci, Lab Bioinformat & Computat Genom, Warsaw, Poland
[4] Inst Engn & Management, Dept Comp Sci & Engn, Kolkata, W Bengal, India
[5] Netaji Subhash Engn Coll, Dept Comp Sci & Engn, Kolkata, India
基金
欧盟地平线“2020”; 美国国家卫生研究院;
关键词
protein sequence; protein domain; protein-protein interaction network; 3D gene-gene association; ranked GO; protein function prediction; GENE ONTOLOGY; SUBCELLULAR-LOCALIZATION; INTERACTION NETWORKS; FUNCTION ANNOTATION; DATABASE; ORDER; FAMILIES; TOOLS;
D O I
10.3389/fgene.2022.969915
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Protein function prediction is gradually emerging as an essential field in biological and computational studies. Though the latter has clinched a significant footprint, it has been observed that the application of computational information gathered from multiple sources has more significant influence than the one derived from a single source. Considering this fact, a methodology, PFP-GO, is proposed where heterogeneous sources like Protein Sequence, Protein Domain, and Protein-Protein Interaction Network have been processed separately for ranking each individual functional GO term. Based on this ranking, GO terms are propagated to the target proteins. While Protein sequence enriches the sequence-based information, Protein Domain and Protein-Protein Interaction Networks embed structural/functional and topological based information, respectively, during the phase of GO ranking. Performance analysis of PFP-GO is also based on Precision, Recall, and F-Score. The same was found to perform reasonably better when compared to the other existing state-of-art. PFP-GO has achieved an overall Precision, Recall, and F-Score of 0.67, 0.58, and 0.62, respectively. Furthermore, we check some of the top-ranked GO terms predicted by PFP-GO through multilayer network propagation that affect the 3D structure of the genome.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Integrating experimental and literature protein-protein interaction data for protein complex prediction
    Yijia Zhang
    Hongfei Lin
    Zhihao Yang
    Jian Wang
    BMC Genomics, 16
  • [22] Integrating experimental and literature protein-protein interaction data for protein complex prediction
    Zhang, Yijia
    Lin, Hongfei
    Yang, Zhihao
    Wang, Jian
    BMC GENOMICS, 2015, 16
  • [23] Prediction and characterization of protein-protein interaction networks in swine
    Wang, Fen
    Liu, Min
    Song, Baoxing
    Li, Dengyun
    Pei, Huimin
    Guo, Yang
    Huang, Jingfei
    Zhang, Deli
    PROTEOME SCIENCE, 2012, 10
  • [24] PSOPIA: Toward more reliable protein-protein interaction prediction from sequence information
    Murakami, Yoichi
    Mizuguchi, Kenji
    2017 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATICS AND BIOMEDICAL SCIENCES (ICIIBMS), 2017, : 255 - 261
  • [25] Integrated protein function prediction by mining function associations, sequences, and protein-protein and gene-gene interaction networks
    Cao, Renzhi
    Cheng, Jianlin
    METHODS, 2016, 93 : 84 - 91
  • [26] Novel Domain Identification Approach for Protein-protein Interaction Prediction
    Shatnawi, Maad
    Zaki, Nazar M.
    2015 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2015, : 145 - 152
  • [27] Utilizing shared interacting domain patterns and Gene Ontology information to improve protein-protein interaction prediction
    Roslan, Rosfuzah
    Othman, Razib M.
    Shah, Zuraini A.
    Kasim, Shahreen
    Asmuni, Hishammuddin
    Taliba, Jumail
    Hassan, Rohayanti
    Zakaria, Zalmiyah
    COMPUTERS IN BIOLOGY AND MEDICINE, 2010, 40 (06) : 555 - 564
  • [28] Evolution of Sequence-based Bioinformatics Tools for Protein-protein Interaction Prediction
    Khatun, Mst Shamima
    Shoombuatong, Watshara
    Hasan, Md Mehedi
    Kurata, Hiroyuki
    CURRENT GENOMICS, 2020, 21 (06) : 454 - 463
  • [29] TransDomain: A Transitive Domain-Based Method in Protein-Protein Interaction Prediction
    Tang, Yi-Tsung
    Kao, Hung-Yu
    BIOINFORMATICS RESEARCH AND APPLICATIONS, 2011, 6674 : 240 - 252
  • [30] A matrix based algorithm for protein-protein interaction prediction using domain-domain associations
    Priya, S. Binny
    Saha, Subhojit
    Anishetty, Ramesh
    Anishetty, Sharmila
    JOURNAL OF THEORETICAL BIOLOGY, 2013, 326 : 36 - 42