PFP-GO: Integrating protein sequence, domain and protein-protein interaction information for protein function prediction using ranked GO terms

被引:5
作者
Sengupta, Kaustav [1 ,2 ,3 ]
Saha, Sovan [4 ]
Halder, Anup Kumar [1 ,3 ]
Chatterjee, Piyali [5 ]
Nasipuri, Mita [2 ]
Basu, Subhadip [2 ]
Plewczynski, Dariusz [1 ,3 ]
机构
[1] Univ Warsaw, Ctr New Technol, Lab Funct & Struct Genom, Warsaw, Poland
[2] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
[3] Warsaw Univ Technol, Fac Math & Informat Sci, Lab Bioinformat & Computat Genom, Warsaw, Poland
[4] Inst Engn & Management, Dept Comp Sci & Engn, Kolkata, W Bengal, India
[5] Netaji Subhash Engn Coll, Dept Comp Sci & Engn, Kolkata, India
基金
美国国家卫生研究院; 欧盟地平线“2020”;
关键词
protein sequence; protein domain; protein-protein interaction network; 3D gene-gene association; ranked GO; protein function prediction; GENE ONTOLOGY; SUBCELLULAR-LOCALIZATION; INTERACTION NETWORKS; FUNCTION ANNOTATION; DATABASE; ORDER; FAMILIES; TOOLS;
D O I
10.3389/fgene.2022.969915
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Protein function prediction is gradually emerging as an essential field in biological and computational studies. Though the latter has clinched a significant footprint, it has been observed that the application of computational information gathered from multiple sources has more significant influence than the one derived from a single source. Considering this fact, a methodology, PFP-GO, is proposed where heterogeneous sources like Protein Sequence, Protein Domain, and Protein-Protein Interaction Network have been processed separately for ranking each individual functional GO term. Based on this ranking, GO terms are propagated to the target proteins. While Protein sequence enriches the sequence-based information, Protein Domain and Protein-Protein Interaction Networks embed structural/functional and topological based information, respectively, during the phase of GO ranking. Performance analysis of PFP-GO is also based on Precision, Recall, and F-Score. The same was found to perform reasonably better when compared to the other existing state-of-art. PFP-GO has achieved an overall Precision, Recall, and F-Score of 0.67, 0.58, and 0.62, respectively. Furthermore, we check some of the top-ranked GO terms predicted by PFP-GO through multilayer network propagation that affect the 3D structure of the genome.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Some Remarks on Prediction of Protein-Protein Interaction with Machine Learning
    Zhang, Shao-Wu
    Wei, Ze-Gang
    MEDICINAL CHEMISTRY, 2015, 11 (03) : 254 - 264
  • [32] A matrix based algorithm for protein-protein interaction prediction using domain-domain associations
    Priya, S. Binny
    Saha, Subhojit
    Anishetty, Ramesh
    Anishetty, Sharmila
    JOURNAL OF THEORETICAL BIOLOGY, 2013, 326 : 36 - 42
  • [33] Human protein-protein interaction prediction
    Mark D McDowall
    Michelle S Scott
    Geoffrey J Barton
    BMC Bioinformatics, 11 (Suppl 10)
  • [34] Eukaryotic protein subcellular localization prediction based on sequence conservation and protein-protein interaction
    Zhang Song
    Xia Xue-Feng
    Shen Jin-Cheng
    Sun Zhi-Rong
    PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS, 2008, 35 (05) : 531 - 535
  • [35] Computational Prediction of Protein Function Based on Weighted Mapping of Domains and GO Terms
    Teng, Zhixia
    Guo, Maozu
    Dai, Qiguo
    Wang, Chunyu
    Li, Jin
    Liu, Xiaoyan
    BIOMED RESEARCH INTERNATIONAL, 2014, 2014
  • [36] In Silico Protein-Protein Interaction Prediction with Sequence Alignment and Classifier Stacking
    Marini, Simone
    Xu, Qian
    Yang, Qiang
    CURRENT PROTEIN & PEPTIDE SCIENCE, 2011, 12 (07) : 614 - 620
  • [37] Collective prediction of protein functions from protein-protein interaction networks
    Wu, Qingyao
    Ye, Yunming
    Ng, Michael K.
    Ho, Shen-Shyang
    Shi, Ruichao
    BMC BIOINFORMATICS, 2014, 15
  • [38] Evolutionary profiles improve protein-protein interaction prediction from sequence
    Hamp, Tobias
    Rost, Burkhard
    BIOINFORMATICS, 2015, 31 (12) : 1945 - 1950
  • [39] Domain Linker Region Knowledge Contributes to Protein-protein Interaction Prediction
    Zaki, Nazar
    Campbell, Piers
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING (IACSIT ICMLC 2009), 2009, : 70 - 74
  • [40] Prediction of Protein-Protein Interactions from Protein Sequence Using Local Descriptors
    Yang, Lei
    Xia, Jun-Feng
    Gui, Jie
    PROTEIN AND PEPTIDE LETTERS, 2010, 17 (09) : 1085 - 1090