PFP-GO: Integrating protein sequence, domain and protein-protein interaction information for protein function prediction using ranked GO terms

被引:5
作者
Sengupta, Kaustav [1 ,2 ,3 ]
Saha, Sovan [4 ]
Halder, Anup Kumar [1 ,3 ]
Chatterjee, Piyali [5 ]
Nasipuri, Mita [2 ]
Basu, Subhadip [2 ]
Plewczynski, Dariusz [1 ,3 ]
机构
[1] Univ Warsaw, Ctr New Technol, Lab Funct & Struct Genom, Warsaw, Poland
[2] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
[3] Warsaw Univ Technol, Fac Math & Informat Sci, Lab Bioinformat & Computat Genom, Warsaw, Poland
[4] Inst Engn & Management, Dept Comp Sci & Engn, Kolkata, W Bengal, India
[5] Netaji Subhash Engn Coll, Dept Comp Sci & Engn, Kolkata, India
基金
美国国家卫生研究院; 欧盟地平线“2020”;
关键词
protein sequence; protein domain; protein-protein interaction network; 3D gene-gene association; ranked GO; protein function prediction; GENE ONTOLOGY; SUBCELLULAR-LOCALIZATION; INTERACTION NETWORKS; FUNCTION ANNOTATION; DATABASE; ORDER; FAMILIES; TOOLS;
D O I
10.3389/fgene.2022.969915
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Protein function prediction is gradually emerging as an essential field in biological and computational studies. Though the latter has clinched a significant footprint, it has been observed that the application of computational information gathered from multiple sources has more significant influence than the one derived from a single source. Considering this fact, a methodology, PFP-GO, is proposed where heterogeneous sources like Protein Sequence, Protein Domain, and Protein-Protein Interaction Network have been processed separately for ranking each individual functional GO term. Based on this ranking, GO terms are propagated to the target proteins. While Protein sequence enriches the sequence-based information, Protein Domain and Protein-Protein Interaction Networks embed structural/functional and topological based information, respectively, during the phase of GO ranking. Performance analysis of PFP-GO is also based on Precision, Recall, and F-Score. The same was found to perform reasonably better when compared to the other existing state-of-art. PFP-GO has achieved an overall Precision, Recall, and F-Score of 0.67, 0.58, and 0.62, respectively. Furthermore, we check some of the top-ranked GO terms predicted by PFP-GO through multilayer network propagation that affect the 3D structure of the genome.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Advances in Computational Methods for Protein-Protein Interaction Prediction
    Xian, Lei
    Wang, Yansu
    ELECTRONICS, 2024, 13 (06)
  • [42] Prediction of human protein-protein interaction by a domain-based approach
    Zhang, Xiaopan
    Jiao, Xiong
    Song, Jie
    Chang, Shan
    JOURNAL OF THEORETICAL BIOLOGY, 2016, 396 : 144 - 153
  • [43] Collective prediction of protein functions from protein-protein interaction networks
    Qingyao Wu
    Yunming Ye
    Michael K Ng
    Shen-Shyang Ho
    Ruichao Shi
    BMC Bioinformatics, 15
  • [44] Domain-PFP allows protein function prediction using function-aware domain embedding representations
    Ibtehaz, Nabil
    Kagaya, Yuki
    Kihara, Daisuke
    COMMUNICATIONS BIOLOGY, 2023, 6 (01)
  • [45] Improving protein-protein interaction prediction using protein language model and protein network features
    Hu, Jun
    Li, Zhe
    Rao, Bing
    Thafar, Maha A.
    Arif, Muhammad
    ANALYTICAL BIOCHEMISTRY, 2024, 693
  • [46] Prediction of protein motions from amino acid sequence and its application to protein-protein interaction
    Hirose, Shuichi
    Yokota, Kiyonobu
    Kuroda, Yutaka
    Wako, Hiroshi
    Endo, Shigeru
    Kanai, Satoru
    Noguchi, Tamotsu
    BMC STRUCTURAL BIOLOGY, 2010, 10
  • [47] Protein-protein interaction and site prediction using transfer learning
    Liu, Tuoyu
    Gao, Han
    Ren, Xiaopu
    Xu, Guoshun
    Liu, Bo
    Wu, Ningfeng
    Luo, Huiying
    Wang, Yuan
    Tu, Tao
    Yao, Bin
    Guan, Feifei
    Teng, Yue
    Huang, Huoqing
    Tian, Jian
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
  • [48] Prediction of protein-protein interaction sites using an ensemble method
    Deng, Lei
    Guan, Jihong
    Dong, Qiwen
    Zhou, Shuigeng
    BMC BIOINFORMATICS, 2009, 10
  • [49] Prediction of Protein-Protein Interaction Types Using the Decision Templates
    Chen, Wei
    Zhang, Shao-Wu
    Cheng, Yong-Mei
    2009 FOURTH INTERNATIONAL CONFERENCE ON BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, PROCEEDINGS, 2009, : 93 - 98
  • [50] Improving the prediction of yeast protein function using weighted protein-protein interactions
    Ahmed, Khaled S.
    Saloma, Nahed H.
    Kadah, Yasser M.
    THEORETICAL BIOLOGY AND MEDICAL MODELLING, 2011, 8