DeepPBI-KG: a deep learning method for the prediction of phage-bacteria interactions based on key genes

被引:0
|
作者
Wei, Tongqing [1 ]
Lu, Chenqi [1 ]
Du, Hanxiao [1 ]
Yang, Qianru [1 ]
Qi, Xin [2 ]
Liu, Yankun [2 ]
Zhang, Yi [3 ]
Chen, Chen [3 ]
Li, Yutong [1 ]
Tang, Yuanhao [1 ]
Zhang, Wen-Hong [1 ,2 ,3 ]
Tao, Xu [1 ,2 ,3 ]
Jiang, Ning [1 ,2 ,3 ]
机构
[1] Fudan Univ, Sch Life Sci, State Key Lab Genet Engn, 2005 Songhu Rd, Shanghai 200433, Peoples R China
[2] Shanghai Scitech Inno Ctr Infect & Immun, 1688 Guoquan Bei Rd, Shanghai, Peoples R China
[3] Fudan Univ, Huashan Hosp, Shanghai Med Coll, Dept Infect Dis, 12 Wulumuqi Zhong Rd, Shanghai, Peoples R China
关键词
phage-bacteria interaction; machine learning; deep learning; negative sample selection; receptor binding protein; BACTERIOPHAGES; RESISTANCE; HOSTS;
D O I
10.1093/bib/bbae484
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Phages, the natural predators of bacteria, were discovered more than 100 years ago. However, increasing antimicrobial resistance rates have revitalized phage research. Methods that are more time-consuming and efficient than wet-laboratory experiments are needed to help screen phages quickly for therapeutic use. Traditional computational methods usually ignore the fact that phage-bacteria interactions are achieved by key genes and proteins. Methods for intraspecific prediction are rare since almost all existing methods consider only interactions at the species and genus levels. Moreover, most strains in existing databases contain only partial genome information because whole-genome information for species is difficult to obtain. Here, we propose a new approach for interaction prediction by constructing new features from key genes and proteins via the application of K-means sampling to select high-quality negative samples for prediction. Finally, we develop DeepPBI-KG, a corresponding prediction tool based on feature selection and a deep neural network. The results show that the average area under the curve for prediction reached 0.93 for each strain, and the overall AUC and area under the precision-recall curve reached 0.89 and 0.92, respectively, on the independent test set; these values are greater than those of other existing prediction tools. The forward and reverse validation results indicate that key genes and key proteins regulate and influence the interaction, which supports the reliability of the model. In addition, intraspecific prediction experiments based on Klebsiella pneumoniae data demonstrate the potential applicability of DeepPBI-KG for intraspecific prediction. In summary, the feature engineering and interaction prediction approaches proposed in this study can effectively improve the robustness and stability of interaction prediction, can achieve high generalizability, and may provide new directions and insights for rapid phage screening for therapy. Graphical Abstract
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Applying one-class learning algorithms to predict phage-bacteria interactions
    Fernando Lopez, Juan
    Lopez Sotelo, Jesus Alfonso
    Leite, Diogo
    Pena-Reyes, Carlos
    2019 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2019, : 63 - 68
  • [2] PTBGRP: predicting phage-bacteria interactions with graph representation learning on microbial heterogeneous information network
    Pan, Jie
    You, Zhuhong
    You, Wencai
    Zhao, Tian
    Feng, Chenlu
    Zhang, Xuexia
    Ren, Fengzhi
    Ma, Sanxing
    Wu, Fan
    Wang, Shiwei
    Sun, Yanmei
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
  • [3] Exploration of multiclass and one-class learning methods for prediction of phage-bacteria interaction at strain level
    Leite, Diogo Manuel Carvalho
    Fernando Lopez, Juan
    Brochet, Xavier
    Barreto-Sanz, Miguel
    Que, Yok-Ai
    Resch, Gregory
    Pena-Reyes, Carlos
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 1818 - 1825
  • [4] deep-Sep: a deep learning-based method for fast and accurate prediction of selenoprotein genes in bacteria
    Xiao, Yao
    Zhang, Yan
    MSYSTEMS, 2025,
  • [5] A novel method for identifying key genes in macroevolution based on deep learning with attention mechanism
    Jiawei Mao
    Yong Cao
    Yan Zhang
    Biaosheng Huang
    Youjie Zhao
    Scientific Reports, 13
  • [6] A novel method for identifying key genes in macroevolution based on deep learning with attention mechanism
    Mao, Jiawei
    Cao, Yong
    Zhang, Yan
    Huang, Biaosheng
    Zhao, Youjie
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [7] Towards BacterioPhage Genetic Edition: Deep Learning Prediction of Phage-Bacterium Interactions
    Ataee, Shabnam
    Rodriguez, Oscar
    Brochet, Xavier
    Pena, Carlos Andres
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2946 - 2948
  • [8] Inflation Prediction Method Based on Deep Learning
    Yang, Cheng
    Guo, Shuhua
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [9] A Prediction Method of Localizability Based on Deep Learning
    Gao, Yang
    Wang, Shu Qi
    Li, Jing Hang
    Hu, Meng Qi
    Xia, Hong Yao
    Hu, Hui
    Wang, Lai Jun
    IEEE ACCESS, 2020, 8 : 110103 - 110115
  • [10] Flow velocity prediction method based on deep learning
    Zhao, Zihan
    Yang, Zihao
    SEVENTH INTERNATIONAL CONFERENCE ON TRAFFIC ENGINEERING AND TRANSPORTATION SYSTEM, ICTETS 2023, 2024, 13064