VirusHound-I: prediction of viral proteins involved in the evasion of host adaptive immune response using the random forest algorithm and generative adversarial network for data augmentation

Cited by: 6
Authors
Beltran, Jorge F. [1]
Belen, Lisandra Herrera [2]
Farias, Jorge G. [3]
Zamorano, Mauricio [4]
Lefin, Nicolas [5]
Miranda, Javiera [5]
Parraguez-Contreras, Fernanda [5]
Affiliations
[1] Univ La Frontera, Fac Engn & Sci, Dept Chem Engn, Ave Francisco Salazar 01145, Temuco, Chile
[2] Univ Santo Tomas Temuco, Temuco, Chile
[3] Univ La Frontera, Fac Engn & Sci, Temuco, Chile
[4] Univ La Frontera Temuco, Dept Chem Engn, Temuco, Chile
[5] Univ La Frontera, Temuco, Chile
Keywords
virus; pathogen; machine learning; neural network; deep learning; protein; subcellular localization; strategies; mechanisms
DOI
10.1093/bib/bbad434
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Throughout evolution, pathogenic viruses have developed different strategies to evade the response of the adaptive immune system. To replicate successfully, some pathogenic viruses encode proteins that manipulate the molecular mechanisms of host cells. Several bioinformatics tools exist for virus research; however, none of them focuses on predicting viral proteins that evade the adaptive immune system. In this work, we have developed VirusHound-I, a novel tool based on machine and deep learning for predicting this type of viral protein. The tool is based on a model developed with the random forest algorithm using the dipeptide composition molecular descriptor. We have also demonstrated the robustness of our strategy for augmenting the positive dataset with generative adversarial networks. During 10-fold cross-validation on the training dataset, the predictive model showed 0.947 accuracy, 0.994 precision, 0.943 F1 score, 0.995 specificity, 0.896 sensitivity, 0.894 kappa, 0.898 Matthews correlation coefficient and 0.989 AUC. During the testing step, the model showed 0.964 accuracy, 1.0 precision, 0.967 F1 score, 1.0 specificity, 0.936 sensitivity, 0.929 kappa, 0.931 Matthews correlation coefficient and 1.0 AUC. VirusHound-I makes it possible to predict viral proteins that evade the host's adaptive immune system. We believe that it can be very useful in accelerating studies on the molecular mechanisms of evasion of pathogenic viruses, as well as in the discovery of therapeutic targets.
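The dipeptide composition descriptor named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function name `dipeptide_composition`, the restriction to the 20 standard amino acids, and the choice to normalize by the number of valid adjacent pairs are assumptions made here for illustration. The result is a 400-dimensional vector of dipeptide frequencies, the kind of fixed-length input a classifier such as a random forest could consume.

```python
from itertools import product
from typing import List

# Assumption: only the 20 standard amino acids are considered.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 20 x 20 = 400 ordered pairs, in a fixed order.
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]


def dipeptide_composition(sequence: str) -> List[float]:
    """Return the 400 dipeptide frequencies of a protein sequence.

    Hypothetical helper: pairs containing a non-standard residue
    (e.g. 'X') are skipped, and counts are divided by the number
    of valid adjacent pairs.
    """
    seq = sequence.upper()
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:
            counts[pair] += 1
            total += 1
    if total == 0:
        # Sequence too short or with no valid pairs: all-zero vector.
        return [0.0] * len(DIPEPTIDES)
    return [counts[d] / total for d in DIPEPTIDES]


if __name__ == "__main__":
    vector = dipeptide_composition("ACAC")
    print(len(vector))
    print(vector[DIPEPTIDES.index("AC")], vector[DIPEPTIDES.index("CA")])
```

Because every sequence maps to the same 400 positions regardless of its length, vectors from proteins of different sizes can be stacked directly into a feature matrix.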
Pages: 8