Predicting host dependency factors of pathogens in Drosophila melanogaster using machine learning

被引:5
作者
Aromolaran, Olufemi [1 ,2 ,3 ]
Beder, Thomas [2 ,4 ]
Adedeji, Eunice [3 ,5 ]
Ajamma, Yvonne [3 ]
Oyelade, Jelili [1 ,2 ,3 ]
Adebiyi, Ezekiel [1 ,3 ]
Koenig, Rainer [2 ,4 ]
机构
[1] Covenant Univ, Dept Comp & Informat Sci, Ota, Ogun State, Nigeria
[2] Jena Univ Hosp, Integrated Res & Treatment Ctr, Ctr Sepsis Control & Care CSCC, Klinikum 1, D-07747 Jena, Germany
[3] Covenant Univ, Covenant Univ Bioinformat Res CUBRe, Ota, Ogun State, Nigeria
[4] Jena Univ Hosp, Inst Infect Dis & Infect Control, Klinikum 1, D-07747 Jena, Germany
[5] Covenant Univ, Dept Biochem, Ota, Ogun State, Nigeria
来源
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL | 2021年 / 19卷 / 19期
关键词
Host factors; Bacteria; Infection; Knockout screen; Machine learning; Drosophila; WIDE RNAI SCREEN; RAB GTPASES; ESSENTIAL GENES; WEB SERVER; PROTEIN; EXPRESSION; TRAFFICKING; DATABASE; CANCER; LEGIONELLA;
D O I
10.1016/j.csbj.2021.08.010
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Pathogens causing infections, and particularly when invading the host cells, require the host cell machinery for efficient regeneration and proliferation during infection. For their life cycle, host proteins are needed and these Host Dependency Factors (HDF) may serve as therapeutic targets. Several attempts have approached screening for HDF producing large lists of potential HDF with, however, only marginal overlap. To get consistency into the data of these experimental studies, we developed a machine learning pipeline. As a case study, we used publicly available lists of experimentally derived HDF from twelve different screening studies based on gene perturbation in Drosophila melanogaster cells or in vivo upon bacterial or protozoan infection. A total of 50,334 gene features were generated from diverse categories including their functional annotations, topology attributes in protein interaction networks, nucleotide and protein sequence features, homology properties and subcellular localization. Cross-validation revealed an excellent prediction performance. All feature categories contributed to the model. Predicted and experimentally derived HDF showed a good consistency when investigating their common cellular processes and function. Cellular processes and molecular function of these genes were highly enriched in membrane trafficking, particularly in the trans-Golgi network, cell cycle and the Rab GTPase binding family. Using our machine learning approach, we show that HDF in organisms can be predicted with high accuracy evidencing their common investigated characteristics. We elucidated cellular processes which are utilized by invading pathogens during infection. Finally, we provide a list of 208 novel HDF proposed for future experimental studies. (C) 2021 Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology.
引用
收藏
页码:4581 / 4592
页数:12
相关论文
共 98 条
  • [21] Use of RNA interference in Drosophila S2 cells to identify host pathways controlling compartmentalization of an intracellular pathogen
    Cheng, LW
    Viala, JPM
    Stuurman, N
    Wiedemann, U
    Vale, RD
    Portnoy, DA
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (38) : 13646 - 13651
  • [22] N-terminal ubiquitination: more protein substrates join in
    Ciechanover, A
    Ben-Saadon, R
    [J]. TRENDS IN CELL BIOLOGY, 2004, 14 (03) : 103 - 106
  • [23] The Structure and Function of the Retromer Protein Complex
    Collins, Brett M.
    [J]. TRAFFIC, 2008, 9 (11) : 1811 - 1822
  • [24] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
  • [25] Genome-Wide RNAi Screen Identifies Genes Involved in Intestinal Pathogenic Bacterial Infection
    Cronin, Shane J. F.
    Nehme, Nadine T.
    Limmer, Stefanie
    Liegeois, Samuel
    Pospisilik, J. Andrew
    Schramek, Daniel
    Leibbrandt, Andreas
    Simoes, Ricardo de Matos
    Gruber, Susanne
    Puc, Urszula
    Ebersberger, Ingo
    Zoranovic, Tamara
    Neely, G. Gregory
    von Haeseler, Arndt
    Ferrandon, Dominique
    Penninger, Josef M.
    [J]. SCIENCE, 2009, 325 (5938) : 340 - 343
  • [26] Rab18 is required for viral assembly of hepatitis C virus through trafficking of the core protein to lipid droplets
    Dansako, Hiromichi
    Hiramoto, Hiroki
    Ikeda, Masanori
    Wakita, Takaji
    Kato, Nobuyuki
    [J]. VIROLOGY, 2014, 462 : 166 - 174
  • [27] RNAi screen in Drosophila cells reveals the involvement of the tom complex in Chlamydia infection
    Derre, Isabelle
    Pypaert, Marc
    Dautry-Varsat, Alice
    Agaisse, Herve
    [J]. PLOS PATHOGENS, 2007, 3 (10) : 1446 - 1458
  • [28] Delineation of prognostic biomarkers in prostate cancer
    Dhanasekaran, SM
    Barrette, TR
    Ghosh, D
    Shah, R
    Varambally, S
    Kurachi, K
    Pienta, KJ
    Rubin, MA
    Chinnaiyan, AM
    [J]. NATURE, 2001, 412 (6849) : 822 - 826
  • [29] A genome-wide RNAi screen to dissect centriole duplication and centrosome maturation in Drosophila
    Dobbelaere, Jeroen
    Josue, Filipe
    Suijkerbuijk, Saskia
    Baum, Buzz
    Tapon, Nicolas
    Raff, Jordan
    [J]. PLOS BIOLOGY, 2008, 6 (09): : 1975 - 1990
  • [30] RNA interference analysis of Legionella in Drosophila cells:: Exploitation of early secretory apparatus dynamics
    Dorer, Marion S.
    Kirton, Donald
    Bader, Joel S.
    Isberg, Ralph R.
    [J]. PLOS PATHOGENS, 2006, 2 (04) : 315 - 327