Multitask learning for host-pathogen protein interactions

被引:64
|
作者
Kshirsagar, Meghana [1 ]
Carbonell, Jaime [1 ]
Klein-Seetharaman, Judith [1 ,2 ,3 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Language Technol Inst, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
[2] Forschungszentrum Julich, ICS 5, D-52425 Julich, Germany
[3] Univ Warwick, Syst Biol Ctr, Coventry CV4 7AL, W Midlands, England
关键词
GENE ONTOLOGY; PREDICTION;
D O I
10.1093/bioinformatics/btt245
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: An important aspect of infectious disease research involves understanding the differences and commonalities in the infection mechanisms underlying various diseases. Systems biology-based approaches study infectious diseases by analyzing the interactions between the host species and the pathogen organisms. This work aims to combine the knowledge from experimental studies of host-pathogen interactions in several diseases to build stronger predictive models. Our approach is based on a formalism from machine learning called 'multitask learning', which considers the problem of building models across tasks that are related to each other. A 'task' in our scenario is the set of host-pathogen protein interactions involved in one disease. To integrate interactions from several tasks (i.e. diseases), our method exploits the similarity in the infection process across the diseases. In particular, we use the biological hypothesis that similar pathogens target the same critical biological processes in the host, in defining a common structure across the tasks. Results: Our current work on host-pathogen protein interaction prediction focuses on human as the host, and four bacterial species as pathogens. The multitask learning technique we develop uses a task-based regularization approach. We find that the resulting optimization problem is a difference of convex (DC) functions. To optimize, we implement a Convex-Concave procedure-based algorithm. We compare our integrative approach to baseline methods that build models on a single host-pathogen protein interaction dataset. Our results show that our approach outperforms the baselines on the training data. We further analyze the protein interaction predictions generated by the models, and find some interesting insights.
引用
收藏
页码:217 / 226
页数:10
相关论文
共 50 条
  • [1] Editorial: Protein homeostasis in host-pathogen interactions
    Yeom, Jinki
    Shin, Donghyuk
    Qiao, Yuan
    FRONTIERS IN MICROBIOLOGY, 2023, 13
  • [2] Host-pathogen interactions
    Kaisho, Tsuneyasu
    Wagner, Hermann
    CURRENT OPINION IN IMMUNOLOGY, 2008, 20 (04) : 369 - 370
  • [3] Host-pathogen interactions
    Kaiser, P
    VETERINARY IMMUNOLOGY AND IMMUNOPATHOLOGY, 2004, 100 (3-4) : 115 - 115
  • [4] Host-pathogen interactions
    Kaufmann, Stefan H. E.
    Walker, Bruce D.
    CURRENT OPINION IN IMMUNOLOGY, 2006, 18 (04) : 371 - 373
  • [5] Host-pathogen interactions
    Garcia-Sastre, Adolfo
    Sansonetti, Philippe J.
    CURRENT OPINION IN IMMUNOLOGY, 2010, 22 (04) : 425 - 427
  • [6] Host-Pathogen Interactions
    Shader, Richard I.
    CLINICAL THERAPEUTICS, 2019, 41 (10) : 1899 - 1901
  • [7] A predictive approach for host-pathogen interactions using deep learning and protein sequences
    Shakibania T.
    Arabfard M.
    Najafi A.
    VirusDisease, 2024, 35 (3) : 434 - 445
  • [8] Considering the host in host-pathogen interactions
    不详
    NATURE MICROBIOLOGY, 2024, : 1149 - 1149
  • [9] Comparative mapping of host-pathogen protein-protein interactions
    Shah, Priya S.
    Wojcechowskyj, Jason A.
    Eckhardt, Manon
    Krogan, Nevan J.
    CURRENT OPINION IN MICROBIOLOGY, 2015, 27 : 62 - 68
  • [10] Computational prediction of host-pathogen protein-protein interactions
    Dyer, Matthew D.
    Murali, T. M.
    Sobral, Bruno W.
    BIOINFORMATICS, 2007, 23 (13) : I159 - I166