Machine Learning for Prediction of Drug Targets in Microbe Associated Cardiovascular Diseases by Incorporating Host-pathogen Interaction Network Parameters

被引:11
作者
Singh, Nirupma [1 ]
Bhatnagar, Sonika [1 ,2 ]
机构
[1] Netaji Subhas Inst Technol, Dept Biotechnol, New Delhi 110078, India
[2] Netaji Subhas Inst Technol, Dept Biol Sci & Engn, Computat & Struct Biol Lab, New Delhi 110078, India
关键词
Machine learning; Drug targets; Eigenvector Centrality; PROTEIN-PROTEIN INTERACTIONS; IDENTIFICATION; HEART; MYOCARDITIS; CENTRALITY; INHIBITORS; INFECTION; FEATURES; MIMICRY; BIOLOGY;
D O I
10.1002/minf.202100115
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Host-pathogen interactions play a crucial role in invasion, infection, and induction of immune response in humans. In this work, four machine learning algorithms, namely Logistic regression, K-nearest neighbor, Support Vector Machine, and Random Forest were implemented for the classification of drug targets. The algorithms were trained using 3400 hosts and 3800 pathogen drug and non-drug target proteins as learning instances. For each protein, 68 pathogen and 73 host features were computed that included sequence, structure, biological and host-pathogen network centrality characteristics. The Random Forest classifier model achieved the best accuracy after 10-fold cross-validation. 99 % accuracy was achieved with a ROC-AUC score of 0.99 +/- 0.01 for both pathogen and host training sets. The Eigenvector Centrality of host-pathogen interactions and host-host interactions was the top feature in performing classification of pathogen and host targets respectively. Other features important for classification were the presence of catalytic and binding sites, low instability/aliphatic index, and cellular location. The Random Forest classifier was then used for prediction of drug targets involved in Microbe Associated Cardiovascular Diseases. 331 host and 743 pathogen proteins were predicted as drug targets by the random forest model and can be validated experimentally for therapeutic intervention in Microbe Associated Cardiovascular Diseases.
引用
收藏
页数:17
相关论文
共 104 条
[1]   Network-Guided Discovery of Influenza Virus Replication Host Factors [J].
Ackerman, Emily E. ;
Kawakami, Eiryo ;
Katoh, Manami ;
Watanabe, Tokiko ;
Watanabe, Shinji ;
Tomita, Yuriko ;
Lopes, Tiago J. ;
Matsuoka, Yukiko ;
Kitano, Hiroaki ;
Shoemaker, Jason E. ;
Kawaoka, Yoshihiro .
MBIO, 2018, 9 (06) :1-14
[2]   Recombinant Staphylococcal Enterotoxin Type A Stimulate Antitumoral Cytokines [J].
Agheli, Reza ;
Emkanian, Bijan ;
Halabian, Raheleh ;
Mehrabadi, Jalil Fallah ;
Fooladi, Abbas Ali Imani .
TECHNOLOGY IN CANCER RESEARCH & TREATMENT, 2017, 16 (01) :125-132
[3]   Network biology discovers pathogen contact points in host protein-protein interactomes [J].
Ahmed, Hadia ;
Howton, T. C. ;
Sun, Yali ;
Weinberger, Natascha ;
Belkhadir, Youssef ;
Mukhtar, M. Shahid .
NATURE COMMUNICATIONS, 2018, 9
[4]   Analyzing a co-occurrence gene-interaction network to identify disease-gene association [J].
Al-Aamri, Amira ;
Taha, Kamal ;
Al-Hammadi, Yousof ;
Maalouf, Maher ;
Homouz, Dirar .
BMC BIOINFORMATICS, 2019, 20 (1)
[5]   Myocarditis: an expected health hazard associated with water resources contaminated with Coxsackie viruses type B [J].
Ali, MA ;
Abdel-Dayem, TMK .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL HEALTH RESEARCH, 2003, 13 (03) :261-270
[6]   Evaluation of k-nearest neighbour classifier performance for heterogeneous data sets [J].
Ali, Najat ;
Neagu, Daniel ;
Trundle, Paul .
SN APPLIED SCIENCES, 2019, 1 (12)
[7]  
Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkw1099, 10.1093/nar/gkh131]
[8]   Small-Molecule Inhibitors of Protein-Protein Interactions: Progressing toward the Reality [J].
Arkin, Michelle R. ;
Tang, Yinyan ;
Wells, James A. .
CHEMISTRY & BIOLOGY, 2014, 21 (09) :1102-1114
[9]   Network Systems Biology for Drug Discovery [J].
Arrell, D. K. ;
Terzic, A. .
CLINICAL PHARMACOLOGY & THERAPEUTICS, 2010, 88 (01) :120-125
[10]   A systematic survey of centrality measures for protein-protein interaction networks [J].
Ashtiani, Minoo ;
Salehzadeh-Yazdi, Ali ;
Razaghi-Moghadam, Zahra ;
Hennig, Holger ;
Wolkenhauer, Olaf ;
Mirzaie, Mehdi ;
Jafari, Mohieddin .
BMC SYSTEMS BIOLOGY, 2018, 12