Machine Learning Models to Predict Protein-Protein Interaction Inhibitors

被引:8
作者
Diaz-Eufracio, Barbara, I [1 ]
Medina-Franco, Jose L. [1 ]
机构
[1] Univ Nacl Autonoma Mexico, Sch Chem, Dept Pharm, DIFACQUIM Res Grp, Ave Univ 3000, Mexico City 04510, DF, Mexico
关键词
chemoinformatics; computer-aided drug design; drug discovery; machine learning; protein-protein interaction; INTERACTION MODULATORS; DRUG DISCOVERY;
D O I
10.3390/molecules27227986
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein-protein interaction (PPI) inhibitors have an increasing role in drug discovery. It is hypothesized that machine learning (ML) algorithms can classify or identify PPI inhibitors. This work describes the performance of different algorithms and molecular fingerprints used in chemoinformatics to develop a classification model to identify PPI inhibitors making the codes freely available to the community, particularly the medicinal chemistry research groups working with PPI inhibitors. We found that classification algorithms have different performances according to various features employed in the training process. Random forest (RF) models with the extended connectivity fingerprint radius 2 (ECFP4) had the best classification abilities compared to those models trained with ECFP6 o MACCS keys (166-bits). In general, logistic regression (LR) models had lower performance metrics than RF models, but ECFP4 was the representation most appropriate for LR. ECFP4 also generated models with high-performance metrics with support vector machines (SVM). We also constructed ensemble models based on the top-performing models. As part of this work and to help non-computational experts, we developed a pipeline code freely available.
引用
收藏
页数:10
相关论文
共 36 条
[1]   HIPPIE v2.0: enhancing meaningfulness and reliability of protein-protein interaction networks [J].
Alanis-Lobato, Gregorio ;
Andrade-Navarro, Miguel A. ;
Schaefer, Martin H. .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D408-D414
[2]  
[Anonymous], 2018, Hands-on Machine Learning with Scikit-Learn and Tensorflow
[3]   KNIME:: The Konstanz Information Miner [J].
Berthold, Michael R. ;
Cebron, Nicolas ;
Dill, Fabian ;
Gabriel, Thomas R. ;
Koetter, Tobias ;
Meinl, Thorsten ;
Ohl, Peter ;
Sieb, Christoph ;
Thiel, Kilian ;
Wiswedel, Bernd .
DATA ANALYSIS, MACHINE LEARNING AND APPLICATIONS, 2008, :319-326
[4]   Fr-PPIChem: An Academic Compound Library Dedicated to Protein- Protein Interactions [J].
Bosc, Nicolas ;
Muller, Christophe ;
Hoffer, Laurent ;
Lagorce, David ;
Bourg, Stephane ;
Derviaux, Carine ;
Gourdel, Marie-Edith ;
Rain, Jean-Christophe ;
Miller, Thomas W. ;
Villoutreix, Bruno O. ;
Miteva, Maria ;
Bonnet, Pascal ;
Morelli, Xavier ;
Sperandio, Olivier ;
Roche, Philippe .
ACS CHEMICAL BIOLOGY, 2020, 15 (06) :1566-1574
[5]   Design of Drug-Like Protein-Protein Interaction Stabilizers Guided By Chelation-Controlled Bioactive Conformation Stabilization [J].
Bosica, Francesco ;
Andrei, Sebastian A. ;
Neves, Joao Filipe ;
Brandt, Peter ;
Gunnarsson, Anders ;
Landrieu, Isabelle ;
Ottmann, Christian ;
O'Mahony, Gavin .
CHEMISTRY-A EUROPEAN JOURNAL, 2020, 26 (31) :7131-7139
[6]   Advancing Drug Discovery via Artificial Intelligence [J].
Chan, H. C. Stephen ;
Shan, Hanbin ;
Dahoun, Thamani ;
Vogel, Horst ;
Yuan, Shuguang .
TRENDS IN PHARMACOLOGICAL SCIENCES, 2019, 40 (08) :592-604
[7]   Exploring the chemical space of protein-protein interaction inhibitors through machine learning [J].
Choi, Jiwon ;
Yun, Jun Seop ;
Song, Hyeeun ;
Kim, Nam Hee ;
Kim, Hyun Sil ;
Yook, Jong In .
SCIENTIFIC REPORTS, 2021, 11 (01)
[8]   Applications of In Silico Methods for Design and Development of Drugs Targeting Protein-Protein Interactions [J].
Cicaloni, Vittoria ;
Trezza, Alfonso ;
Pettini, Francesco ;
Spiga, Ottavia .
CURRENT TOPICS IN MEDICINAL CHEMISTRY, 2019, 19 (07) :534-554
[9]  
clinicaltrials.gov, A study of idasanutlin with cytarabine versus cytarabine plus placebo in participants with relapsed or refractory acute myeloid leukemia
[10]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297