Novel Big Data-Driven Machine Learning Models for Drug Discovery Application

被引:10
作者
Sripriya Akondi, Vishnu [1 ]
Menon, Vineetha [1 ]
Baudry, Jerome [2 ]
Whittle, Jana [2 ]
机构
[1] Univ Alabama Huntsville, Dept Comp Sci, Huntsville, AL 35899 USA
[2] Univ Alabama Huntsville, Dept Biol Sci, Huntsville, AL 35899 USA
关键词
drug discovery; class imbalance; machine learning; protein conformation selecton; drug candidates; ADORA2A; OPRK1;
D O I
10.3390/molecules27030594
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Most contemporary drug discovery projects start with a 'hit discovery' phase where small chemicals are identified that have the capacity to interact, in a chemical sense, with a protein target involved in a given disease. To assist and accelerate this initial drug discovery process, 'virtual docking calculations' are routinely performed, where computational models of proteins and computational models of small chemicals are evaluated for their capacities to bind together. In cutting-edge, contemporary implementations of this process, several conformations of protein targets are independently assayed in parallel 'ensemble docking' calculations. Some of these protein conformations, a minority of them, will be capable of binding many chemicals, while other protein conformations, the majority of them, will not be able to do so. This fact that only some of the conformations accessible to a protein will be 'selected' by chemicals is known as 'conformational selection' process in biology. This work describes a machine learning approach to characterize and identify the properties of protein conformations that will be selected (i.e., bind to) chemicals, and classified as potential binding drug candidates, unlike the remaining non-binding drug candidate protein conformations. This work also addresses the class imbalance problem through advanced machine learning techniques that maximize the prediction rate of potential protein molecular conformations for the test case proteins ADORA2A (Adenosine A2a Receptor) and OPRK1 (Opioid Receptor Kappa 1), and subsequently reduces the failure rates and hastens the drug discovery process.
引用
收藏
页数:22
相关论文
共 21 条
[1]   Enhanced Sampling in Molecular Dynamics Using Metadynamics, Replica-Exchange, and Temperature-Acceleration [J].
Abrams, Cameron ;
Bussi, Giovanni .
ENTROPY, 2014, 16 (01) :163-199
[2]  
Akondi VS, 2019, IEEE INT C BIOINFORM, P2771, DOI 10.1109/BIBM47256.2019.8983213
[3]   Ensemble Docking in Drug Discovery [J].
Amaro, Rommie E. ;
Baudry, Jerome ;
Chodera, John ;
Demir, Ozlem ;
McCammon, J. Andrew ;
Miao, Yinglong ;
Smith, Jeremy C. .
BIOPHYSICAL JOURNAL, 2018, 114 (10) :2271-2278
[4]  
[Anonymous], 2013, J Netw Innov Comput
[5]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[6]  
Chemical Computing Group U, 2019, MOL OP ENV MOE
[7]   Structure-based design of broadly protective group a streptococcal M protein-based vaccines [J].
Dale, James B. ;
Smeesters, Pierre R. ;
Courtney, Harry S. ;
Penfound, Thomas A. ;
Hohn, Claudia M. ;
Smith, Jeremy C. ;
Baudry, Jerome Y. .
VACCINE, 2017, 35 (01) :19-26
[8]   Predicting protein-protein interactions via multivariate mutual information of protein sequences [J].
Ding, Yijie ;
Tang, Jijun ;
Guo, Fei .
BMC BIOINFORMATICS, 2016, 17
[9]   Ensemble-based docking: From hit discovery to metabolism and toxicity predictions [J].
Evangelista, Wilfredo ;
Weir, Rebecca L. ;
Ellingson, Sally R. ;
Harris, Jason B. ;
Kapoor, Karan ;
Smith, Jeremy C. ;
Baudry, Jerome .
BIOORGANIC & MEDICINAL CHEMISTRY, 2016, 24 (20) :4928-4935
[10]   Drug-target interaction prediction via class imbalance-aware ensemble learning [J].
Ezzat, Ali ;
Wu, Min ;
Li, Xiao-Li ;
Kwoh, Chee-Keong .
BMC BIOINFORMATICS, 2016, 17