Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence

被引:18
作者
Hou, Tianling [1 ,2 ,3 ]
Bian, Yuemin [1 ,2 ,3 ]
McGuire, Terence [1 ,2 ,3 ]
Xie, Xiang-Qun [1 ,2 ,4 ,5 ]
机构
[1] Univ Pittsburgh, Sch Pharm, Dept Pharmaceut Sci, Computat Chem Genom Screen CCGS Ctr, Pittsburgh, PA 15261 USA
[2] Univ Pittsburgh, Sch Pharm, Pharmacometr Syst Pharmacol Program, 718 Salk Hall, Pittsburgh, PA 15261 USA
[3] Univ Pittsburgh, NIH Natl Ctr Excellence Computat Drug Abuse Res C, Pittsburgh, PA 15261 USA
[4] Univ Pittsburgh, Dept Computat Biol, Drug Discovery Inst, Pittsburgh, PA 15261 USA
[5] Univ Pittsburgh, Dept Struct Biol, Drug Discovery Inst, Pittsburgh, PA 15261 USA
关键词
GPCRs; allosteric regulation; machine learning; finger-prints; drug design; DRUG DISCOVERY; ANTAGONISTS; NETWORK;
D O I
10.3390/biom11060870
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
G-protein-coupled receptors (GPCRs) are the largest and most diverse group of cell surface receptors that respond to various extracellular signals. The allosteric modulation of GPCRs has emerged in recent years as a promising approach for developing target-selective therapies. Moreover, the discovery of new GPCR allosteric modulators can greatly benefit the further understanding of GPCR cell signaling mechanisms. It is critical but also challenging to make an accurate distinction of modulators for different GPCR groups in an efficient and effective manner. In this study, we focus on an 11-class classification task with 10 GPCR subtype classes and a random compounds class. We used a dataset containing 34,434 compounds with allosteric modulators collected from classical GPCR families A, B, and C, as well as random drug-like compounds. Six types of machine learning models, including support vector machine, naive Bayes, decision tree, random forest, logistic regression, and multilayer perceptron, were trained using different combinations of features including molecular descriptors, Atom-pair fingerprints, MACCS fingerprints, and ECFP6 fingerprints. The performances of trained machine learning models with different feature combinations were closely investigated and discussed. To the best of our knowledge, this is the first work on the multi-class classification of GPCR allosteric modulators. We believe that the classification models developed in this study can be used as simple and accurate tools for the discovery and development of GPCR allosteric modulators.
引用
收藏
页数:16
相关论文
共 51 条
[1]   AlphaFold at CASP13 [J].
AlQuraishi, Mohammed .
BIOINFORMATICS, 2019, 35 (22) :4862-4865
[2]   Machine Learning to Identify Flexibility Signatures of Class A GPCR Inhibition [J].
Bemister-Buffington, Joseph ;
Wolf, Alex J. ;
Raschka, Sebastian ;
Kuhn, Leslie A. .
BIOMOLECULES, 2020, 10 (03)
[3]   Generative chemistry: drug discovery with deep learning generative models [J].
Bian, Yuemin ;
Xie, Xiang-Qun .
JOURNAL OF MOLECULAR MODELING, 2021, 27 (03)
[4]   Covalent allosteric modulation: An emerging strategy for GPCRs drug discovery [J].
Bian, Yuemin ;
Jun, Jaden Jungho ;
Cuyler, Jacob ;
Xie, Xiang-Qun .
EUROPEAN JOURNAL OF MEDICINAL CHEMISTRY, 2020, 206
[5]   Deep Convolutional Generative Adversarial Network (dcGAN) Models for Screening and Design of Small Molecules Targeting Cannabinoid Receptors [J].
Bian, Yuemin ;
Wang, Junmei ;
Jun, Jaden Jungho ;
Xie, Xiang-Qun .
MOLECULAR PHARMACEUTICS, 2019, 16 (11) :4451-4460
[6]   Prediction of Orthosteric and Allosteric Regulations on Cannabinoid Receptors Using Supervised Machine Learning Classifiers [J].
Bian, Yuemin ;
Jing, Yankang ;
Wang, Lirong ;
Ma, Shifan ;
Jun, Jaden Jungho ;
Xie, Xiang-Qun .
MOLECULAR PHARMACEUTICS, 2019, 16 (06) :2605-2615
[7]   G-protein-coupled receptors: From classical modes of modulation to allosteric mechanisms [J].
Bridges, Thomas M. ;
Lindsley, Craig W. .
ACS CHEMICAL BIOLOGY, 2008, 3 (09) :530-541
[8]   ATOM PAIRS AS MOLECULAR-FEATURES IN STRUCTURE ACTIVITY STUDIES - DEFINITION AND APPLICATIONS [J].
CARHART, RE ;
SMITH, DH ;
VENKATARAGHAVAN, R .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (02) :64-73
[9]  
Carletta J, 1996, COMPUT LINGUIST, V22, P249
[10]   Impact of GPCR Structures on Drug Discovery [J].
Congreve, Miles ;
de Graaf, Chris ;
Swain, Nigel A. ;
Tate, Christopher G. .
CELL, 2020, 181 (01) :81-91