ADMET evaluation in drug discovery. 20. Prediction of breast cancer resistance protein inhibition through machine learning

被引:0
作者
Dejun Jiang
Tailong Lei
Zhe Wang
Chao Shen
Dongsheng Cao
Tingjun Hou
机构
[1] Zhejiang University,Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences
[2] Central South University,Xiangya School of Pharmaceutical Sciences
来源
Journal of Cheminformatics | / 12卷
关键词
Breast cancer resistance protein; Multi-drug resistance; Machine learning; Extreme gradient boosting; Ensemble learning; ADMET; Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
Breast cancer resistance protein (BCRP/ABCG2), an ATP-binding cassette (ABC) efflux transporter, plays a critical role in multi-drug resistance (MDR) to anti-cancer drugs and drug–drug interactions. The prediction of BCRP inhibition can facilitate evaluating potential drug resistance and drug–drug interactions in early stage of drug discovery. Here we reported a structurally diverse dataset consisting of 1098 BCRP inhibitors and 1701 non-inhibitors. Analysis of various physicochemical properties illustrates that BCRP inhibitors are more hydrophobic and aromatic than non-inhibitors. We then developed a series of quantitative structure–activity relationship (QSAR) models to discriminate between BCRP inhibitors and non-inhibitors. The optimal feature subset was determined by a wrapper feature selection method named rfSA (simulated annealing algorithm coupled with random forest), and the classification models were established by using seven machine learning approaches based on the optimal feature subset, including a deep learning method, two ensemble learning methods, and four classical machine learning methods. The statistical results demonstrated that three methods, including support vector machine (SVM), deep neural networks (DNN) and extreme gradient boosting (XGBoost), outperformed the others, and the SVM classifier yielded the best predictions (MCC = 0.812 and AUC = 0.958 for the test set). Then, a perturbation-based model-agnostic method was used to interpret our models and analyze the representative features for different models. The application domain analysis demonstrated the prediction reliability of our models. Moreover, the important structural fragments related to BCRP inhibition were identified by the information gain (IG) method along with the frequency analysis. In conclusion, we believe that the classification models developed in this study can be regarded as simple and accurate tools to distinguish BCRP inhibitors from non-inhibitors in drug design and discovery pipelines.[graphic not available: see fulltext]
引用
收藏
相关论文
共 654 条
  • [1] Ni Z(2010)Structure and function of the human breast cancer resistance protein (BCRP/ABCG2) Curr Drug Metab 11 603-617
  • [2] Bikadi Z(2005)Role of the breast cancer resistance protein (ABCG2) in drug transport AAPS J 7 E118-E133
  • [3] Rosenberg MF(2003)Multidrug resistance mediated by the breast cancer resistance protein BCRP (ABCG2) Oncogene 22 7340-7358
  • [4] Mao Q(2015)Role of the breast cancer resistance protein (BCRP/ABCG2) in drug transport-an update AAPS J 17 65-82
  • [5] Mao QC(2015)Role of breast cancer resistance protein (BCRP) as active efflux transporter on blood–brain barrier (BBB) permeability Mol Divers 19 163-172
  • [6] Unadkat JD(2008)The role of ABC transporters in drug absorption, distribution, metabolism, excretion and toxicity (ADME-Tox) Drug Discov Today 13 379-393
  • [7] Doyle LA(2006)Role of ABCG2/BCRP in biology and medicine Annu Rev Pharmacol Toxicol 46 381-410
  • [8] Ross DD(2009)Breast cancer resistance protein (BCRP/ABCG2): new inhibitors and QSAR studies by a 3D linear solvation energy approach Eur J Pharm Sci 38 39-46
  • [9] Mao Q(2002)Increased oral bioavailability of topotecan in combination with the breast cancer resistance protein and P-glycoprotein inhibitor GF120918 J Clin Oncol 20 2943-2950
  • [10] Unadkat JD(2004)Gefitinib enhances the antitumor activity and oral bioavailability of irinotecan in mice Cancer Res 64 7491-7499