Predicting Inhibitors for Multidrug Resistance Associated Protein-2 Transporter by Machine Learning Approach

被引:9
作者
Kharangarh, Sahil [1 ]
Sandhu, Hardeep [1 ]
Tangadpalliwar, Sujit [1 ]
Garg, Prabha [1 ]
机构
[1] Natl Inst Pharmaceut Educ & Res, Dept Pharmacoinformat, Sas Nagar 160062, Punjab, India
关键词
Multidrug Resistance Associated Protein-2 (MRP2); Machine Learning; Support Vector Machine (SVM); Random Forest (RF); k- Nearest Neighbor (k-NN); model development; ACTIVE EFFLUX TRANSPORTER; DRUG-RESISTANCE; P-GLYCOPROTEIN; CANCER; MRP2; BIOINFORMATICS; EXPRESSION; DATABASE; ABCC2; GENE;
D O I
10.2174/1386207321666181024104822
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The efflux transporter multidrug resistance associated protein-2 belongs to ATP-binding cassette superfamily which plays an important role in multidrug resistance and drug-drug interactions. Efflux transporters are considered to be important targets for increasing the efficacy of drugs and importance of computational study of efflux transporters for predicting substrates, non-substrates, inhibitors and non-inhibitors is well documented. Previous work on predictive models for inhibitors of multidrug resistance associated Protein-2 efflux transporter showed that machine learning methods produced good results. Objective: The aim of the present work was to develop a machine learning predictive model to classify inhibitors and non-inhibitors of multidrug resistance associated protein-2 transporter using a well refined dataset. Method: In this study, the various algorithms of machine learning were used to develop the predictive models i.e. support vector machine, random forest and k-nearest neighbor. The methods like variance threshold, SelectKBest, random forest, and recursive feature elimination were used to select the features generated by PyDPI. A total of 239 molecules consisting of 124 inhibitors and 115 non-inhibitors were used for model development Results: The best multidrug resistance associated protein-2 inhibitor model showed prediction accuracies of 0.76, 0.72 and 0.79 for training, 5-fold cross-validation and external sets, respectively. Conclusion: It was observed that support vector machine model built on features selected using recursive feature elimination method shows the best performance. The developed model can be used in the early stages of drug discovery for identifying the inhibitors of multidrug resistance associated protein-2 efflux transporter.
引用
收藏
页码:557 / 566
页数:10
相关论文
共 26 条
[1]  
Arts HJG, 1999, CLIN CANCER RES, V5, P2798
[2]   Classification of Breast Cancer Resistant Protein (BCRP) Inhibitors and Non-Inhibitors Using Machine Learning Approaches [J].
Belekar, Vilas ;
Lingineni, Karthik ;
Garg, Prabha .
COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2015, 18 (05) :476-485
[3]   A family of drug transporters: The multidrug resistance-associated proteins [J].
Borst, P ;
Evers, R ;
Kool, M ;
Wijnholds, J .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2000, 92 (16) :1295-1302
[4]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   PyDPI: Freely Available Python']Python Package for Chemoinformatics, Bioinformatics, and Chemogenomics Studies [J].
Cao, Dong-Sheng ;
Liang, Yi-Zeng ;
Yan, Jun ;
Tan, Gui-Shan ;
Xu, Qing-Song ;
Liu, Shao .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (11) :3086-3096
[7]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[8]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[9]  
Demel MA, 2008, EXPERT OPIN DRUG MET, V4, P1167, DOI [10.1517/17425255.4.9.1167, 10.1517/17425255.4.9.1167 ]
[10]  
Dudani S. A., 1976, IEEE Transactions on Systems, Man and Cybernetics, VSMC-6, P325, DOI 10.1109/TSMC.1976.5408784