Comparative Study for Prediction of Low and High Plasma Protein Binding Drugs by Various Machine Learning-Based Classification Algorithms

Times cited: 0
Authors
Govil, Sumit [1]
Tripathi, Sandesh [2]
Kumar, Amit [1]
Shrivastava, Divya [1]
Kumar, Shailesh [3]
Affiliations
[1] Jaipur Natl Univ, Sch Life Sci, Jaipur 302025, Rajasthan, India
[2] Birla Inst Appl Sci, Naini Tal 263136, Uttarakhand, India
[3] Natl Ctr Cell Sci, NCCS Complex,Pune Univ Campus, Pune 411007, Maharashtra, India
Keywords
Drug Discovery; Machine Learning; Multilayer Perceptron; Pharmacokinetic Plasma Protein Binding; Random Forest;
DOI
10.18311/ajprhc/2021/28497
Chinese Library Classification (CLC) number
R9 [Pharmacy];
Discipline classification code
1007;
Abstract
In drug discovery, most drug candidates fail at an early stage because of their pharmacokinetic behavior in the body. Early prediction and screening of pharmacokinetic properties can reduce the time and investment needed for lead discovery. Plasma protein binding is one such property and plays a vital role in drug discovery and development. This study develops a computational model, built in WEKA, that classifies drugs as Low Plasma Protein Binding (LPPB) or High Plasma Protein Binding (HPPB) using machine learning methods, for early screening of molecules. Plasma protein binding data were collated from the DrugBank database, where 617 drug candidates were found to interact with plasma proteins; from these, equal proportions of high and low plasma protein binding drugs were extracted to build a training set of approximately 300 drugs. The machine learning algorithms were trained on the training set and evaluated on a test set. Four classification algorithms were compared, namely Naive Bayes, the Instance-Based Learner (IBk), the multilayer perceptron, and random forest, to determine the best model based on accuracy. The random forest model outperformed the other classification methods, with an accuracy of 99.67% and a kappa value of 0.9933 reported for the training set and the test set, and can predict plasma protein binding capacity for the drugs in the given data set using the WEKA tool.
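The relationship between the reported accuracy and kappa value can be illustrated with a small worked example. The sketch below computes accuracy and Cohen's kappa from a confusion matrix. The matrix itself is an assumption: it is one hypothetical two-class outcome (150 drugs per class, a single misclassification) that happens to be consistent with the reported figures, not data taken from the study. The function name `accuracy_and_kappa` is likewise illustrative.

```python
def accuracy_and_kappa(cm):
    """Return (accuracy, Cohen's kappa) for a square confusion matrix.

    cm[i][j] is the number of items of true class i predicted as class j.
    """
    n_classes = len(cm)
    total = sum(sum(row) for row in cm)

    # Observed agreement: fraction of items on the diagonal.
    observed = sum(cm[i][i] for i in range(n_classes)) / total

    # Chance agreement: product of matching row and column marginals.
    row_totals = [sum(row) for row in cm]
    col_totals = [sum(cm[i][j] for i in range(n_classes)) for j in range(n_classes)]
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / total**2

    # Kappa rescales observed agreement relative to chance agreement.
    kappa = (observed - expected) / (1 - expected)
    return observed, kappa


# Hypothetical matrix (assumption, not from the study):
# rows = true class (LPPB, HPPB), columns = predicted class.
hypothetical_cm = [
    [150, 0],   # all 150 LPPB drugs predicted as LPPB
    [1, 149],   # one HPPB drug misclassified as LPPB
]

acc, kappa = accuracy_and_kappa(hypothetical_cm)
print(f"accuracy = {acc:.4f}, kappa = {kappa:.4f}")
# prints: accuracy = 0.9967, kappa = 0.9933
```

With balanced classes the chance agreement is 0.5, so kappa is roughly twice the margin by which accuracy exceeds 50%. This is why an accuracy of 99.67% on about 300 balanced drugs corresponds to a kappa near 0.9933.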
Pages: 312-320
Page count: 9