MLViS: A Web Tool for Machine Learning-Based Virtual Screening in Early-Phase of Drug Discovery and Development

被引:35
作者
Korkmaz, Selcuk [1 ]
Zararsiz, Gokmen [1 ]
Goksuluk, Dincer [1 ]
机构
[1] Hacettepe Univ, Fac Med, TR-06100 Ankara, Turkey
关键词
SUPPORT VECTOR MACHINES; CLASSIFICATION; PREDICTION; INHIBITORS; FRAMEWORK; MODEL;
D O I
10.1371/journal.pone.0124600
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Virtual screening is an important step in early-phase of drug discovery process. Since there are thousands of compounds, this step should be both fast and effective in order to distinguish drug-like and nondrug-like molecules. Statistical machine learning methods are widely used in drug discovery studies for classification purpose. Here, we aim to develop a new tool, which can classify molecules as drug-like and nondrug-like based on various machine learning methods, including discriminant, tree-based, kernel-based, ensemble and other algorithms. To construct this tool, first, performances of twenty-three different machine learning algorithms are compared by ten different measures, then, ten best performing algorithms have been selected based on principal component and hierarchical cluster analysis results. Besides classification, this application has also ability to create heat map and dendrogram for visual inspection of the molecules through hierarchical cluster analysis. Moreover, users can connect the PubChem database to download molecular information and to create two-dimensional structures of compounds. This application is freely available through www.biosoft.hacettepe.edu.tr/MLViS/.
引用
收藏
页数:15
相关论文
共 45 条
[1]   Ligand-Based Virtual Screening Using Bayesian Networks [J].
Abdo, Ammar ;
Chen, Beining ;
Mueller, Christoph ;
Salim, Naomie ;
Willett, Peter .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (06) :1012-1020
[2]   Ranking Chemical Structures for Drug Discovery: A New Machine Learning Approach [J].
Agarwal, Shivani ;
Dugar, Deepak ;
Sengupta, Shiladitya .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (05) :716-731
[3]   Can we learn to distinguish between "drug-like" and "nondrug-like" molecules? [J].
Ajay ;
Walters, WP ;
Murcko, MA .
JOURNAL OF MEDICINAL CHEMISTRY, 1998, 41 (18) :3314-3324
[4]   Applying support vector machines to imbalanced datasets [J].
Akbani, R ;
Kwek, S ;
Japkowicz, N .
MACHINE LEARNING: ECML 2004, PROCEEDINGS, 2004, 3201 :39-50
[5]  
[Anonymous], SHIN WEB APPL FRAM R
[6]  
[Anonymous], ELEMENTS STAT LEARNI
[7]  
[Anonymous], 2020, R PACKAGE VERSION 6
[8]  
[Anonymous], 2000, NATURE STAT LEARNING, DOI DOI 10.1007/978-1-4757-3264-1
[9]  
Batista G. E., 2004, ACM SIGKDD Explor. Newslett., P20, DOI [10.1145/1007730.1007735, DOI 10.1145/1007730.1007735]
[10]  
Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1023/A:1018054314350