ClassyPose: A Machine-Learning Classification Model for Ligand Pose Selection Applied to Virtual Screening in Drug Discovery

Cited by: 2
Authors
Tran-Nguyen, Viet-Khoa [1]
Camproux, Anne-Claude [1]
Taboureau, Olivier [1]
Affiliations
[1] Univ Paris Cite, CNRS, UMR8251, INSERM U1133, Unite Biol Fonct & Adaptat, F-75013 Paris, France
Keywords
good pose probability; machine-learning; PLEC fingerprints; pose classification; pose selection; support vector machine; virtual screening; MOLECULAR DOCKING; SCORING FUNCTIONS; BINDING-AFFINITY; PREDICTION; SHAPE; PERFORMANCE; INHIBITORS; COMPLEXES; DATABASE; ACCURACY
DOI
10.1002/aisy.202400238
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Determining the target-bound conformation of a drug-like molecule is a crucial step in drug design, as it affects the outcome of virtual screening (VS) and paves the way for hit-to-lead and lead optimization. While most docking programs usually manage to produce at least one near-native pose for a bioactive molecule inside its binding pocket, their integrated classical scoring functions (SFs) generally fail to prioritize this pose. Many studies have been carried out to tackle this SF problem, offering multiple pose refinement and/or classification methods, albeit with limitations. This study presents a new support vector machine model for pose classification, called "ClassyPose", which predicts the probability that a receptor-bound ligand conformation is near-native, without any additional pose optimization step. Trained on protein-ligand extended connectivity features extracted from over 21,600 crystal and docking poses of diverse ligands, this model outperformed other machine-learning algorithms and three existing SFs in terms of docking power, identifying the native ligand pose as the top-ranked solution for more than 90% of entries in two test sets. It also achieved high specificity (above 0.96) and improved VS performance when used for pose selection. This efficient, user-friendly tool and all related data are available at https://github.com/vktrannguyen/Classy_Pose.
ClassyPose is a new support vector machine model for correct ligand pose selection. Trained on protein-ligand features extracted from native and redocked binding modes of diverse ligands, it has strong docking power, achieves high specificity, and improves virtual screening performance when used as a pose selection tool. The code and all data are user-friendly and available free of charge.
(c) 2024 WILEY-VCH GmbH
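The two headline metrics in the abstract, docking power (how often the top-ranked pose of an entry is the native one) and specificity, can be illustrated with a minimal sketch. The pose records, the probabilities, the 0.5 decision cut-off, and the function names below are assumptions made up for illustration; they are not data, settings, or code from the paper or its repository.

```python
from collections import defaultdict

# Hypothetical records: (entry_id, predicted_good_pose_probability, is_near_native).
# All values are invented for illustration; they are not results from the paper.
POSES = [
    ("entry_A", 0.92, True),
    ("entry_A", 0.40, False),
    ("entry_A", 0.15, False),
    ("entry_B", 0.35, True),
    ("entry_B", 0.70, False),
    ("entry_B", 0.10, False),
    ("entry_C", 0.81, True),
    ("entry_C", 0.22, False),
]


def top1_success_rate(poses):
    """Fraction of entries whose highest-probability pose is near-native."""
    by_entry = defaultdict(list)
    for entry_id, prob, is_native in poses:
        by_entry[entry_id].append((prob, is_native))
    # An entry counts as a hit when its top-ranked pose carries the True label.
    hits = sum(
        1 for members in by_entry.values()
        if max(members, key=lambda m: m[0])[1]
    )
    return hits / len(by_entry)


def specificity(poses, threshold=0.5):
    """TN / (TN + FP): share of non-native poses predicted as not good.

    The 0.5 threshold is an assumed cut-off, not one taken from the paper.
    """
    negatives = [prob for _, prob, is_native in poses if not is_native]
    true_negatives = sum(1 for prob in negatives if prob < threshold)
    return true_negatives / len(negatives)


if __name__ == "__main__":
    print(f"top-1 success rate: {top1_success_rate(POSES):.2f}")
    print(f"specificity: {specificity(POSES):.2f}")
```

Grouping poses by entry before ranking matters here: docking power is measured per protein-ligand entry, whereas specificity is computed over all non-native poses pooled together.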
Pages: 13