ClassyPose: A Machine-Learning Classification Model for Ligand Pose Selection Applied to Virtual Screening in Drug Discovery

被引:2
作者
Tran-Nguyen, Viet-Khoa [1 ]
Camproux, Anne-Claude [1 ]
Taboureau, Olivier [1 ]
机构
[1] Univ Paris Cite, CNRS, UMR8251, INSERM U1133,Unite Biol Fonct & Adaptat, F-75013 Paris, France
关键词
good pose probability; machine-learning; PLEC fingerprints; pose classification; pose selection; support vector machine; virtual screening; MOLECULAR DOCKING; SCORING FUNCTIONS; BINDING-AFFINITY; PREDICTION; SHAPE; PERFORMANCE; INHIBITORS; COMPLEXES; DATABASE; ACCURACY;
D O I
10.1002/aisy.202400238
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Determining the target-bound conformation of a drug-like molecule is a crucial step in drug design, as it affects the outcome of virtual screening (VS), and paves the way for hit-to-lead and lead optimization. While most docking programs usually manage to produce at least a near-native pose for a bioactive molecule inside its binding pocket, their integrated classical scoring functions (SFs) generally fail to prioritize this pose. Many studies have been carried out to tackle this SF problem, offering multiple pose refinement and/or classification methods, albeit with limitations. This study presents a new support vector machine model for pose classification, called "ClassyPose", which predicts the probability that a receptor-bound ligand conformation could be near-native, without any additional pose optimization step. Trained on protein-ligand extended connectivity features extracted from over 21 600 crystal and docking poses of diverse ligands, this model outperformed other machine-learning algorithms and three existing SFs in terms of docking power, identifying the native ligand pose as top-ranked solution for more than 90% of entries in two test sets. It also achieved high specificity (above 0.96), and improved VS performance when used for pose selection. This efficient, user-friendly tool and all related data are available at https://github.com/vktrannguyen/Classy_Pose. ClassyPose is a new support vector machine model for correct ligand pose selection. Trained on protein-ligand features extracted from native and redocked binding modes of diverse ligands, it has strong docking power, achieves high specificity, and improves virtual screening performance when used as a pose selection tool. The code and all data are user-friendly and available free of charge.image (c) 2024 WILEY-VCH GmbH
引用
收藏
页数:13
相关论文
共 50 条
[21]   Implementation of machine-learning classification in remote sensing: an applied review [J].
Maxwell, Aaron E. ;
Warner, Timothy A. ;
Fang, Fang .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2018, 39 (09) :2784-2817
[22]   Performance of machine-learning scoring functions in structure-based virtual screening [J].
Wojcikowski, Maciej ;
Ballester, Pedro J. ;
Siedlecki, Pawel .
SCIENTIFIC REPORTS, 2017, 7
[23]   A practical guide to machine-learning scoring for structure-based virtual screening [J].
Tran-Nguyen, Viet-Khoa ;
Junaid, Muhammad ;
Simeon, Saw ;
Ballester, Pedro J. .
NATURE PROTOCOLS, 2023, 18 (11) :3460-3511
[24]   Traditional and machine learning approaches in structure-based drug virtual screening [J].
Zhang, Hong ;
Gao, Yi Qin .
CHINESE JOURNAL OF CHEMICAL PHYSICS, 2024, 37 (02) :177-191
[25]   An integrated machine-learning model for soil category classification based on CPT [J].
Bai, Ruihan ;
Shen, Feng ;
Zhang, Zhiping .
MULTISCALE AND MULTIDISCIPLINARY MODELING EXPERIMENTS AND DESIGN, 2024, 7 (03) :2121-2146
[26]   Discovery of novel ULK1 inhibitors through machine learning-guided virtual screening and biological evaluation [J].
Kong, Miao-Miao ;
Wei, Tao ;
Liu, Bo ;
Xi, Zi-Xuan ;
Ding, Jun-Tao ;
Liu, Xin ;
Li, Ke ;
Qin, Tian-Li ;
Qian, Zhen-Yong ;
Wu, Wen-Can ;
Wu, Jian-Zhang ;
Li, Wu-Lan .
FUTURE MEDICINAL CHEMISTRY, 2024, 16 (18) :1821-1837
[27]   Sequential virtual screening collaborated with machine-learning strategies for the discovery of precise medicine against non-small cell lung cancer [J].
Thirunavukkarasu, Muthu Kumar ;
Veerappapillai, Shanthi ;
Karuppasamy, Ramanathan .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2024, 42 (02) :615-628
[28]   Design and discovery of POLQ helicase domain inhibitors by virtual screening and machine learning [J].
Feng, Wei ;
Liu, Lei ;
Li, Lingjun ;
Du, Peng ;
Yuan, Zhichen ;
Yuan, Jing ;
Huang, Changjiang ;
Qin, Zijian .
MEDICINAL CHEMISTRY RESEARCH, 2025, 34 (06) :1377-1391
[29]   Evaluation of different machine learning methods for ligand-based virtual screening [J].
R Kurczab ;
S Smusz ;
AJ Bojarski .
Journal of Cheminformatics, 3 (Suppl 1)
[30]   Multi-target-based polypharmacology prediction (mTPP): An approach using virtual screening and machine learning for multi-target drug discovery [J].
Liu, Kaiyang ;
Chen, Xi ;
Ren, Yue ;
Liu, Chaoqun ;
Lv, Tianyi ;
Liu, Ya'nan ;
Zhang, Yanling .
CHEMICO-BIOLOGICAL INTERACTIONS, 2022, 368