ClassyPose: A Machine-Learning Classification Model for Ligand Pose Selection Applied to Virtual Screening in Drug Discovery

Cited by: 1
Authors
Tran-Nguyen, Viet-Khoa [1]
Camproux, Anne-Claude [1]
Taboureau, Olivier [1]
Affiliations
[1] Univ Paris Cite, CNRS, UMR8251, INSERM U1133, Unite Biol Fonct & Adaptat, F-75013 Paris, France
Keywords
good pose probability; machine-learning; PLEC fingerprints; pose classification; pose selection; support vector machine; virtual screening; molecular docking; scoring functions; binding affinity; prediction; shape; performance; inhibitors; complexes; database; accuracy
DOI
10.1002/aisy.202400238
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Determining the target-bound conformation of a drug-like molecule is a crucial step in drug design, as it affects the outcome of virtual screening (VS) and paves the way for hit-to-lead and lead optimization. While most docking programs manage to produce at least one near-native pose for a bioactive molecule inside its binding pocket, their integrated classical scoring functions (SFs) generally fail to prioritize this pose. Many studies have been carried out to tackle this SF problem, offering multiple pose refinement and/or classification methods, albeit with limitations. This study presents a new support vector machine model for pose classification, called "ClassyPose", which predicts the probability that a receptor-bound ligand conformation could be near-native, without any additional pose optimization step. Trained on protein-ligand extended connectivity features extracted from over 21 600 crystal and docking poses of diverse ligands, this model outperformed other machine-learning algorithms and three existing SFs in terms of docking power, identifying the native ligand pose as the top-ranked solution for more than 90% of entries in two test sets. It also achieved high specificity (above 0.96), and improved VS performance when used for pose selection. This efficient, user-friendly tool and all related data are available at https://github.com/vktrannguyen/Classy_Pose.

ClassyPose is a new support vector machine model for correct ligand pose selection. Trained on protein-ligand features extracted from native and redocked binding modes of diverse ligands, it has strong docking power, achieves high specificity, and improves virtual screening performance when used as a pose selection tool. The code and all data are user-friendly and available free of charge.

© 2024 Wiley-VCH GmbH
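The two headline metrics in the abstract, docking power (how often the top-ranked pose of an entry is a good one) and specificity, can be illustrated with a minimal sketch. This is not the ClassyPose code: the function names, the 0.5 probability threshold, the entry IDs, and the toy scores below are all assumptions made for illustration, and each pose is assumed to already carry a good/bad label and a predicted good-pose probability.

```python
def top_pose_success_rate(poses):
    """Fraction of entries whose highest-scored pose is labeled good.

    poses: iterable of (entry_id, score, is_good) tuples, where score is a
    predicted good-pose probability (higher means more likely near-native).
    """
    best = {}  # entry_id -> (best score seen so far, label of that pose)
    for entry_id, score, is_good in poses:
        if entry_id not in best or score > best[entry_id][0]:
            best[entry_id] = (score, is_good)
    if not best:
        raise ValueError("no poses given")
    return sum(1 for _, good in best.values() if good) / len(best)


def specificity(poses, threshold=0.5):
    """TN / (TN + FP): share of bad poses scored below the threshold.

    The 0.5 default is an illustrative assumption, not a value from the paper.
    """
    bad_scores = [score for _, score, is_good in poses if not is_good]
    if not bad_scores:
        raise ValueError("no bad poses given")
    true_negatives = sum(1 for s in bad_scores if s < threshold)
    return true_negatives / len(bad_scores)


# Hypothetical toy data: two entries with three poses each.
poses = [
    ("entry_A", 0.91, True),
    ("entry_A", 0.40, False),
    ("entry_A", 0.12, False),
    ("entry_B", 0.35, True),
    ("entry_B", 0.62, False),
    ("entry_B", 0.08, False),
]

print(top_pose_success_rate(poses))  # entry_A succeeds, entry_B does not
print(specificity(poses))            # 3 of 4 bad poses fall below 0.5
```

In this toy set the top-ranked pose is good for one of the two entries, and three of the four bad poses score below the threshold; real evaluations would use the labeled test sets referenced in the abstract.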
Pages: 13