ClassyPose: A Machine-Learning Classification Model for Ligand Pose Selection Applied to Virtual Screening in Drug Discovery

被引:1
作者
Tran-Nguyen, Viet-Khoa [1 ]
Camproux, Anne-Claude [1 ]
Taboureau, Olivier [1 ]
机构
[1] Univ Paris Cite, CNRS, UMR8251, INSERM U1133,Unite Biol Fonct & Adaptat, F-75013 Paris, France
关键词
good pose probability; machine-learning; PLEC fingerprints; pose classification; pose selection; support vector machine; virtual screening; MOLECULAR DOCKING; SCORING FUNCTIONS; BINDING-AFFINITY; PREDICTION; SHAPE; PERFORMANCE; INHIBITORS; COMPLEXES; DATABASE; ACCURACY;
D O I
10.1002/aisy.202400238
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Determining the target-bound conformation of a drug-like molecule is a crucial step in drug design, as it affects the outcome of virtual screening (VS), and paves the way for hit-to-lead and lead optimization. While most docking programs usually manage to produce at least a near-native pose for a bioactive molecule inside its binding pocket, their integrated classical scoring functions (SFs) generally fail to prioritize this pose. Many studies have been carried out to tackle this SF problem, offering multiple pose refinement and/or classification methods, albeit with limitations. This study presents a new support vector machine model for pose classification, called "ClassyPose", which predicts the probability that a receptor-bound ligand conformation could be near-native, without any additional pose optimization step. Trained on protein-ligand extended connectivity features extracted from over 21 600 crystal and docking poses of diverse ligands, this model outperformed other machine-learning algorithms and three existing SFs in terms of docking power, identifying the native ligand pose as top-ranked solution for more than 90% of entries in two test sets. It also achieved high specificity (above 0.96), and improved VS performance when used for pose selection. This efficient, user-friendly tool and all related data are available at https://github.com/vktrannguyen/Classy_Pose. ClassyPose is a new support vector machine model for correct ligand pose selection. Trained on protein-ligand features extracted from native and redocked binding modes of diverse ligands, it has strong docking power, achieves high specificity, and improves virtual screening performance when used as a pose selection tool. The code and all data are user-friendly and available free of charge.image (c) 2024 WILEY-VCH GmbH
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Machine-Learning Techniques Applied to Antibacterial Drug Discovery
    Durrant, Jacob D.
    Amaro, Rommie E.
    CHEMICAL BIOLOGY & DRUG DESIGN, 2015, 85 (01) : 14 - 21
  • [2] Machine-learning scoring functions for structure-based virtual screening
    Li Hongjian
    Sze, Kam-Heung
    Lu Gang
    Ballester, Pedro J.
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2021, 11 (01)
  • [3] Molecular Docking for Drug Discovery: Machine-Learning Approaches for Native Pose Prediction of Protein-Ligand Complexes
    Ashtawy, Hossam M.
    Mahapatra, Nihar R.
    COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS: 10TH INTERNATIONAL MEETING, 2014, 8452 : 15 - 32
  • [4] Evaluation of machine-learning methods for ligand-based virtual screening
    Beining Chen
    Robert F. Harrison
    George Papadatos
    Peter Willett
    David J. Wood
    Xiao Qing Lewell
    Paulette Greenidge
    Nikolaus Stiefl
    Journal of Computer-Aided Molecular Design, 2007, 21 : 53 - 62
  • [5] Evaluation of machine-learning methods for ligand-based virtual screening
    Chen, Beining
    Harrison, Robert F.
    Papadatos, George
    Willett, Peter
    Wood, David J.
    Lewell, Xiao Qing
    Greenidge, Paulette
    Stiefl, Nikolaus
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2007, 21 (1-3) : 53 - 62
  • [6] Structure-Based Drug Screening and Ligand-Based Drug Screening with Machine Learning
    Fukunishi, Yoshifumi
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2009, 12 (04) : 397 - 408
  • [7] Machine Learning-based Virtual Screening and Its Applications to Alzheimer's Drug Discovery: A Review
    Carpenter, Kristy A.
    Huang, Xudong
    CURRENT PHARMACEUTICAL DESIGN, 2018, 24 (28) : 3347 - 3358
  • [8] Consensus holistic virtual screening for drug discovery: a novel machine learning model approach
    Moshawih, Said
    Bu, Zhen Hui
    Goh, Hui Poh
    Kifli, Nurolaini
    Lee, Lam Hong
    Goh, Khang Wen
    Ming, Long Chiau
    JOURNAL OF CHEMINFORMATICS, 2024, 16 (01):
  • [9] When drug discovery meets web search: Learning to Rank for ligand-based virtual screening
    Zhang, Wei
    Ji, Lijuan
    Chen, Yanan
    Tang, Kailin
    Wang, Haiping
    Zhu, Ruixin
    Jia, Wei
    Cao, Zhiwei
    Liu, Qi
    JOURNAL OF CHEMINFORMATICS, 2015, 7 : 1 - 13
  • [10] How can machine-learning methods assist in virtual screening for hyperuricemia? A healthcare machine-learning approach
    Ichikawa, Daisuke
    Saito, Toki
    Ujita, Waka
    Oyama, Hiroshi
    JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 64 : 20 - 24