QSARtuna: An Automated QSAR Modeling Platform for Molecular Property Prediction in Drug Design

被引:4
作者
Mervin, Lewis [1 ]
Voronov, Alexey [2 ]
Kabeshov, Mikhail [2 ]
Engkvist, Ola [2 ,3 ]
机构
[1] AstraZeneca, Mol AI, Discovery Sci, R&D, Cambridge CB2 0AA, England
[2] AstraZeneca, Mol AI, Discovery Sci, R&D, S-41296 Gothenburg, Sweden
[3] Chalmers Univ Technol, Univ Gothenburg, Dept Comp Sci & Engn, S-41296 Gothenburg, Sweden
关键词
SOLUBILITY; CURATION; TOOL;
D O I
10.1021/acs.jcim.4c00457
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Machine-learning (ML) and deep-learning (DL) approaches to predict the molecular properties of small molecules are increasingly deployed within the design-make-test-analyze (DMTA) drug design cycle to predict molecular properties of interest. Despite this uptake, there are only a few automated packages to aid their development and deployment that also support uncertainty estimation, model explainability, and other key aspects of model usage. This represents a key unmet need within the field, and the large number of molecular representations and algorithms (and associated parameters) means it is nontrivial to robustly optimize, evaluate, reproduce, and deploy models. Here, we present QSARtuna, a molecule property prediction modeling pipeline, written in Python and utilizing the Optuna, Scikit-learn, RDKit, and ChemProp packages, which enables the efficient and automated comparison between molecular representations and machine learning models. The platform was developed by considering the increasingly important aspect of model uncertainty quantification and explainability by design. We provide details for our framework and provide illustrative examples to demonstrate the capability of the software when applied to simple molecular property, reaction/reactivity prediction, and DNA encoded library enrichment classification. We hope that the release of QSARtuna will further spur innovation in automatic ML modeling and provide a platform for education of best practices in molecular property modeling. The code for the QSARtuna framework is made freely available via GitHub.
引用
收藏
页码:5365 / 5374
页数:10
相关论文
共 5 条
  • [1] Drug design of new anti-EBOV inhibitors: QSAR, homology modeling, molecular docking and molecular dynamics studies
    Lahcen, Nouhaila Ait
    Liman, Wissal
    Oubahmane, Mehdi
    Hdoufane, Ismail
    Habibi, Youssef
    Alanazi, Ashwag S.
    Alanazi, Mohammed M.
    Delaite, Christelle
    Maatallah, Mohamed
    Cherqaoui, Driss
    ARABIAN JOURNAL OF CHEMISTRY, 2024, 17 (09)
  • [2] Molecular Thermodynamic Modeling and Design of Microencapsulation Systems for Drug Delivery
    Abildskov, Jens
    O'Connell, John P.
    JOURNAL OF CHEMICAL AND ENGINEERING DATA, 2011, 56 (04) : 1229 - 1237
  • [3] Modeling and Prediction of Drug Dispersability in Polyvinylpyrrolidone-Vinyl Acetate Copolymer Using a Molecular Descriptor
    DeBoyace, Kevin
    Buckner, Ira S.
    Gong, Yuchuan
    Ju, Tzu-chi Rob
    Wildfong, Peter L. D.
    JOURNAL OF PHARMACEUTICAL SCIENCES, 2018, 107 (01) : 334 - 343
  • [4] Prediction of Novel Anoctamin1 (ANO1) Inhibitors Using 3D-QSAR Pharmacophore Modeling and Molecular Docking
    Lee, Yoon Hyeok
    Yi, Gwan-Su
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2018, 19 (10)
  • [5] AI-Driven De Novo Design and Molecular Modeling for Discovery of Small-Molecule Compounds as Potential Drug Candidates Targeting SARS-CoV-2 Main Protease
    Andrianov, Alexander M.
    Shuldau, Mikita A.
    Furs, Konstantin V.
    Yushkevich, Artsemi M.
    Tuzikov, Alexander V.
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (09)