Machine learning-assisted search for novel coagulants: When machine learning can be efficient even if data availability is low

被引:1
作者
Rovenchak, Andrij [1 ,2 ]
Druchok, Maksym [1 ,3 ,4 ]
机构
[1] SoftServe Inc, Lvov, Ukraine
[2] Ivan Franko Natl Univ Lviv, Prof Ivan Vakarchuk Dept Theoret Phys, Lvov, Ukraine
[3] Inst Condensed Matter Phys, Lvov, Ukraine
[4] Inst Condensed Matter Phys, 1 Svientsitskii St, UA-79011 Lvov, Ukraine
关键词
anticoagulants; coagulants; machine learning; molecular design; PROTEIN-C; VARIATIONAL AUTOENCODER; BINDING AFFINITIES; LIGAND; PREDICTION; DISCOVERY; DESIGN; SMILES;
D O I
10.1002/jcc.27292
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Design of new drugs is a challenging process: a candidate molecule should satisfy multiple conditions to act properly and make the least side-effect-perfect candidates selectively attach to and influence only targets, leaving off-targets intact. The amount of experimental data about various properties of molecules constantly grows, promoting data-driven approaches. However, the applicability of typical predictive machine learning techniques can be substantially limited by a lack of experimental data about a particular target. For example, there are many known Thrombin inhibitors (acting as anticoagulants), but a very limited number of known Protein C inhibitors (coagulants). In this study, we present our approach to suggest new inhibitor candidates by building an effective representation of chemical space. For this aim, we developed a deep learning model-autoencoder, trained on a large set of molecules in the SMILES format to map the chemical space. Further, we applied different sampling strategies to generate novel coagulant candidates. Symmetrically, we tested our approach on anticoagulant candidates, where we were able to predict their inhibition towards Thrombin. We also compare our approach with MegaMolBART-another deep learning generative model, but exploiting similar principles of navigation in a chemical space. This study employs machine learning to generate new drugs, emphasizing cases with low data availability. Focusing on coagulants, underrepresented in databases, our approach generates molecular encodings based on the assumption that similar structures share properties. Strategies tested on anticoagulants are applied to discover novel coagulant candidates, navigating the encoding space. image
引用
收藏
页码:937 / 952
页数:16
相关论文
共 50 条
  • [41] Machine Learning-Assisted Man Overboard Detection Using Radars
    Tsekenis, Vasileios
    Armeniakos, Charalampos K.
    Nikolaidis, Viktor
    Bithas, Petros S.
    Kanatas, Athanasios G.
    ELECTRONICS, 2021, 10 (11)
  • [42] Monitoring of Fibre Optic Links With a Machine Learning-Assisted Low-Cost Polarimeter
    Slapak, Martin
    Vojtech, Josef
    Havlis, Ondrej
    Slavik, Radan
    IEEE ACCESS, 2020, 8 : 183965 - 183971
  • [43] Machine learning-assisted directed protein evolution with combinatorial libraries
    Wu, Zachary
    Kan, S. B. Jennifer
    Lewis, Russell D.
    Wittmann, Bruce J.
    Arnold, Frances H.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (18) : 8852 - 8858
  • [44] GUARDIAML: Machine Learning-Assisted Dynamic Information Flow Control
    Pupo, Angel Luis Scull
    Nicolay, Jens
    Efthymiadis, Kyriakos
    Nowe, Ann
    De Roover, Coen
    Boix, Elisa Gonzalez
    2019 IEEE 26TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING (SANER), 2019, : 624 - 628
  • [45] Machine learning-assisted screening for cognitive impairment in the emergency department
    Yadgir, Simon R.
    Engstrom, Collin
    Jacobsohn, Gwen Costa
    Green, Rebecca K.
    Jones, Courtney M. C.
    Cushman, Jeremy T.
    Caprio, Thomas, V
    Kind, Amy J. H.
    Lohmeier, Michael
    Shah, Manish N.
    Patterson, Brian W.
    JOURNAL OF THE AMERICAN GERIATRICS SOCIETY, 2022, 70 (03) : 831 - 837
  • [46] Machine Learning-Assisted In Vitro Rooting Optimization in Passiflora caerulea
    Jafari, Marziyeh
    Daneshvar, Mohammad Hosein
    Jafari, Sahar
    Hesami, Mohsen
    FORESTS, 2022, 13 (12):
  • [47] Machine learning-assisted investigation of anisotropic elasticity in metallic alloys
    Zhang, Weimin
    Alkhazaleh, Hamzah Ali
    Samavatian, Majid
    Samavatian, Vahid
    MATERIALS TODAY COMMUNICATIONS, 2024, 40
  • [49] Machine Learning-Assisted Low-Dimensional Electrocatalysts Design for Hydrogen Evolution Reaction
    Jin Li
    Naiteng Wu
    Jian Zhang
    Hong-Hui Wu
    Kunming Pan
    Yingxue Wang
    Guilong Liu
    Xianming Liu
    Zhenpeng Yao
    Qiaobao Zhang
    Nano-Micro Letters, 2023, 15
  • [50] Machine Learning-Assisted Codebook Design for MMSE Channel Estimation
    Tian, Xiaowen
    Hu, Yeqing
    Li, Yang
    Wang, Tiexing
    Zhang, Jianzhong
    2023 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS, ICC WORKSHOPS, 2023, : 283 - 288