Machine learning-assisted search for novel coagulants: When machine learning can be efficient even if data availability is low

被引:1
作者
Rovenchak, Andrij [1 ,2 ]
Druchok, Maksym [1 ,3 ,4 ]
机构
[1] SoftServe Inc, Lvov, Ukraine
[2] Ivan Franko Natl Univ Lviv, Prof Ivan Vakarchuk Dept Theoret Phys, Lvov, Ukraine
[3] Inst Condensed Matter Phys, Lvov, Ukraine
[4] Inst Condensed Matter Phys, 1 Svientsitskii St, UA-79011 Lvov, Ukraine
关键词
anticoagulants; coagulants; machine learning; molecular design; PROTEIN-C; VARIATIONAL AUTOENCODER; BINDING AFFINITIES; LIGAND; PREDICTION; DISCOVERY; DESIGN; SMILES;
D O I
10.1002/jcc.27292
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Design of new drugs is a challenging process: a candidate molecule should satisfy multiple conditions to act properly and make the least side-effect-perfect candidates selectively attach to and influence only targets, leaving off-targets intact. The amount of experimental data about various properties of molecules constantly grows, promoting data-driven approaches. However, the applicability of typical predictive machine learning techniques can be substantially limited by a lack of experimental data about a particular target. For example, there are many known Thrombin inhibitors (acting as anticoagulants), but a very limited number of known Protein C inhibitors (coagulants). In this study, we present our approach to suggest new inhibitor candidates by building an effective representation of chemical space. For this aim, we developed a deep learning model-autoencoder, trained on a large set of molecules in the SMILES format to map the chemical space. Further, we applied different sampling strategies to generate novel coagulant candidates. Symmetrically, we tested our approach on anticoagulant candidates, where we were able to predict their inhibition towards Thrombin. We also compare our approach with MegaMolBART-another deep learning generative model, but exploiting similar principles of navigation in a chemical space. This study employs machine learning to generate new drugs, emphasizing cases with low data availability. Focusing on coagulants, underrepresented in databases, our approach generates molecular encodings based on the assumption that similar structures share properties. Strategies tested on anticoagulants are applied to discover novel coagulant candidates, navigating the encoding space. image
引用
收藏
页码:937 / 952
页数:16
相关论文
共 50 条
  • [31] A machine learning-assisted data aggregation and offloading system for cloud–IoT communication
    Osama Alfarraj
    Peer-to-Peer Networking and Applications, 2021, 14 : 2554 - 2564
  • [32] Machine Learning-Assisted Device Modeling With Process Variations for Advanced Technology
    Lyu, Yaoyang
    Chen, Wangyong
    Zheng, Mingyue
    Yin, Binyu
    Li, Jinning
    Cai, Linlin
    IEEE JOURNAL OF THE ELECTRON DEVICES SOCIETY, 2023, 11 : 303 - 310
  • [33] Machine learning-assisted point-of-care diagnostics for cardiovascular healthcare
    Wang, Kaidong
    Tan, Bing
    Wang, Xinfei
    Qiu, Shicheng
    Zhang, Qiuping
    Wang, Shaolei
    Yen, Ying-Tzu
    Jing, Nan
    Liu, Changming
    Chen, Xuxu
    Liu, Shichang
    Yu, Yan
    BIOENGINEERING & TRANSLATIONAL MEDICINE, 2025,
  • [34] Machine Learning-Assisted High-Throughput Screening for Electrocatalytic Hydrogen Evolution Reaction
    Yin, Guohao
    Zhu, Haiyan
    Chen, Shanlin
    Li, Tingting
    Wu, Chou
    Jia, Shaobo
    Shang, Jianxiao
    Ren, Zhequn
    Ding, Tianhao
    Li, Yawei
    MOLECULES, 2025, 30 (04):
  • [35] Vul-Mixer: Efficient and Effective Machine Learning-Assisted Software Vulnerability Detection
    Grahn, Daniel
    Chen, Lingwei
    Zhang, Junjie
    ELECTRONICS, 2024, 13 (13)
  • [36] Efficient Removal of Greenhouse Gases: Machine Learning-Assisted Exploration of Metal-Organic Framework Space
    Xin, Ruiqi
    Wang, Chaohai
    Zhang, Yingchao
    Peng, Rongfu
    Li, Rui
    Wang, Junning
    Mao, Yanli
    Zhu, Xinfeng
    Zhu, Wenkai
    Kim, Minjun
    Nam, Ho Ngoc
    Yamauchi, Yusuke
    ACS NANO, 2024, 18 (30) : 19403 - 19422
  • [37] Machine learning-assisted macro simulation for yard arrival prediction
    Minbashi, Niloofar
    Sipila, Hans
    Palmqvist, Carl -William
    Bohlin, Markus
    Kordnejad, Behzad
    JOURNAL OF RAIL TRANSPORT PLANNING & MANAGEMENT, 2023, 25
  • [38] Machine Learning-Assisted Microfluidic Synthesis of Perovskite Quantum Dots
    Chen, Gaoyu
    Zhu, Xia
    Xing, Chenyu
    Wang, Yongkai
    Xu, Xiangxing
    Bao, Jianchun
    Huang, Jinghan
    Zhao, Yurong
    Wang, Xuan
    Zhou, Xiuqing
    Du, Xiuli
    Wang, Xun
    ADVANCED PHOTONICS RESEARCH, 2023, 4 (01):
  • [39] Uncertainty as a Predictor of Classification Accuracy in Machine Learning-Assisted Measurements
    Shirmohammadi, Shervin
    Amiri, Mohammad Hadi
    Al Osman, Hussein
    IEEE INSTRUMENTATION & MEASUREMENT MAGAZINE, 2024, 27 (07) : 37 - 45
  • [40] Machine learning-assisted cancer diagnosis in patients with paraneoplastic autoantibodies
    Maleki, Alireza
    Mohammadi, Mohammad Mahdi Mirza Ali
    Gholizadeh, Shahab
    Dalvandi, Behnaz
    Rahimi, Mohammad
    Tarokhian, Aidin
    DISCOVER ONCOLOGY, 2025, 16 (01)