Enhancing reinforcement learning for de novo molecular design applying self-attention mechanisms

Cited by: 1
Authors
Pereira, Tiago O. [1]
Abbasi, Maryam [2]
Arrais, Joel P. [3]
Affiliations
[1] Univ Coimbra, Dept Informat Engn, Informat Engn, Coimbra, Portugal
[2] Univ Coimbra, Ctr Informat & Syst, Coimbra, Portugal
[3] Univ Coimbra, Dept Informat Engn, Coimbra, Portugal
Keywords
drug design; SMILES; deep learning; transformer; reinforcement learning; GENERATION; USP7; CANCER; LIBRARIES
DOI
10.1093/bib/bbad368
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
The drug discovery process can be significantly improved by applying deep reinforcement learning (RL) methods that learn to generate compounds with desired pharmacological properties. Nevertheless, RL-based methods typically condense the evaluation of sampled compounds into a single scalar value, making it difficult for the generative agent to learn the optimal policy. This work combines self-attention mechanisms and RL to generate promising molecules. The idea is to evaluate the relative significance of each atom and functional group in the interaction with the target, and to use this information to optimize the Generator. The framework for de novo drug design is therefore composed of a Generator that samples new compounds, combined with a Transformer-encoder and a biological affinity Predictor that evaluate the generated structures. Moreover, it takes advantage of the knowledge encapsulated in the Transformer's attention weights to evaluate each token individually. We compared the performance of two output prediction strategies for the Transformer: standard and masked language model (MLM). The results show that the MLM Transformer optimizes the Generator more effectively than state-of-the-art approaches. Additionally, the evaluation models identified the regions of each molecule that are most important for the biological interaction with the target. As a case study, we generated synthesizable hit compounds that are putative inhibitors of the enzyme ubiquitin-specific protein 7 (USP7).
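The per-token evaluation described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that a single Predictor score is split across SMILES tokens in proportion to attention weights and then used in a REINFORCE-style loss. The function names `per_token_rewards` and `policy_gradient_loss`, the example weights, and the log-probabilities are all hypothetical.

```python
import math


def per_token_rewards(scalar_reward, attention_weights):
    """Spread one scalar reward over tokens in proportion to attention weights."""
    n = len(attention_weights)
    if n == 0:
        return []
    total = sum(attention_weights)
    if total <= 0:
        # Fall back to a uniform split when the weights carry no signal.
        return [scalar_reward / n] * n
    return [scalar_reward * w / total for w in attention_weights]


def policy_gradient_loss(log_probs, token_rewards):
    """REINFORCE-style loss: negative reward-weighted log-likelihood."""
    return -sum(lp * r for lp, r in zip(log_probs, token_rewards))


# Acetamide, CC(=O)N, split into SMILES tokens.
tokens = ["C", "C", "(", "=", "O", ")", "N"]
# Hypothetical attention weights, one per token (assumed already averaged over heads).
attention = [0.05, 0.10, 0.05, 0.15, 0.35, 0.05, 0.25]
# Hypothetical Generator log-probabilities for the sampled tokens.
log_probs = [math.log(0.5)] * len(tokens)

rewards = per_token_rewards(0.8, attention)  # 0.8 is an assumed Predictor score
loss = policy_gradient_loss(log_probs, rewards)

for tok, r in zip(tokens, rewards):
    print(f"{tok}: {r:.3f}")
print(f"loss: {loss:.4f}")
```

In this sketch the per-token rewards always sum to the original scalar, so the total learning signal is unchanged and only its distribution over tokens differs. Tokens with higher attention weight receive a larger share of the credit.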
Pages: 10