Enhancing reinforcement learning for de novo molecular design applying self-attention mechanisms

Cited by: 1
Authors
Pereira, Tiago O. [1 ]
Abbasi, Maryam [2 ]
Arrais, Joel P. [3 ]
Affiliations
[1] Univ Coimbra, Dept Informat Engn, Coimbra, Portugal
[2] Univ Coimbra, Ctr Informat & Syst, Coimbra, Portugal
[3] Univ Coimbra, Dept Informat Engn, Coimbra, Portugal
Keywords
drug design; SMILES; deep learning; transformer; reinforcement learning; generation; USP7; cancer; libraries
DOI
10.1093/bib/bbad368
Chinese Library Classification
Q5 [Biochemistry];
Discipline Codes
071010 ; 081704 ;
Abstract
The drug discovery process can be significantly improved by applying deep reinforcement learning (RL) methods that learn to generate compounds with desired pharmacological properties. Nevertheless, RL-based methods typically condense the evaluation of sampled compounds into a single scalar value, making it difficult for the generative agent to learn the optimal policy. This work combines self-attention mechanisms and RL to generate promising molecules. The idea is to evaluate the relative significance of each atom and functional group in the interaction with the target, and to use this information to optimize the Generator. The framework for de novo drug design is therefore composed of a Generator that samples new compounds, combined with a Transformer-encoder and a biological affinity Predictor that evaluate the generated structures. Moreover, it takes advantage of the knowledge encapsulated in the Transformer's attention weights to evaluate each token individually. We compared the performance of two output prediction strategies for the Transformer: standard and masked language model (MLM). The results show that the MLM Transformer is more effective at optimizing the Generator than state-of-the-art approaches. Additionally, the evaluation models identified the regions of each molecule most important for the biological interaction with the target. As a case study, we generated synthesizable hit compounds that are putative inhibitors of the enzyme ubiquitin-specific protease 7 (USP7).
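The abstract describes turning one scalar affinity score into per-token feedback by weighting it with the Transformer's attention over the SMILES tokens. A minimal sketch of that idea, assuming a simple proportional split of the reward and a REINFORCE-style objective (function names and numbers are illustrative, not the authors' code):

```python
# Hedged sketch: distribute a scalar affinity reward over SMILES tokens
# in proportion to the attention each token received from the predictor,
# then form a reward-weighted policy-gradient loss for the Generator.

def token_rewards(scalar_reward, attention_weights):
    """Split one scalar reward across tokens proportionally to the
    (normalized) attention weight assigned to each token."""
    total = sum(attention_weights)
    return [scalar_reward * w / total for w in attention_weights]

def reinforce_loss(log_probs, rewards):
    """REINFORCE-style objective: negative sum of reward-weighted
    log-probabilities of the sampled tokens."""
    return -sum(r * lp for r, lp in zip(rewards, log_probs))

# Toy example: a 4-token fragment where the predictor attends most to
# the third token (e.g. a key functional group), so that token's
# log-probability is reinforced most strongly.
attn = [0.1, 0.2, 0.6, 0.1]
rewards = token_rewards(2.0, attn)               # third token gets the largest share
loss = reinforce_loss([-1.0, -0.5, -0.7, -1.2], rewards)
```

In the paper's framework the scalar score comes from the biological affinity Predictor and the weights from the Transformer-encoder's self-attention; this sketch only illustrates why per-token credit assignment gives the agent a richer signal than a single scalar.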
Pages: 10
Related Papers
50 records total
  • [1] Reinforcement Learning with Self-Attention Networks for Cryptocurrency Trading
    Betancourt, Carlos
    Chen, Wen-Hui
    APPLIED SCIENCES-BASEL, 2021, 11 (16):
  • [2] Molecular de-novo design through deep reinforcement learning
    Olivecrona, Marcus
    Blaschke, Thomas
    Engkvist, Ola
    Chen, Hongming
    JOURNAL OF CHEMINFORMATICS, 2017, 9
  • [3] De novo molecular design using deep reinforcement learning methods
    Chen, Hongming
    Olivecrona, Marcus
    Blaschke, Thomas
    Engkvist, Ola
    Kogej, Thierry
    Tyrchan, Christian
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256
  • [5] A Self-attention Agent of Reinforcement Learning in Continuous Integration Testing
    Liu, Bangfu
    Li, Zheng
    Zhao, Ruilian
    Shang, Ying
    2023 IEEE 47TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC, 2023, : 886 - 891
  • [6] Adversarial Inverse Reinforcement Learning With Self-Attention Dynamics Model
    Sun, Jiankai
    Yu, Lantao
    Dong, Pinqian
    Lu, Bo
    Zhou, Bolei
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 1880 - 1886
  • [7] Memory-assisted reinforcement learning for diverse molecular de novo design
    Blaschke, Thomas
    Engkvist, Ola
    Bajorath, Juergen
    Chen, Hongming
    JOURNAL OF CHEMINFORMATICS, 2020, 12 (01)
  • [9] Applying Self-attention for Stance Classification
    Bugueno, Margarita
    Mendoza, Marcelo
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS (CIARP 2019), 2019, 11896 : 51 - 61
  • [10] Utilizing reinforcement learning for de novo drug design
    Gummesson Svensson, Hampus
    Tyrchan, Christian
    Engkvist, Ola
    Haghir Chehreghani, Morteza
    MACHINE LEARNING, 2024, 113 (07) : 4811 - 4843