Transformer neural network for protein-specific de novo drug generation as a machine translation problem

被引:109
作者
Grechishnikova, Daria [1 ]
机构
[1] Lomonosov Moscow State Univ, Dept Phys, Fac Phys, Leninskie Gory 1-2, Moscow 119991, Russia
关键词
MOLECULAR-PROPERTIES; DESIGN; MODEL; AUTOENCODER; DISCOVERY;
D O I
10.1038/s41598-020-79682-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Drug discovery for a protein target is a very laborious, long and costly process. Machine learning approaches and, in particular, deep generative networks can substantially reduce development time and costs. However, the majority of methods imply prior knowledge of protein binders, their physicochemical characteristics or the three-dimensional structure of the protein. The method proposed in this work generates novel molecules with predicted ability to bind a target protein by relying on its amino acid sequence only. We consider target-specific de novo drug design as a translational problem between the amino acid "language" and simplified molecular input line entry system representation of the molecule. To tackle this problem, we apply Transformer neural network architecture, a state-of-the-art approach in sequence transduction tasks. Transformer is based on a self-attention technique, which allows the capture of long-range dependencies between items in sequence. The model generates realistic diverse compounds with structural novelty. The computed physicochemical properties and common metrics used in drug discovery fall within the plausible drug-like range of values.
引用
收藏
页数:13
相关论文
共 52 条
  • [1] [Anonymous], 2015, ABS151105644 CORR
  • [2] [Anonymous], 2017, OBJECTIVE REINFORCED
  • [3] Targeting the insulin-like growth factor-1 receptor in human cancer
    Arcaro, Alexandre
    [J]. FRONTIERS IN PHARMACOLOGY, 2013, 4
  • [4] The $2.6 Billion Pill - Methodologic and Policy Considerations
    Avorn, Jerry
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2015, 372 (20) : 1877 - 1879
  • [5] Bickerton GR, 2012, NAT CHEM, V4, P90, DOI [10.1038/NCHEM.1243, 10.1038/nchem.1243]
  • [6] Application of Generative Autoencoder in De Novo Molecular Design
    Blaschke, Thomas
    Olivecrona, Marcus
    Engkvist, Ola
    Bajorath, Jurgen
    Chen, Hongming
    [J]. MOLECULAR INFORMATICS, 2018, 37 (1-2)
  • [7] Cho K., C EMP METH NAT LANG
  • [8] Cohen E, 2019, 36 INT C MACHINE LEA, V97
  • [9] Edunov S, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P489
  • [10] Deep learning for molecular design-a review of the state of the art
    Elton, Daniel C.
    Boukouvalas, Zois
    Fuge, Mark D.
    Chung, Peter W.
    [J]. MOLECULAR SYSTEMS DESIGN & ENGINEERING, 2019, 4 (04) : 828 - 849