Recurrent Neural Network Model for Constructive Peptide Design

被引:167
作者
Mueller, Alex T. [1 ]
Hiss, Jan A. [1 ]
Schneider, Gisbert [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Chem & Appl Biosci, Vladimir Prelog Weg 4, CH-8093 Zurich, Switzerland
基金
瑞士国家科学基金会;
关键词
ANTIMICROBIAL PEPTIDE; SECONDARY STRUCTURE; PREDICTION; DATABASE; SEQUENCES;
D O I
10.1021/acs.jcim.7b00414
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
We present a generative long short-term memory (LSTM) recurrent neural network (RNN) for combinatorial de novo peptide design. RNN models capture patterns in sequential data and generate new data instances from the learned context. Amino acid sequences represent a suitable input for these machine-learning models. Generative models trained on peptide sequences could therefore facilitate the design of bespoke peptide libraries. We trained RNNs with LSTM units on pattern recognition of helical antimicrobial peptides and used the resulting model for de novo sequence generation. Of these sequences, 82% were predicted to be active antimicrobial peptides compared to 65% of randomly sampled sequences with the same amino acid distribution as the training set. The generated sequences also lie closer to the training data than manually designed amphipathic helices. The results of this study showcase the ability of LSTM RNNs to construct new amino acid sequences within the applicability domain of the model and motivate their prospective application to peptide and protein design without the need for the exhaustive enumeration of sequence libraries.
引用
收藏
页码:472 / 479
页数:8
相关论文
共 62 条
[1]  
Abadi M., 2015, PREPRINT
[2]  
[Anonymous], 2014, Generating sequences with recurrent neural networks
[3]  
[Anonymous], MOL INF
[4]  
[Anonymous], MOL INF
[5]  
[Anonymous], 1997, Neural Computation
[6]  
[Anonymous], 2011, P 28 INT C MACH LEAR
[7]  
[Anonymous], P 32 INT C MACH LEAR
[8]  
[Anonymous], 2016, ADV NEURAL INFORM PR
[9]   The principled design of large-scale recursive neural network architectures-DAG-RNNs and the protein structure prediction problem [J].
Baldi, P ;
Pollastri, G .
JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 4 (04) :575-602
[10]   A renaissance of neural networks in drug discovery [J].
Baskin, Igor I. ;
Winkler, David ;
Tetko, Igor V. .
EXPERT OPINION ON DRUG DISCOVERY, 2016, 11 (08) :785-795