ProDCoNN: Protein design using a convolutional neural network

被引:43
作者
Zhang, Yuan [1 ]
Chen, Yang [1 ]
Wang, Chenran [1 ]
Lo, Chun-Chao [1 ]
Liu, Xiuwen [2 ]
Wu, Wei [1 ]
Zhang, Jinfeng [1 ]
机构
[1] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
[2] Florida State Univ, Dept Comp Sci, Tallahassee, FL 32306 USA
基金
美国国家卫生研究院;
关键词
convolutional neural network; inverse folding problem; ProDCoNN; protein design; protein engineering; COMPUTATIONAL DESIGN; SEQUENCE PROFILES; PREDICTION; STABILITY; CHALLENGES; EVOLUTION; ALIGNMENT; IMPROVE; MODELS;
D O I
10.1002/prot.25868
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Designing protein sequences that fold to a given three-dimensional (3D) structure has long been a challenging problem in computational structural biology with significant theoretical and practical implications. In this study, we first formulated this problem as predicting the residue type given the 3D structural environment around the C-alpha atom of a residue, which is repeated for each residue of a protein. We designed a nine-layer 3D deep convolutional neural network (CNN) that takes as input a gridded box with the atomic coordinates and types around a residue. Several CNN layers were designed to capture structure information at different scales, such as bond lengths, bond angles, torsion angles, and secondary structures. Trained on a very large number of protein structures, the method, called ProDCoNN (protein design with CNN), achieved state-of-the-art performance when tested on large numbers of test proteins and benchmark datasets.
引用
收藏
页码:819 / 829
页数:11
相关论文
共 67 条
[1]   Convolutional Neural Networks for Speech Recognition [J].
Abdel-Hamid, Ossama ;
Mohamed, Abdel-Rahman ;
Jiang, Hui ;
Deng, Li ;
Penn, Gerald ;
Yu, Dong .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (10) :1533-1545
[2]   An exciting but challenging road ahead for computational enzyme design [J].
Baker, David .
PROTEIN SCIENCE, 2010, 19 (10) :1817-1819
[3]  
BUSIA A, 2017, ARXIV170203865V1
[4]   Computational structure-based redesign of enzyme activity [J].
Chen, Cheng-Yu ;
Georgiev, Ivelin ;
Anderson, Amy C. ;
Donald, Bruce R. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (10) :3764-3769
[5]   Computational protein design: a review [J].
Coluzza, Ivan .
JOURNAL OF PHYSICS-CONDENSED MATTER, 2017, 29 (14)
[6]   Protein Native-State Stabilization by Placing Aromatic Side Chains in N-Glycosylated Reverse Turns [J].
Culyba, Elizabeth K. ;
Price, Joshua L. ;
Hanson, Sarah R. ;
Dhar, Apratim ;
Wong, Chi-Huey ;
Gruebele, Martin ;
Powers, Evan T. ;
Kelly, Jeffery W. .
SCIENCE, 2011, 331 (6017) :571-575
[7]   De novo protein design: Towards fully automated sequence selection [J].
Dahiyat, BI ;
Sarisky, CA ;
Mayo, SL .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 273 (04) :789-796
[8]   Improving computational protein design by using structure-derived sequence profile [J].
Dai, Liang ;
Yang, Yuedong ;
Kim, Hyung Rae ;
Zhou, Yaoqi .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2010, 78 (10) :2338-2348
[9]   A large scale test of computational protein design: Folding and stability of nine completely redesigned globular proteins [J].
Dantas, G ;
Kuhlman, B ;
Callender, D ;
Wong, M ;
Baker, D .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 332 (02) :449-460
[10]   DE-NOVO DESIGN OF THE HYDROPHOBIC CORES OF PROTEINS [J].
DESJARLAIS, JR ;
HANDEL, TM .
PROTEIN SCIENCE, 1995, 4 (10) :2006-2018