Prediction of Enzyme Function Based on Three Parallel Deep CNN and Amino Acid Mutation

被引:19
作者
Gao, Ruibo [1 ]
Wang, Mengmeng [1 ]
Zhou, Jiaoyan [1 ]
Fu, Yuhang [1 ]
Liang, Meng [1 ]
Guo, Dongliang [1 ]
Nie, Junlan [1 ]
机构
[1] Yanshan Univ, Sch Informat Sci & Engn, Qinhuangdao 066004, Hebei, Peoples R China
关键词
enzyme function prediction; DCNN; amino acid sequence; mutation information; ELECTRON-TRANSPORT PROTEINS; BASIS FUNCTION NETWORKS; MOLECULAR FUNCTIONS; SEQUENCE; VISUALIZATION;
D O I
10.3390/ijms20112845
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
During the past decade, due to the number of proteins in PDB database being increased gradually, traditional methods cannot better understand the function of newly discovered enzymes in chemical reactions. Computational models and protein feature representation for predicting enzymatic function are more important. Most of existing methods for predicting enzymatic function have used protein geometric structure or protein sequence alone. In this paper, the functions of enzymes are predicted from many-sided biological information including sequence information and structure information. Firstly, we extract the mutation information from amino acids sequence by the position scoring matrix and express structure information with amino acids distance and angle. Then, we use histogram to show the extracted sequence and structural features respectively. Meanwhile, we establish a network model of three parallel Deep Convolutional Neural Networks (DCNN) to learn three features of enzyme for function prediction simultaneously, and the outputs are fused through two different architectures. Finally, The proposed model was investigated on a large dataset of 43,843 enzymes from the PDB and achieved 92.34% correct classification when sequence information is considered, demonstrating an improvement compared with the previous result.
引用
收藏
页数:12
相关论文
共 39 条
[11]   Metagenomics and the protein universe [J].
Godzik, Adam .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2011, 21 (03) :398-403
[12]   The FEATURE framework for protein function annotation: modelling new functions, improving performance, and extending to novel applications [J].
Halperin, Inbal ;
Glazer, Dariya S. ;
Wu, Shirley ;
Altman, Russ B. B. .
BMC GENOMICS, 2008, 9 (Suppl 2)
[13]   PFP: Automated prediction of gene ontology functional annotations with confidence scores using protein sequence data [J].
Hawkins, Troy ;
Chitale, Meghana ;
Luban, Stanislav ;
Kihara, Daisuke .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 74 (03) :566-582
[14]   Structure is three to ten times more conserved than sequence-A study of structural response in protein cores [J].
Illergard, Kristoffer ;
Ardell, David H. ;
Elofison, Arne .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 (03) :499-508
[15]   Visualization and analysis of non-covalent contacts using the Protein Contacts Atlas [J].
Kayikci, Melis ;
Venkatakrishnan, A. J. ;
Scott-Brown, James ;
Ravarani, Charles N. J. ;
Flock, Tilman ;
Babu, M. Madan .
NATURE STRUCTURAL & MOLECULAR BIOLOGY, 2018, 25 (02) :185-+
[16]   Visualization of Biomolecular Structures: State of the Art Revisited [J].
Kozlikova, B. ;
Krone, M. ;
Falk, M. ;
Lindow, N. ;
Baaden, M. ;
Baum, D. ;
Viola, I. ;
Parulek, J. ;
Hege, H-C. .
COMPUTER GRAPHICS FORUM, 2017, 36 (08) :178-204
[17]  
Kumar C, 2012, EURASIP J BIOINFORM, DOI [10.1186/1687-4153-2012-1, 10.1186/1687-4153-2012]
[18]   MS-kNN: protein function prediction by integrating multiple data sources [J].
Lan, Liang ;
Djuric, Nemanja ;
Guo, Yuhong ;
Vucetic, Slobodan .
BMC BIOINFORMATICS, 2013, 14
[19]   Identifying the molecular functions of electron transport proteins using radial basis function networks and biochemical properties [J].
Le, Nguyen-Quoc-Khanh ;
Nguyen, Trinh-Trung-Duong ;
Ou, Yu-Yen .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2017, 73 :166-178
[20]   Deep learning [J].
LeCun, Yann ;
Bengio, Yoshua ;
Hinton, Geoffrey .
NATURE, 2015, 521 (7553) :436-444