Prediction of Enzyme Function Based on Three Parallel Deep CNN and Amino Acid Mutation

被引:19
作者
Gao, Ruibo [1 ]
Wang, Mengmeng [1 ]
Zhou, Jiaoyan [1 ]
Fu, Yuhang [1 ]
Liang, Meng [1 ]
Guo, Dongliang [1 ]
Nie, Junlan [1 ]
机构
[1] Yanshan Univ, Sch Informat Sci & Engn, Qinhuangdao 066004, Hebei, Peoples R China
关键词
enzyme function prediction; DCNN; amino acid sequence; mutation information; ELECTRON-TRANSPORT PROTEINS; BASIS FUNCTION NETWORKS; MOLECULAR FUNCTIONS; SEQUENCE; VISUALIZATION;
D O I
10.3390/ijms20112845
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
During the past decade, due to the number of proteins in PDB database being increased gradually, traditional methods cannot better understand the function of newly discovered enzymes in chemical reactions. Computational models and protein feature representation for predicting enzymatic function are more important. Most of existing methods for predicting enzymatic function have used protein geometric structure or protein sequence alone. In this paper, the functions of enzymes are predicted from many-sided biological information including sequence information and structure information. Firstly, we extract the mutation information from amino acids sequence by the position scoring matrix and express structure information with amino acids distance and angle. Then, we use histogram to show the extracted sequence and structural features respectively. Meanwhile, we establish a network model of three parallel Deep Convolutional Neural Networks (DCNN) to learn three features of enzyme for function prediction simultaneously, and the outputs are fused through two different architectures. Finally, The proposed model was investigated on a large dataset of 43,843 enzymes from the PDB and achieved 92.34% correct classification when sequence information is considered, demonstrating an improvement compared with the previous result.
引用
收藏
页数:12
相关论文
共 39 条
[1]   Iterated profile searches with PSI-BLAST - a tool for discovery in protein databases [J].
Altschul, SF ;
Koonin, EV .
TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (11) :444-447
[2]   EnzyNet: enzyme classification using 3D convolutional neural networks on spatial representation [J].
Amidi, Afshine ;
Amidi, Shervine ;
Vlachakis, Dimitrios ;
Megalooikonomou, Vasileios ;
Paragios, Nikos ;
Zacharaki, Evangelia, I .
PEERJ, 2018, 6
[3]   A Machine Learning Methodology for Enzyme Functional Classification Combining Structural and Protein Sequence Descriptors [J].
Amidi, Afshine ;
Amidi, Shervine ;
Vlachakis, Dimitrios ;
Paragios, Nikos ;
Zacharaki, Evangelia I. .
BIOINFORMATICS AND BIOMEDICAL ENGINEERING (IWBBIO 2016), 2016, 9656 :728-738
[4]   Automatic single- and multi-label enzymatic function prediction by machine learning [J].
Amidi, Shervine ;
Amidi, Afshine ;
Vlachakis, Dimitrios ;
Paragios, Nikos ;
Zacharaki, Evangelia I. .
PEERJ, 2017, 5
[5]  
Blomberg N., 2015, PROTEIN-STRUCT FUNCT, V37, P379, DOI [10.1002/(SICI)1097-0134(19991115)37:3, DOI 10.1002/(SICI)1097-0134(19991115)37:3]
[6]  
Borro LC, 2006, GENET MOL RES, V5, P193
[7]   AnimoAminoMiner: Exploration of Protein Tunnels and their Properties in Molecular Dynamics [J].
Byska, Jan ;
Le Muzic, Mathieu ;
Groller, M. Eduard ;
Viola, Ivan ;
Kozlikova, Barbora .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2016, 22 (01) :747-756
[8]   ProLanGO: Protein Function Prediction Using Neural Machine Translation Based on a Recurrent Neural Network [J].
Cao, Renzhi ;
Freitas, Colton ;
Chan, Leong ;
Sun, Miao ;
Jiang, Haiqing ;
Chen, Zhangxin .
MOLECULES, 2017, 22 (10)
[9]   PSSM-Suc: Accurately predicting succinylation using position specific scoring matrix into bigram for feature extraction [J].
Dehzangi, Abdollah ;
Lopez, Yosvany ;
Lal, Sunil Pranit ;
Taherzadeh, Ghazaleh ;
Michaelson, Jacob ;
Sattar, Abdul ;
Tsunoda, Tatsuhiko ;
Sharma, Alok .
JOURNAL OF THEORETICAL BIOLOGY, 2017, 425 :97-102
[10]  
Evangelia I., 2017, PEERJ COMPUT SCI, V3, pe124