enzyme function prediction;
DCNN;
amino acid sequence;
mutation information;
ELECTRON-TRANSPORT PROTEINS;
BASIS FUNCTION NETWORKS;
MOLECULAR FUNCTIONS;
SEQUENCE;
VISUALIZATION;
D O I:
10.3390/ijms20112845
中图分类号:
Q5 [生物化学];
Q7 [分子生物学];
学科分类号:
071010 ;
081704 ;
摘要:
During the past decade, due to the number of proteins in PDB database being increased gradually, traditional methods cannot better understand the function of newly discovered enzymes in chemical reactions. Computational models and protein feature representation for predicting enzymatic function are more important. Most of existing methods for predicting enzymatic function have used protein geometric structure or protein sequence alone. In this paper, the functions of enzymes are predicted from many-sided biological information including sequence information and structure information. Firstly, we extract the mutation information from amino acids sequence by the position scoring matrix and express structure information with amino acids distance and angle. Then, we use histogram to show the extracted sequence and structural features respectively. Meanwhile, we establish a network model of three parallel Deep Convolutional Neural Networks (DCNN) to learn three features of enzyme for function prediction simultaneously, and the outputs are fused through two different architectures. Finally, The proposed model was investigated on a large dataset of 43,843 enzymes from the PDB and achieved 92.34% correct classification when sequence information is considered, demonstrating an improvement compared with the previous result.
机构:
Sanford Bumham Med Res Inst, Program Bioinformat & Syst Biol, La Jolla, CA 92037 USASanford Bumham Med Res Inst, Program Bioinformat & Syst Biol, La Jolla, CA 92037 USA
机构:
Purdue Univ, Coll Sci, Dept Biol Sci, W Lafayette, IN 47907 USAPurdue Univ, Coll Sci, Dept Biol Sci, W Lafayette, IN 47907 USA
Hawkins, Troy
;
Chitale, Meghana
论文数: 0引用数: 0
h-index: 0
机构:
Purdue Univ, Coll Sci, Dept Comp Sci, W Lafayette, IN 47907 USAPurdue Univ, Coll Sci, Dept Biol Sci, W Lafayette, IN 47907 USA
Chitale, Meghana
;
Luban, Stanislav
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif San Diego, Interdisciplinary Bioinformat Program, La Jolla, CA 92093 USAPurdue Univ, Coll Sci, Dept Biol Sci, W Lafayette, IN 47907 USA
Luban, Stanislav
;
Kihara, Daisuke
论文数: 0引用数: 0
h-index: 0
机构:
Purdue Univ, Coll Sci, Dept Biol Sci, W Lafayette, IN 47907 USA
Purdue Univ, Coll Sci, Dept Comp Sci, W Lafayette, IN 47907 USA
Purdue Univ, Coll Sci, Markey Ctr Struct Biol, W Lafayette, IN 47907 USAPurdue Univ, Coll Sci, Dept Biol Sci, W Lafayette, IN 47907 USA
机构:
Stockholm Univ, Ctr Biomembrane Res, Dept Biochem & Biophys, SE-10691 Stockholm, Sweden
Stockholm Univ, Stockholm Bioinformat Ctr, Dept Biochem & Biophys, SE-10691 Stockholm, SwedenStockholm Univ, Ctr Biomembrane Res, Dept Biochem & Biophys, SE-10691 Stockholm, Sweden
Illergard, Kristoffer
;
Ardell, David H.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif, Dept Nat Sci, Sch Nat Sci, Merced, CA 95344 USA
Uppsala Univ, Linnaeus Ctr Bioinformat, SE-75124 Uppsala, SwedenStockholm Univ, Ctr Biomembrane Res, Dept Biochem & Biophys, SE-10691 Stockholm, Sweden
Ardell, David H.
;
Elofison, Arne
论文数: 0引用数: 0
h-index: 0
机构:
Stockholm Univ, Ctr Biomembrane Res, Dept Biochem & Biophys, SE-10691 Stockholm, Sweden
Stockholm Univ, Stockholm Bioinformat Ctr, Dept Biochem & Biophys, SE-10691 Stockholm, SwedenStockholm Univ, Ctr Biomembrane Res, Dept Biochem & Biophys, SE-10691 Stockholm, Sweden
机构:
Sanford Bumham Med Res Inst, Program Bioinformat & Syst Biol, La Jolla, CA 92037 USASanford Bumham Med Res Inst, Program Bioinformat & Syst Biol, La Jolla, CA 92037 USA
机构:
Purdue Univ, Coll Sci, Dept Biol Sci, W Lafayette, IN 47907 USAPurdue Univ, Coll Sci, Dept Biol Sci, W Lafayette, IN 47907 USA
Hawkins, Troy
;
Chitale, Meghana
论文数: 0引用数: 0
h-index: 0
机构:
Purdue Univ, Coll Sci, Dept Comp Sci, W Lafayette, IN 47907 USAPurdue Univ, Coll Sci, Dept Biol Sci, W Lafayette, IN 47907 USA
Chitale, Meghana
;
Luban, Stanislav
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif San Diego, Interdisciplinary Bioinformat Program, La Jolla, CA 92093 USAPurdue Univ, Coll Sci, Dept Biol Sci, W Lafayette, IN 47907 USA
Luban, Stanislav
;
Kihara, Daisuke
论文数: 0引用数: 0
h-index: 0
机构:
Purdue Univ, Coll Sci, Dept Biol Sci, W Lafayette, IN 47907 USA
Purdue Univ, Coll Sci, Dept Comp Sci, W Lafayette, IN 47907 USA
Purdue Univ, Coll Sci, Markey Ctr Struct Biol, W Lafayette, IN 47907 USAPurdue Univ, Coll Sci, Dept Biol Sci, W Lafayette, IN 47907 USA
机构:
Stockholm Univ, Ctr Biomembrane Res, Dept Biochem & Biophys, SE-10691 Stockholm, Sweden
Stockholm Univ, Stockholm Bioinformat Ctr, Dept Biochem & Biophys, SE-10691 Stockholm, SwedenStockholm Univ, Ctr Biomembrane Res, Dept Biochem & Biophys, SE-10691 Stockholm, Sweden
Illergard, Kristoffer
;
Ardell, David H.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif, Dept Nat Sci, Sch Nat Sci, Merced, CA 95344 USA
Uppsala Univ, Linnaeus Ctr Bioinformat, SE-75124 Uppsala, SwedenStockholm Univ, Ctr Biomembrane Res, Dept Biochem & Biophys, SE-10691 Stockholm, Sweden
Ardell, David H.
;
Elofison, Arne
论文数: 0引用数: 0
h-index: 0
机构:
Stockholm Univ, Ctr Biomembrane Res, Dept Biochem & Biophys, SE-10691 Stockholm, Sweden
Stockholm Univ, Stockholm Bioinformat Ctr, Dept Biochem & Biophys, SE-10691 Stockholm, SwedenStockholm Univ, Ctr Biomembrane Res, Dept Biochem & Biophys, SE-10691 Stockholm, Sweden