Protein secondary structure prediction using different encoding schemes and neural network architectures

被引：0

作者：

Zhong, W ^{[1
]}

Pan, Y ^{[1
]}

Harrison, R ^{[1
]}

Tai, PC ^{[1
]}

机构：

[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30303 USA

来源：

DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS, AND TECHNOLOGY VI | 2004年 / 5433卷

关键词：

Multilayer Perceptron (MLP); protein secondary structure prediction; encoding scheme; orthogonal matrix; hydrophobicity matrix; BLOSUM62; matrix;

D O I：

10.1117/12.542225

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Protein secondary structure prediction is very important for drug design, protein engineering and immunological studies. This research uses fully connected multilayer perceptron (MLP) neural network with one, two and three hidden layers to predict protein secondary structure. Orthogonal matrix, BLOSUM62 matrix and hydrophobicity matrix are used for input profiles. To increase the input information for neural networks, the combined matrix from BLOSLTM62 and orthogonal matrix and the combined matrix from BLOSUM62 and hydrophobicity matrix are also experimented. Binary classifiers indicate test accuracy of one hidden layer is better than that of two and three hidden layers. This may indicate that increasing complexity of architecture may not help neural network to recognize structural pattern of protein sequence more accurately. The results also show that the combined input profile of BLOSUM62 matrix and orthogonal matrix is the best one among five encoding schemes. While accuracy of the tertiary classifier reaches 63.20%, binary classifier for H/similar toH is 78.70%, which is comparable to other researchers' results.

引用

页码：74 / 79

页数：6

共 9 条

[1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
Altschul, SF
Madden, TL
Schaffer, AA
Zhang, JH
Zhang, Z
Miller, W
Lipman, DJ
[J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
[2] AMINO-ACID SUBSTITUTION MATRICES FROM PROTEIN BLOCKS
HENIKOFF, S
HENIKOFF, JG
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (22) : 10915 - 10919
[3] A novel method of protein secondary structure prediction with high segment overlap measure: Support vector machine approach
Hua, SJ
Sun, ZR
[J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 308 (02) : 397 - 407
[4] KARP G, 2002, CELL MOL BIOL, P55
[5] Protein secondary structure prediction based on an improved support vector machines approach
Kim, H
Park, H
[J]. PROTEIN ENGINEERING, 2003, 16 (08): : 553 - 560
[6] COMPARING THE POLARITIES OF THE AMINO-ACIDS - SIDE-CHAIN DISTRIBUTION COEFFICIENTS BETWEEN THE VAPOR-PHASE, CYCLOHEXANE, 1-OCTANOL, AND NEUTRAL AQUEOUS-SOLUTION
RADZICKA, A
WOLFENDEN, R
[J]. BIOCHEMISTRY, 1988, 27 (05) : 1664 - 1670
[7] COMBINING EVOLUTIONARY INFORMATION AND NEURAL NETWORKS TO PREDICT PROTEIN SECONDARY STRUCTURE
ROST, B
SANDER, C
[J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1994, 19 (01) : 55 - 72
[8] PREDICTION OF PROTEIN SECONDARY STRUCTURE AT BETTER THAN 70-PERCENT ACCURACY
ROST, B
SANDER, C
[J]. JOURNAL OF MOLECULAR BIOLOGY, 1993, 232 (02) : 584 - 599
[9] ROST B, 1992, INT J NEURAL SYST, V3, P209

← 1 →