Protein secondary structure prediction using different encoding schemes and neural network architectures

被引:0
作者
Zhong, W [1 ]
Pan, Y [1 ]
Harrison, R [1 ]
Tai, PC [1 ]
机构
[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30303 USA
来源
DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS, AND TECHNOLOGY VI | 2004年 / 5433卷
关键词
Multilayer Perceptron (MLP); protein secondary structure prediction; encoding scheme; orthogonal matrix; hydrophobicity matrix; BLOSUM62; matrix;
D O I
10.1117/12.542225
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protein secondary structure prediction is very important for drug design, protein engineering and immunological studies. This research uses fully connected multilayer perceptron (MLP) neural network with one, two and three hidden layers to predict protein secondary structure. Orthogonal matrix, BLOSUM62 matrix and hydrophobicity matrix are used for input profiles. To increase the input information for neural networks, the combined matrix from BLOSLTM62 and orthogonal matrix and the combined matrix from BLOSUM62 and hydrophobicity matrix are also experimented. Binary classifiers indicate test accuracy of one hidden layer is better than that of two and three hidden layers. This may indicate that increasing complexity of architecture may not help neural network to recognize structural pattern of protein sequence more accurately. The results also show that the combined input profile of BLOSUM62 matrix and orthogonal matrix is the best one among five encoding schemes. While accuracy of the tertiary classifier reaches 63.20%, binary classifier for H/similar toH is 78.70%, which is comparable to other researchers' results.
引用
收藏
页码:74 / 79
页数:6
相关论文
共 9 条