Novel hybrid method for the evaluation of parameters contributing in determination of protein structural classes

被引:30
作者
Jahandideh, Samad
Abdolmaleki, Parviz
Jahandideh, Mina
Hayatshahi, Sayyed Hamed Sadat
机构
[1] Tarbiat Modares Univ, Fac Sci, Dept Biophys, Tehran, Iran
[2] Vali E Asr Univ, Fac Sci, Dept Math, Rafsanjan, Iran
关键词
multinomial logistic regression model; artificial neural network (ANN); sequence parameters; amino acid composition; protein structural class;
D O I
10.1016/j.jtbi.2006.08.011
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Due to the increasing gap between structure-determined and sequenced proteins, prediction of protein structural classes has been an important problem. It is very important to use efficient sequential parameters for developing class predictors because of the close sequence-structure relationship. The multinomial logistic regression model was used for the first time to evaluate the contribution of sequence parameters in determining the protein structural class. An in-house program generated parameters including single amino acid and all dipeptide composition frequencies. Then, the most effective parameters were selected by a multinomial logistic regression. Selected variables in the multinomial logistic model were Valine among single amino acid composition frequencies and Ala-Gly, Cys-Arg, Asp-Cys, Glu-Tyr, Gly-Glu, His-Tyr, Lys-Lys, Leu-Asp, Leu-Arg, Pro-Cys, Gln-Met, Gln-Thr, Ser-Trp, Val-Asn and Trp-Asn among dipeptide composition frequencies. Also a neural network model was constructed and fed by the parameters selected by multinomial logistic regression to build a hybrid predictor. In this study, self-consistency and jackknife tests on a database constructed by Zhou [1998. An intriguing controversy over protein structural class prediction. J. Protein Chem. 17(8), 729-738] containing 498 proteins are used to verify the performance of this hybrid method, and are compared with some of prior works. The results showed that our two-stage hybrid model approach is very promising and may play a complementary role to the existing powerful approaches. (c) 2006 Elsevier Ltd. All rights reserved.
引用
收藏
页码:275 / 281
页数:7
相关论文
共 23 条
[1]   PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS [J].
ANFINSEN, CB .
SCIENCE, 1973, 181 (4096) :223-230
[2]  
[Anonymous], 1989, Applied Logistic Regression
[3]   Support Vector Machines for predicting protein structural class [J].
Cai, Yu-Dong ;
Liu, Xiao-Jun ;
Xu, Xue-biao ;
Zhou, Guo-Ping .
BMC BIOINFORMATICS, 2001, 2 (1)
[4]   Prediction of protein structural class with Rough Sets [J].
Cao, YF ;
Liu, S ;
Zhang, LD ;
Qin, J ;
Wang, J ;
Tang, KX .
BMC BIOINFORMATICS, 2006, 7 (1)
[5]   Domain structural class prediction [J].
Chou, KC ;
Maggiora, GM .
PROTEIN ENGINEERING, 1998, 11 (07) :523-538
[6]   PREDICTION OF PROTEIN CONFORMATION [J].
CHOU, PY ;
FASMAN, GD .
BIOCHEMISTRY, 1974, 13 (02) :222-245
[7]   ANALYSIS OF ACCURACY AND IMPLICATIONS OF SIMPLE METHODS FOR PREDICTING SECONDARY STRUCTURE OF GLOBULAR PROTEINS [J].
GARNIER, J ;
OSGUTHORPE, DJ ;
ROBSON, B .
JOURNAL OF MOLECULAR BIOLOGY, 1978, 120 (01) :97-120
[8]   Protein secondary structure prediction in different structural classes [J].
Gromiha, MM ;
Selvaraj, S .
PROTEIN ENGINEERING, 1998, 11 (04) :249-251
[9]  
Ito M, 1997, COMPUT APPL BIOSCI, V13, P415
[10]   Identification and application of the concepts important for accurate and reliable protein secondary structure prediction [J].
King, RD ;
Sternberg, MJE .
PROTEIN SCIENCE, 1996, 5 (11) :2298-2310