Prediction of contact maps using support vector machines

Cited by: 5
Authors
Zhao, Y [1 ]
Karypis, G [1 ]
Affiliations
[1] Univ Minnesota, Dept Comp Sci, Minneapolis, MN 55455 USA
Funding
US National Science Foundation;
Keywords
contact map prediction; correlated mutation analysis; support vector machines;
DOI
10.1142/S0218213005002429
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Contact map prediction is of great interest for its applications in fold recognition and protein 3D structure determination. In this paper we present a contact map prediction algorithm that employs Support Vector Machines as the machine learning tool and incorporates several kinds of features: sequence profiles and their conservation, correlated mutation analysis based on various amino acid physicochemical properties, and secondary structure. In addition, we evaluate the effectiveness of the different features on contact map prediction for different fold classes. On average, our predictor achieved a prediction accuracy of 0.224, an 11.7-fold improvement over a random predictor, which is better than previously reported results. Our study showed that predicted secondary structure features play an important role for proteins containing beta-structures. Models based on secondary structure features and models based on correlated mutation analysis features produce different sets of predictions. Our study also suggests that models learned separately for different protein fold families may achieve better performance than a unified model.
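The two figures quoted in the abstract (prediction accuracy and improvement over a random predictor) can be illustrated with a minimal sketch. The definitions used here are assumptions based on the convention common in the contact prediction literature, not taken from this record: accuracy is the fraction of predicted contacts that are true contacts, and the random baseline is the fraction of all candidate residue pairs that are true contacts. The function names and the toy data are hypothetical.

```python
from typing import Set, Tuple

Pair = Tuple[int, int]  # (i, j) residue indices with i < j


def accuracy(predicted: Set[Pair], true_contacts: Set[Pair]) -> float:
    """Fraction of predicted contacts that are true contacts (assumed definition)."""
    if not predicted:
        return 0.0
    return len(predicted & true_contacts) / len(predicted)


def random_accuracy(true_contacts: Set[Pair], n_candidate_pairs: int) -> float:
    """Expected accuracy of picking candidate pairs uniformly at random (assumed baseline)."""
    return len(true_contacts) / n_candidate_pairs


def improvement_over_random(
    predicted: Set[Pair], true_contacts: Set[Pair], n_candidate_pairs: int
) -> float:
    """Ratio of the predictor's accuracy to the random baseline."""
    return accuracy(predicted, true_contacts) / random_accuracy(
        true_contacts, n_candidate_pairs
    )


# Toy data, purely illustrative: 4 predicted pairs, 3 true contacts,
# 100 candidate residue pairs in total.
predicted = {(1, 10), (2, 20), (3, 30), (4, 40)}
true_contacts = {(1, 10), (2, 20), (5, 50)}
n_candidates = 100

acc = accuracy(predicted, true_contacts)
gain = improvement_over_random(predicted, true_contacts, n_candidates)
print(f"accuracy = {acc:.3f}, improvement over random = {gain:.1f}x")
```

Under these assumed definitions, an improvement factor well above 1 means the predictor is far more selective than chance, even when the absolute accuracy looks low because true contacts are rare among all residue pairs.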
Pages: 849-865
Number of pages: 17