Information of binding sites improves prediction of protein-protein interaction

被引:0
作者
Patel, Tapan [1 ]
Pillay, Manoj [1 ]
Jawa, Rahul [1 ]
Liao, Li [1 ]
机构
[1] Univ Delaware, Dept Comp & Informat Sci, Newark, DE 19716 USA
来源
ICMLA 2006: 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS | 2006年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protein-Protein interaction is essential to cellular Junctions. In this work, we describe a simple, novel approach to improve the accuracy of predicting protein-protein interaction by incorporating the binding site information. First, we assess the importance of the seven attributes that are used by Bradford et. al (2005) for predicting protein binding sites. The leave-one-out cross validation experiments and principal component analysis indicate that some attributes such as residue propensity and hydrophobicity are more important than other attributes such as curvedness and shape index in differentiating a binding patch from nonbinding patch. Second, we incorporate those attributes to predict protein-protein interaction by simple concatenation of the attribute vectors of candidate interacting partners. A support vector machine is trained to predict the interacting partners. This is combined with using the attributes directly derived from the primary sequence at the binding sites. The results from the leave-one-out cross validation experiments show significant improvement in prediction accuracy by incorporating the structural information at the binding sites.
引用
收藏
页码:205 / +
页数:2
相关论文
共 13 条
[1]   Kernel methods for predicting protein-protein interactions [J].
Ben-Hur, A ;
Noble, WS .
BIOINFORMATICS, 2005, 21 :I38-I46
[2]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[3]   Improved prediction of protein-protein binding sites using a support vector machines approach [J].
Bradford, JR ;
Westhead, DR .
BIOINFORMATICS, 2005, 21 (08) :1487-1494
[4]   Exploiting sequence and structure homologs to identify protein-protein binding sites [J].
Chung, JL ;
Wang, W ;
Bourne, PE .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 62 (03) :630-640
[5]  
Cristianini N., 2000, Intelligent Data Analysis: An Introduction, DOI 10.1017/CBO9780511801389
[6]   Discover protein sequence signatures from protein-protein interaction data [J].
Fang, JW ;
Haasl, RJ ;
Dong, YH ;
Lushington, GH .
BMC BIOINFORMATICS, 2005, 6 (1)
[7]   Use of receiver operating characteristic (ROC) analysis to evaluate sequence matching [J].
Gribskov, M ;
Robinson, NL .
COMPUTERS & CHEMISTRY, 1996, 20 (01) :25-33
[8]   A comprehensive two-hybrid analysis to explore the yeast protein interactome [J].
Ito, T ;
Chiba, T ;
Ozawa, R ;
Yoshida, M ;
Hattori, M ;
Sakaki, Y .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (08) :4569-4574
[9]  
Joachims J., 1999, ADV KERNEL METHODS S
[10]   Predicting protein-protein interactions using signature products [J].
Martin, S ;
Roe, D ;
Faulon, JL .
BIOINFORMATICS, 2005, 21 (02) :218-226