Prediction of protein-protein interaction sites using support vector machines

被引:136
|
作者
Koike, A
Takagi, T
机构
[1] Univ Tokyo, Grad Sch Frontier Sci, Dept Computat Biol, Kashiwa, Chiba 2778561, Japan
[2] Hitachi Ltd, Cent Res Lab, Kokubunji, Tokyo 1858601, Japan
来源
PROTEIN ENGINEERING DESIGN & SELECTION | 2004年 / 17卷 / 02期
关键词
accessible surface area; hydrophobicity; interaction site ratio; protein interaction site; support vector machine;
D O I
10.1093/protein/gzh020
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The identification of protein-protein interaction sites is essential for the mutant design and prediction of protein-protein networks. The interaction sites of residue units were predicted using support vector machines (SVM) and the profiles of sequentially/spatially neighboring residues, plus additional information. When only sequence information was used, prediction performance was highest using the feature vectors, sequentially neighboring profiles and predicted interaction site ratios, which were calculated by SVM regression using amino acid compositions. When structural information was also used, prediction performance was highest using the feature vectors, spatially neighboring residue profiles, accessible surface areas, and the with/without protein interaction sites ratios predicted by SVM regression and amino acid compositions. In the latter case, the precision at recall = 50% was 54-56% for a homo-hetero mixed test set and >20% higher than for random prediction. Approximately 30% of the residues wrongly predicted as interaction sites were the closest sequentially/spatially neighboring on the interaction site residues. The predicted residues covered 86-87% of the actual interfaces (96-97% of interfaces with over 20 residues). This prediction performance appeared to be slightly higher than a previously reported study. Comparing the prediction accuracy of each molecule, it seems to be easier to predict interaction sites for stable complexes.
引用
收藏
页码:165 / 173
页数:9
相关论文
共 50 条
  • [31] Using Support Vector Machines for numerical prediction
    Hussain, Shahid
    Khamisani, Vaqar
    INMIC 2007: PROCEEDINGS OF THE 11TH IEEE INTERNATIONAL MULTITOPIC CONFERENCE, 2007, : 88 - 92
  • [32] Large-scale Protein-Protein Interaction prediction using novel kernel methods
    Chen, Xue-wen
    Han, Bing
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2008, 2 (02) : 145 - 156
  • [33] Computational Prediction of Protein-Protein Interaction Networks: Algorithms and Resources
    Zahiri, Javad
    Bozorgmehr, Joseph Hannon
    Masoudi-Nejad, Ali
    CURRENT GENOMICS, 2013, 14 (06) : 397 - 414
  • [34] Prediction of Protein-Protein Interaction Based Only on Coding Sequences
    Wang, Yongcui
    Wang, Ji-Guang
    Yang, Zhi-Xia
    Deng, Naiyang
    OPTIMIZATION AND SYSTEMS BIOLOGY, 2009, 11 : 151 - +
  • [35] Novel Domain Identification Approach for Protein-protein Interaction Prediction
    Shatnawi, Maad
    Zaki, Nazar M.
    2015 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2015, : 145 - 152
  • [36] Some Remarks on Prediction of Protein-Protein Interaction with Machine Learning
    Zhang, Shao-Wu
    Wei, Ze-Gang
    MEDICINAL CHEMISTRY, 2015, 11 (03) : 254 - 264
  • [37] Prediction of Protein-Protein Interactions Based on Protein-Protein Correlation Using Least Squares Regression
    Huang, De-Shuang
    Zhang, Lei
    Han, Kyungsook
    Deng, Suping
    Yang, Kai
    Zhang, Hongbo
    CURRENT PROTEIN & PEPTIDE SCIENCE, 2014, 15 (06) : 553 - 560
  • [38] Sequence-Based Prediction of Protein-Peptide Binding Sites Using Support Vector Machine
    Taherzadeh, Ghazaleh
    Yang, Yuedong
    Zhang, Tuo
    Liew, Alan Wee-Chung
    Zhou, Yaoqi
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2016, 37 (13) : 1223 - 1229
  • [39] Prediction of Protein Residue Contact Using Support Vector Machine
    Chan, Weng Howe
    Mohamad, Mohd Saberi
    KNOWLEDGE TECHNOLOGY, 2012, 295 : 323 - 332
  • [40] Sequence-Based Prediction of Protein Folding Rates Using Contacts, Secondary Structures and Support Vector Machines
    Lin, Guan Ning
    Wang, Zheng
    Xu, Dong
    Cheng, Jianlin
    2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2009, : 3 - +