Prediction of protein-protein interaction sites using support vector machines

被引:136
|
作者
Koike, A
Takagi, T
机构
[1] Univ Tokyo, Grad Sch Frontier Sci, Dept Computat Biol, Kashiwa, Chiba 2778561, Japan
[2] Hitachi Ltd, Cent Res Lab, Kokubunji, Tokyo 1858601, Japan
来源
PROTEIN ENGINEERING DESIGN & SELECTION | 2004年 / 17卷 / 02期
关键词
accessible surface area; hydrophobicity; interaction site ratio; protein interaction site; support vector machine;
D O I
10.1093/protein/gzh020
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The identification of protein-protein interaction sites is essential for the mutant design and prediction of protein-protein networks. The interaction sites of residue units were predicted using support vector machines (SVM) and the profiles of sequentially/spatially neighboring residues, plus additional information. When only sequence information was used, prediction performance was highest using the feature vectors, sequentially neighboring profiles and predicted interaction site ratios, which were calculated by SVM regression using amino acid compositions. When structural information was also used, prediction performance was highest using the feature vectors, spatially neighboring residue profiles, accessible surface areas, and the with/without protein interaction sites ratios predicted by SVM regression and amino acid compositions. In the latter case, the precision at recall = 50% was 54-56% for a homo-hetero mixed test set and >20% higher than for random prediction. Approximately 30% of the residues wrongly predicted as interaction sites were the closest sequentially/spatially neighboring on the interaction site residues. The predicted residues covered 86-87% of the actual interfaces (96-97% of interfaces with over 20 residues). This prediction performance appeared to be slightly higher than a previously reported study. Comparing the prediction accuracy of each molecule, it seems to be easier to predict interaction sites for stable complexes.
引用
收藏
页码:165 / 173
页数:9
相关论文
共 50 条
  • [21] A novel representation of protein sequences for prediction of subcellular location using support vector machines
    Matsuda, S
    Vert, JP
    Saigo, H
    Ueda, N
    Toh, H
    Akutsu, T
    PROTEIN SCIENCE, 2005, 14 (11) : 2804 - 2813
  • [22] Protein-protein interaction sites prediction by ensembling SVM and sample-weighted random forests
    Wei, Zhi-Sen
    Han, Ke
    Yang, Jing-Yu
    Shen, Hong-Bin
    Yu, Dong-Jun
    NEUROCOMPUTING, 2016, 193 : 201 - 212
  • [23] Using Support Vector Machines for Prediction of Protein Structural Classes Based on Discrete Wavelet Transform
    Qiu, Jian-Ding
    Luo, San-Hua
    Huang, Jian-Hua
    Liang, Ru-Ping
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2009, 30 (08) : 1344 - 1350
  • [24] Enterobacteriaceae protein virulence prediction based on hybrid composition features using support vector machines
    Thirunavukkarasu, M.
    Dinakaran, K.
    Valarmathi, P.
    Shanmugam, Gnanendra
    Suryadevara, Nagaraja
    MATERIALS TODAY-PROCEEDINGS, 2019, 16 : 1603 - 1610
  • [25] PPIcons: identification of protein-protein interaction sites in selected organisms
    Brijesh K. Sriwastava
    Subhadip Basu
    Ujjwal Maulik
    Dariusz Plewczynski
    Journal of Molecular Modeling, 2013, 19 : 4059 - 4070
  • [26] PPIcons: identification of protein-protein interaction sites in selected organisms
    Sriwastava, Brijesh K.
    Basu, Subhadip
    Maulik, Ujjwal
    Plewczynski, Dariusz
    JOURNAL OF MOLECULAR MODELING, 2013, 19 (09) : 4059 - 4070
  • [27] Query-Adaptive Ranking with Support Vector Machines for Protein Homology Prediction
    Fu, Yan
    Pan, Rong
    Yang, Qiang
    Gao, Wen
    BIOINFORMATICS RESEARCH AND APPLICATIONS, 2011, 6674 : 320 - 331
  • [28] Multiclass fuzzy clustering support vector machines for protein local structure prediction
    Zhong, Wei
    He, Jieyue
    Pan, Yi
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 21 - +
  • [29] Inferring Protein-Protein Interactions Using a Hybrid Genetic Algorithm/Support Vector Machine Method
    Wang, Bing
    Chen, Peng
    Zhang, Jun
    Zhao, Guangxin
    Zhang, Xiang
    PROTEIN AND PEPTIDE LETTERS, 2010, 17 (09): : 1079 - 1084
  • [30] PreBIND and Textomy – mining the biomedical literature for protein-protein interactions using a support vector machine
    Ian Donaldson
    Joel Martin
    Berry de Bruijn
    Cheryl Wolting
    Vicki Lay
    Brigitte Tuekam
    Shudong Zhang
    Berivan Baskin
    Gary D Bader
    Katerina Michalickova
    Tony Pawson
    Christopher WV Hogue
    BMC Bioinformatics, 4