Prediction of Protein-Protein Interaction with Pairwise Kernel Support Vector Machine

被引:49
作者
Zhang, Shao-Wu [1 ,2 ]
Hao, Li-Yang [1 ]
Zhang, Ting-He [1 ]
机构
[1] Northwestern Polytech Univ, Coll Automat, Xian 710072, Peoples R China
[2] Minist Educ, Key Lab Informat Fus Technol, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
amino acid distance frequency; amino acid index distribution; protein-protein interaction; pairwise kernel function; support vector machine; AMINO-ACID-COMPOSITION; SUBCELLULAR LOCATION; SEQUENCES; CLASSIFICATION; INFORMATION; PARAMETERS; NETWORKS;
D O I
10.3390/ijms15023220
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein-protein interactions (PPIs) play a key role in many cellular processes. Unfortunately, the experimental methods currently used to identify PPIs are both time-consuming and expensive. These obstacles could be overcome by developing computational approaches to predict PPIs. Here, we report two methods of amino acids feature extraction: (i) distance frequency with PCA reducing the dimension (DFPCA) and (ii) amino acid index distribution (AAID) representing the protein sequences. In order to obtain the most robust and reliable results for PPI prediction, pairwise kernel function and support vector machines (SVM) were employed to avoid the concatenation order of two feature vectors generated with two proteins. The highest prediction accuracies of AAID and DFPCA were 94% and 93.96%, respectively, using the 10 CV test, and the results of pairwise radial basis kernel function are considerably improved over those based on radial basis kernel function. Overall, the PPI prediction tool, termed PPI-PKSVM, which is freely available at http://159.226.118.31/PPI/index.html, promises to become useful in such areas as bio-analysis and drug development.
引用
收藏
页码:3220 / 3233
页数:14
相关论文
共 51 条
[1]   Kernel methods for predicting protein-protein interactions [J].
Ben-Hur, A ;
Noble, WS .
BIOINFORMATICS, 2005, 21 :I38-I46
[2]   Predicting protein-protein interactions from primary structure [J].
Bock, JR ;
Gough, DA .
BIOINFORMATICS, 2001, 17 (05) :455-460
[3]   THE STRUCTURAL DEPENDENCE OF AMINO-ACID HYDROPHOBICITY PARAMETERS [J].
CHARTON, M ;
CHARTON, BI .
JOURNAL OF THEORETICAL BIOLOGY, 1982, 99 (04) :629-644
[4]   iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition [J].
Chen, Wei ;
Feng, Peng-Mian ;
Lin, Hao ;
Chou, Kuo-Chen .
NUCLEIC ACIDS RESEARCH, 2013, 41 (06) :e68
[5]   Prediction of protein-protein interaction types using the decision templates based on multiple classier fusion [J].
Chen, Wei ;
Zhang, Shao-Wu ;
Cheng, Yong-Mei ;
Pan, Quan .
MATHEMATICAL AND COMPUTER MODELLING, 2010, 52 (11-12) :2075-2084
[6]   An automated growth enclosure for metabolic labeling of Arabidopsis thaliana with 13C-carbon dioxide - an in vivo labeling system for proteomics and metabolomics research [J].
Chen, Wen-Ping ;
Yang, Xiao-Yuan ;
Harms, Geoffrey L. ;
Gray, William M. ;
Hegeman, Adrian D. ;
Cohen, Jerry D. .
PROTEOME SCIENCE, 2011, 9
[7]   Large-scale Protein-Protein Interaction prediction using novel kernel methods [J].
Chen, Xue-wen ;
Han, Bing .
INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2008, 2 (02) :145-156
[8]  
Chothia C., 1999, ANNU REV BIOCHEM, V59, P1007
[9]   Predicting protein-protein interactions from sequences in a hybridization space [J].
Chou, KC ;
Cai, YD .
JOURNAL OF PROTEOME RESEARCH, 2006, 5 (02) :316-322
[10]   PREDICTION OF PROTEIN STRUCTURAL CLASSES [J].
CHOU, KC ;
ZHANG, CT .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) :275-349