Prediction of protein subcellular localization by incorporating sequence and protein-protein interaction features

被引:0
作者
Wang, Ming-Hui [1 ]
Gong, Yi [1 ]
Wang, Qiang [1 ]
Feng, Huan-Qing [1 ]
Li, Ao [1 ]
机构
[1] School of Information Science and Technology, University of Science and Technology of China, Hefei
来源
Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China | 2015年 / 44卷 / 03期
关键词
Bioinformatics; K-nearest neighbor algorithm; Protein-protein interaction; Subcellular localization;
D O I
10.3969/j.issn.1001-0548.2015.03.026
中图分类号
学科分类号
摘要
Information of protein subcellular localization is indispensable to study protein function, as a protein can perform its function only after it is correctly transported to a specific subcellular compartment. Thus it is very important to provide accurate prediction of protein subcellular localization in biological studies. In contrast to sequence features (e.g. amino acids composition) that are widely used in subcellular localization prediction, features extracting protein-protein interaction (PPI) are largely ignored, although they reflect the co-localization information of different proteins. In this study, we propose a novel distance formula based on both protein sequence and PPI features, which precisely measures the similarity of proteins by incorporating protein information including amino acid composition, PPI and the corresponding interaction scores. Based on this distance formula, we further introduce a k-nearest neighbor (KNN) algorithm for predicting subcellular localization. The results of leave-one-out test on a benchmark dataset show that PPI features significantly improve the performance of protein subcellular localization. Meanwhile, this KNN algorithm also outperformes SVM algorithm adopting the same features, suggesting the efficiency of the proposed algorithm for predicting protein subcellular localization. ©, 2015, Univ. of Electronic Science and Technology of China. All right reserved.
引用
收藏
页码:467 / 470
页数:3
相关论文
共 12 条
[1]  
Kenichiro I., Kenta N., Prediction of subcellular locations of proteins: Where to proceed?, Proteomics, 10, pp. 3970-3983, (2010)
[2]  
Chou K.-C., Wu Z.-C., Xiao X., iLoc-Hum: Using the accumulation-label scale to predict subcellular locations of human proteins with both single and multiple sites, Mol BioSyst, 8, pp. 629-641, (2012)
[3]  
Du P.-F., Yu Y., SubMito-PSPCP: Predicting protein submitochondrial locations by hybridizing positional specific physicochemical properties with pseudoamino acid compositions, Biomed Res Int, (2013)
[4]  
Pierleoni A., Martelli P.L., Casadio R., MemLoci: Predicting subcellular localization of membrane proteins in eukaryotes, Bioinformatics, 27, 9, pp. 1224-1230, (2011)
[5]  
Xie D., Li A., Wang M.-H., Et al., LOCSVMPSI: a web server for subcellular localization of eukaryotic proteins using SVM and profile of PSI-BLAST, Nucleic Acids Research, 33, pp. 105-110, (2005)
[6]  
Li L.-Q., Zhang Y., Zou L.-Y., Et al., An ensemble classifier for eukaryotic protein subcellular location prediction using gene ontology categories and amino acid hydrophobicity, PLoS ONE, 7, 1, (2012)
[7]  
Marcin M., Marcin P., Janusz B.M., MetaLocGramN: a meta-predictor of protein subcellular localization for Gram-negative bacteria, Biochimica ET Biophysica Acta (BBA)-Proteins and Proteomics, 1824, 12, pp. 1425-1433, (2012)
[8]  
Chou K.-C., Shen H.-B., A new method for predicting the subcellular localization of eukaryotic proteins with both single and multiple sites: Euk-mPLoc 2.0, PLoS ONE, 5, 4, (2010)
[9]  
Liu H.-Q., Beck T.N., Golemis E.A., Et al., Integrating in silico resources to map a signaling network, Methods Mol Biol, 1101, pp. 197-245, (2014)
[10]  
Li B.-Q., You J., Chen L., Et al., Identification of lung-cancer-related genes with the shortest path approach in a protein-protein interaction network, BioMed Research International, (2013)