Predicting Protein Subcellular Localization by Pseudo Amino Acid Composition with a Segment-Weighted and Features-Combined Approach

Cited by: 15
Authors
Wang, Wei [1]
Geng, XingBo [2]
Dou, Yongchao [2]
Liu, Taigang [3]
Zheng, Xiaoqi [1, 4]
Affiliations
[1] Shanghai Normal Univ, Dept Math, Shanghai 200234, Peoples R China
[2] Dalian Univ Technol, Dept Appl Math, Dalian 116024, Peoples R China
[3] Shandong Agr Univ, Coll Informat Sci & Engn, Tai An 271018, Shandong, Peoples R China
[4] Sci Comp Key Lab Shanghai Univ, Shanghai 200234, Peoples R China
Source
PROTEIN AND PEPTIDE LETTERS | 2011 / Vol. 18 / Issue 05
Funding
National Natural Science Foundation of China;
Keywords
Jackknife test; mature protein; optimal splice site; pseudo amino acid composition; sorting signal; subcellular localization; SUPPORT VECTOR MACHINES; FUNCTIONAL DOMAIN COMPOSITION; STRUCTURAL CLASS PREDICTION; ENZYME SUBFAMILY CLASSES; LOCATION PREDICTION; SIGNAL PEPTIDES; APOPTOSIS PROTEINS; CLEAVAGE SITES; TRANSMEMBRANE PROTEINS; APPROXIMATE ENTROPY;
DOI
10.2174/092986611794927947
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Information about protein subcellular location plays an important role in molecular cell biology. Predicting the subcellular location of proteins helps in understanding their functions and interactions. In this paper, a different mode of pseudo amino acid composition is proposed to represent protein samples for predicting their subcellular localization. The procedure is as follows: based on the optimal splice site of each protein sequence, the sequence is divided into a sorting signal part and a mature protein part, and sequence features are extracted from each part separately. The combined features are then fed into an SVM classifier to perform the prediction. In a jackknife test on a benchmark dataset in which no protein has more than 90% pairwise sequence identity to any other, the method achieves overall accuracies of 94.5% and 90.3% for prokaryotic and eukaryotic proteins, respectively. The results indicate that the prediction quality of the method is quite satisfactory. It is anticipated that the method may serve as an alternative to existing prediction methods.
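The segment-then-combine idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The following are assumptions made for illustration only: the split index is supplied by the caller (the abstract does not say how the optimal splice site is found), a single physicochemical property (Kyte-Doolittle hydropathy) stands in for whatever properties the paper uses, the pseudo components follow the general form of Chou's pseudo amino acid composition, and the names `pse_aac`, `segment_features`, `lam`, `w`, `w_signal`, and `w_mature` are hypothetical.

```python
from math import sqrt

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Kyte-Doolittle hydropathy values; a stand-in property chosen for this
# sketch, not necessarily the one used in the paper.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def _normalize(scale):
    """Rescale a property to zero mean and unit standard deviation."""
    mean = sum(scale.values()) / len(scale)
    std = sqrt(sum((v - mean) ** 2 for v in scale.values()) / len(scale))
    return {aa: (v - mean) / std for aa, v in scale.items()}


H = _normalize(KD)


def pse_aac(seq, lam=2, w=0.05):
    """Return 20 composition terms plus `lam` sequence-order terms."""
    residues = [aa for aa in seq.upper() if aa in H]  # skip unknown symbols
    n = len(residues)
    if n == 0:
        return [0.0] * (20 + lam)
    freqs = [residues.count(aa) / n for aa in AMINO_ACIDS]
    thetas = []
    for k in range(1, lam + 1):
        if n > k:
            # Mean squared property difference between residues k apart.
            total = sum((H[residues[i]] - H[residues[i + k]]) ** 2
                        for i in range(n - k))
            thetas.append(total / (n - k))
        else:
            thetas.append(0.0)  # segment too short for this lag
    denom = sum(freqs) + w * sum(thetas)
    return [f / denom for f in freqs] + [w * t / denom for t in thetas]


def segment_features(seq, split, w_signal=1.0, w_mature=1.0, lam=2):
    """Weight and concatenate features of the two segments.

    `split` is a caller-supplied index; `w_signal` and `w_mature`
    are hypothetical segment weights.
    """
    signal, mature = seq[:split], seq[split:]
    return ([w_signal * x for x in pse_aac(signal, lam)]
            + [w_mature * x for x in pse_aac(mature, lam)])


features = segment_features("MKKLLAAVVGGSTPEDRQ", split=6)
print(len(features))
```

The resulting vectors could then be passed to any SVM implementation (for example scikit-learn's `sklearn.svm.SVC`) and evaluated with leave-one-out cross-validation, which corresponds to the jackknife test mentioned in the abstract.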
Pages: 480 - 487
Page count: 8
Related papers
50 items in total
  • [1] A novel method for predicting protein subcellular localization based on pseudo amino acid composition
    Ma, Junwei
    Gu, Hong
    BMB REPORTS, 2010, 43 (10) : 670 - 676
  • [2] Prediction of Rat Protein Subcellular Localization with Pseudo Amino Acid Composition Based on Multiple Sequential Features
    Shi, Ruijia
    Xu, Cunshuan
    PROTEIN AND PEPTIDE LETTERS, 2011, 18 (06) : 625 - 633
  • [3] Predicting protein-protein interactions by weighted pseudo amino acid composition
    Goktepe, Yunus Emre
    Ilhan, Ilhan
    Kahramanli, Sirzat
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 15 (03) : 272 - 290
  • [4] Using a Novel AdaBoost Algorithm and Chou's Pseudo Amino Acid Composition for Predicting Protein Subcellular Localization
    Lin, Jie
    Wang, Yan
    PROTEIN AND PEPTIDE LETTERS, 2011, 18 (12) : 1219 - 1225
  • [5] Application of Pseudo Amino Acid Composition for Predicting Protein Subcellular Location: Stochastic Signal Processing Approach
    Yu-Xi Pan
    Zhi-Zhou Zhang
    Zong-Ming Guo
    Guo-Yin Feng
    Zhen-De Huang
    Lin He
    Journal of Protein Chemistry, 2003, 22 : 395 - 402
  • [6] Application of pseudo amino acid composition for predicting protein subcellular location: Stochastic signal processing approach
    Pan, YX
    Zhang, ZZ
    Guo, ZM
    Feng, GY
    Huang, ZD
    He, L
    JOURNAL OF PROTEIN CHEMISTRY, 2003, 22 (04) : 395 - 402
  • [7] Predicting subcellular localization of proteins by hybridizing functional domain composition and pseudo-amino acid composition
    Chou, KC
    Cai, YD
    JOURNAL OF CELLULAR BIOCHEMISTRY, 2004, 91 (06) : 1197 - 1203
  • [8] Predicting subcellular localization of mycobacterial proteins by using Chou's pseudo amino acid composition
    Lin, Hao
    Ding, Hui
    Guo, Feng-Biao
    Zhang, An-Ying
    Huang, Jian
    PROTEIN AND PEPTIDE LETTERS, 2008, 15 (07) : 739 - 744
  • [9] Predicting protein subcellular location using Chou's pseudo amino acid composition and improved hybrid approach
    Li, Feng-Min
    Li, Qian-Zhong
    PROTEIN AND PEPTIDE LETTERS, 2008, 15 (06) : 612 - 616
  • [10] Predicting Protein Solubility with a Hybrid Approach by Pseudo Amino Acid Composition
    Niu Xiaohui
    Li Nana
    Shi Feng
    Hu Xuehai
    Xia Jingbo
    Xiong Huijuan
    PROTEIN AND PEPTIDE LETTERS, 2010, 17 (12) : 1466 - 1472