Predicting protein subchloroplast locations with both single and multiple sites via three different modes of Chou's pseudo amino acid compositions

被引:41
作者
Huang, Chao [1 ]
Yuan, Jing-Qi
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
关键词
Pseudo amino acid composition; ET_KNN; OET_KNN; Sub-subcellular localization; Subchloroplast locations; SUPPORT VECTOR MACHINE; POSITIVE BACTERIAL PROTEINS; LABEL LEARNING CLASSIFIER; ENZYME SUBFAMILY CLASSES; OUTER-MEMBRANE PROTEINS; SUBCELLULAR-LOCALIZATION; GENERAL-FORM; WEB-SERVER; SUBNUCLEAR LOCALIZATION; ARABIDOPSIS-THALIANA;
D O I
10.1016/j.jtbi.2013.06.034
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Owing to the fact that location information can indicate important functionalities of proteins, developing computational tools to predict protein subcellular localization is one of the most efficient and meaningful tasks with no doubt. The existence methods dealing with prediction of protein subchloroplast locations can only handle the case of single location site. Therefore, it is meaningful and challenging to make effort in how to deal with the proteins with multiple subchloroplast location sites instead of just excluding them. To solve this problem, new systems for predicting protein subchloroplast localization with single or multiple sites are developed and discussed in the paper. Three different editions of KNN algorithms and four different types of feature extraction were adopted to construct the prediction systems. This is the first effort to predict the proteins with multiple subchloroplast locations. The overall jackknife success rates achieved by the best combination (features+classifier) on three dataset with different levels of homology were 89.08%, 81.29% and 71.11%. The performance of the prediction models indicate that the proposed methods might be applied as a useful and efficient assistant tool for the prediction of sub-subcellular localizations. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:205 / 212
页数:8
相关论文
共 73 条
  • [1] propy: a tool to generate various modes of Chou's PseAAC
    Cao, Dong-Sheng
    Xu, Qing-Song
    Liang, Yi-Zeng
    [J]. BIOINFORMATICS, 2013, 29 (07) : 960 - 962
  • [2] EuLoc: a web-server for accurately predict protein subcellular localization in eukaryotes by incorporating various features of sequence segments into the general form of Chou's PseAAC
    Chang, Tzu-Hao
    Wu, Li-Ching
    Lee, Tzong-Yi
    Chen, Shu-Pin
    Huang, Hsien-Da
    Horng, Jorng-Tzong
    [J]. JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2013, 27 (01) : 91 - 103
  • [3] Dual-Layer Wavelet SVM for Predicting Protein Structural Class Via the General Form of Chou's Pseudo Amino Acid Composition
    Chen, Chao
    Shen, Zhi-Bin
    Zou, Xiao-Yong
    [J]. PROTEIN AND PEPTIDE LETTERS, 2012, 19 (04) : 422 - 429
  • [4] Prediction of Protein Secondary Structure Content by Using the Concept of Chou's Pseudo Amino Acid Composition and Support Vector Machine
    Chen, Chao
    Chen, Lixuan
    Zou, Xiaoyong
    Cai, Peixiang
    [J]. PROTEIN AND PEPTIDE LETTERS, 2009, 16 (01) : 27 - 31
  • [5] iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition
    Chen, Wei
    Feng, Peng-Mian
    Lin, Hao
    Chou, Kuo-Chen
    [J]. NUCLEIC ACIDS RESEARCH, 2013, 41 (06) : e68
  • [6] iNuc-PhysChem: A Sequence-Based Predictor for Identifying Nucleosomes via Physicochemical Properties
    Chen, Wei
    Lin, Hao
    Feng, Peng-Mian
    Ding, Chen
    Zuo, Yong-Chun
    Chou, Kuo-Chen
    [J]. PLOS ONE, 2012, 7 (10):
  • [7] Predicting membrane protein types by incorporating protein topology, domains, signal peptides, and physicochemical properties into the general form of Chou's pseudo amino acid composition
    Chen, Yen-Kuang
    Li, Kuo-Bin
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2013, 318 : 1 - 12
  • [8] Prediction of membrane protein types by incorporating amphipathic effects
    Chou, KC
    Cai, YD
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2005, 45 (02) : 407 - 413
  • [9] Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes
    Chou, KC
    [J]. BIOINFORMATICS, 2005, 21 (01) : 10 - 19
  • [10] PREDICTION OF PROTEIN STRUCTURAL CLASSES
    CHOU, KC
    ZHANG, CT
    [J]. CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) : 275 - 349