iLoc-Plant: a multi-label classifier for predicting the subcellular localization of plant proteins with both single and multiple sites

Cited by: 202
Authors
Wu, Zhi-Cheng [1, 2]
Xiao, Xuan [1, 2]
Chou, Kuo-Chen [2]
Affiliations
[1] Jing De Zhen Ceram Inst, Dept Comp, Jing De Zhen 333046, Peoples R China
[2] Gordon Life Sci Inst, San Diego, CA 92130 USA
Funding
National Natural Science Foundation of China
Keywords
AMINO-ACID-COMPOSITION; SUPPORT VECTOR MACHINES; GRAM-NEGATIVE BACTERIA; TERMINAL TARGETING SEQUENCES; IMPROVED HYBRID APPROACH; APOPTOSIS PROTEINS; LOCATION PREDICTION; GENE ONTOLOGY; SORTING SIGNALS; REPRESENTATION;
DOI
10.1039/c1mb05232b
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Predicting protein subcellular localization is a challenging problem, particularly when query proteins may simultaneously exist at, or move between, two or more subcellular location sites. Most existing methods can handle only single-location proteins. Multiple-location proteins should not be ignored, however, because they usually have special functions that merit attention. By introducing the "multi-labeled learning" approach, a new predictor called iLoc-Plant has been developed that can handle systems containing both single- and multiple-location plant proteins. As a demonstration, jackknife cross-validation was performed with iLoc-Plant on a benchmark dataset of plant proteins classified into the following 12 location sites: (1) cell membrane, (2) cell wall, (3) chloroplast, (4) cytoplasm, (5) endoplasmic reticulum, (6) extracellular, (7) Golgi apparatus, (8) mitochondrion, (9) nucleus, (10) peroxisome, (11) plastid, and (12) vacuole, where some proteins belong to two or three locations but none has ≥25% pairwise sequence identity to any other in the same subset. The overall success rate obtained by iLoc-Plant was 71%, which is remarkably higher than the rates achieved by any existing predictor that can also handle such a stringent and complicated plant protein system. As a user-friendly web server, iLoc-Plant is freely accessible to the public at http://icpr.jci.edu.cn/bioinfo/iLoc-Plant or http://www.jci-bioinfo.cn/iLoc-Plant. Moreover, for the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the web server to get the desired results without needing to follow the complicated mathematical equations, which are presented in the paper for completeness. It is anticipated that iLoc-Plant may become a useful bioinformatics tool for Molecular Cell Biology, Proteomics, Systems Biology, and Drug Development.
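The jackknife (leave-one-out) test mentioned in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the toy feature vectors, the nearest-neighbor stand-in predictor, and the exact-match scoring rule are hypothetical and are not taken from iLoc-Plant, whose features, classifier, and success-rate definition are given in the paper itself.

```python
from typing import Callable, List, Sequence, Set, Tuple

# One sample = (feature vector, set of subcellular location labels).
Sample = Tuple[Sequence[float], Set[str]]


def nearest_neighbor_predict(train: List[Sample], query: Sequence[float]) -> Set[str]:
    """Toy stand-in predictor (assumption): copy the label set of the
    closest training sample, measured by squared Euclidean distance."""
    def sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))

    _, labels = min(train, key=lambda s: sq_dist(s[0], query))
    return set(labels)


def jackknife_exact_match(
    data: List[Sample],
    predict: Callable[[List[Sample], Sequence[float]], Set[str]] = nearest_neighbor_predict,
) -> float:
    """Leave each sample out in turn, predict it from the rest, and return
    the fraction whose predicted label set equals the true label set."""
    hits = 0
    for i, (features, true_labels) in enumerate(data):
        train = data[:i] + data[i + 1:]  # every sample except the i-th
        if predict(train, features) == set(true_labels):
            hits += 1
    return hits / len(data)


# Hypothetical toy data: two single-location pairs and one isolated protein.
toy_data: List[Sample] = [
    ([0.0, 0.0], {"chloroplast"}),
    ([0.1, 0.0], {"chloroplast"}),
    ([1.0, 1.0], {"nucleus", "cytoplasm"}),
    ([1.1, 1.0], {"nucleus", "cytoplasm"}),
    ([5.0, 5.0], {"vacuole"}),
]

rate = jackknife_exact_match(toy_data)
print(f"jackknife exact-match rate: {rate:.2f}")
```

In this toy set the isolated vacuole protein has no similar neighbor once it is held out, so it is the one sample the stand-in predictor misses. Exact-match scoring is deliberately strict for multiple-location proteins, since a prediction that recovers only one of two true sites counts as a miss.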
Pages: 3287-3297
Number of pages: 11