Plant-mPLoc: A Top-Down Strategy to Augment the Power for Predicting Plant Protein Subcellular Localization

被引:826
作者
Chou, Kuo-Chen [1 ,2 ]
Shen, Hong-Bin [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Proc & Pattern Recognit, Shanghai 200030, Peoples R China
[2] Gordon Life Sci Inst, San Diego, CA USA
来源
PLOS ONE | 2010年 / 5卷 / 06期
基金
中国国家自然科学基金;
关键词
AMINO-ACID-COMPOSITION; SUPPORT VECTOR MACHINES; FUNCTIONAL DOMAIN COMPOSITION; COUPLED RECEPTOR CLASSES; LOCATION PREDICTION; APOPTOSIS PROTEINS; GENE ONTOLOGY; EVOLUTION INFORMATION; APPROXIMATE ENTROPY; STRUCTURAL CLASSES;
D O I
10.1371/journal.pone.0011335
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
One of the fundamental goals in proteomics and cell biology is to identify the functions of proteins in various cellular organelles and pathways. Information of subcellular locations of proteins can provide useful insights for revealing their functions and understanding how they interact with each other in cellular network systems. Most of the existing methods in predicting plant protein subcellular localization can only cover three or four location sites, and none of them can be used to deal with multiplex plant proteins that can simultaneously exist at two, or move between, two or more different location sits. Actually, such multiplex proteins might have special biological functions worthy of particular notice. The present study was devoted to improve the existing plant protein subcellular location predictors from the aforementioned two aspects. A new predictor called "Plant-mPLoc'' is developed by integrating the gene ontology information, functional domain information, and sequential evolutionary information through three different modes of pseudo amino acid composition. It can be used to identify plant proteins among the following 12 location sites: (1) cell membrane, (2) cell wall, (3) chloroplast, (4) cytoplasm, (5) endoplasmic reticulum, (6) extracellular, (7) Golgi apparatus, (8) mitochondrion, (9) nucleus, (10) peroxisome, (11) plastid, and (12) vacuole. Compared with the existing methods for predicting plant protein subcellular localization, the new predictor is much more powerful and flexible. Particularly, it also has the capacity to deal with multiple-location proteins, which is beyond the reach of any existing predictors specialized for identifying plant protein subcellular localization. As a user-friendly web-server, Plant-mPLoc is freely accessible at http://www.csbio.sjtu.edu.cn/bioinf/plant-multi/. Moreover, for the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the web-server to get the desired results. It is anticipated that the Plant-mPLoc predictor as presented in this paper will become a very useful tool in plant science as well as all the relevant areas.
引用
收藏
页数:11
相关论文
共 69 条
  • [1] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [2] The GOA database in 2009-an integrated Gene Ontology Annotation resource
    Barrell, Daniel
    Dimmer, Emily
    Huntley, Rachael P.
    Binns, David
    O'Donovan, Claire
    Apweiler, Rolf
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 : D396 - D403
  • [3] Support vector machines for predicting membrane protein types by using functional domain composition
    Cai, YD
    Zhou, GP
    Chou, KC
    [J]. BIOPHYSICAL JOURNAL, 2003, 84 (05) : 3257 - 3263
  • [4] The gene ontology annotation (GOA) project: Implementation of GO in SWISS-PROT, TrEMBL, and InterPro
    Camon, E
    Magrane, M
    Barrell, D
    Binns, D
    Fleischmann, W
    Kersey, P
    Mulder, N
    Oinn, T
    Maslen, J
    Cox, A
    Apweiler, R
    [J]. GENOME RESEARCH, 2003, 13 (04) : 662 - 672
  • [5] Relation between amino acid composition and cellular location of proteins
    Cedano, J
    Aloy, P
    PerezPons, JA
    Querol, E
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1997, 266 (03) : 594 - 600
  • [6] Prediction of Protein Secondary Structure Content by Using the Concept of Chou's Pseudo Amino Acid Composition and Support Vector Machine
    Chen, Chao
    Chen, Lixuan
    Zou, Xiaoyong
    Cai, Peixiang
    [J]. PROTEIN AND PEPTIDE LETTERS, 2009, 16 (01) : 27 - 31
  • [7] Prediction of protein structural class using novel evolutionary collocation-based sequence representation
    Chen, Ke
    Kurgan, Lukasz A.
    Ruan, Jishou
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2008, 29 (10) : 1596 - 1604
  • [8] Chou K.C., 2009, OPEN BIOINFORM J, V3, P31
  • [9] Structural bioinformatics and its impact to biomedical science
    Chou, KC
    [J]. CURRENT MEDICINAL CHEMISTRY, 2004, 11 (16) : 2105 - 2134
  • [10] Modelling extracellular domains of GABA-A receptors: subtypes 1, 2, 3, and 5
    Chou, KC
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2004, 316 (03) : 636 - 642