pLoc-mPlant: predict subcellular localization of multi-location plant proteins by incorporating the optimal GO information into general PseAAC

被引:172
作者
Cheng, Xiang [1 ]
Xiao, Xuan [1 ,2 ]
Chou, Kuo-Chen [2 ,3 ]
机构
[1] Jingdezhen Ceram Inst, Comp Dept, Jingdezhen, Peoples R China
[2] Gordon Life Sci Inst, Boston, MA 02478 USA
[3] Univ Elect Sci & Technol China, Ctr Informat Biol, Chengdu 610054, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
AMINO-ACID-COMPOSITION; MULTI-LABEL CLASSIFIER; ENZYME SUBFAMILY CLASSES; SUPPORT VECTOR MACHINE; ENSEMBLE CLASSIFIER; ANTIMICROBIAL PEPTIDES; DIPEPTIDE COMPOSITION; LEARNING CLASSIFIER; LOCATION PREDICTION; MEMBRANE-PROTEINS;
D O I
10.1039/c7mb00267j
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
One of the fundamental goals in cellular biochemistry is to identify the functions of proteins in the context of compartments that organize them in the cellular environment. To realize this, it is indispensable to develop an automated method for fast and accurate identification of the subcellular locations of uncharacterized proteins. The current study is focused on plant protein subcellular location prediction based on the sequence information alone. Although considerable efforts have been made in this regard, the problem is far from being solved yet. Most of the existing methods can be used to deal with single-location proteins only. Actually, proteins with multi-locations may have some special biological functions. This kind of multiplex protein is particularly important for both basic research and drug design. Using the multi-label theory, we present a new predictor called "pLoc-mPlant" by extracting the optimal GO (Gene Ontology) information into the Chou's general PseAAC (Pseudo Amino Acid Composition). Rigorous cross-validation on the same stringent benchmark dataset indicated that the proposed pLoc-mPlant predictor is remarkably superior to iLoc-Plant, the state-of-the-art method for predicting plant protein subcellular localization. To maximize the convenience of most experimental scientists, a user-friendly web-server for the new predictor has been established at http://www.jci-bioinfo.cn/pLoc-mPlant/, by which users can easily get their desired results without the need to go through the complicated mathematics involved.
引用
收藏
页码:1722 / 1727
页数:6
相关论文
共 64 条
  • [51] Hum-mPLoc: An ensemble classifier for large-scale human protein subcellular location prediction by incorporating samples with multiple sites
    Shen, Hong-Bin
    Chou, Kuo-Chen
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2007, 355 (04) : 1006 - 1011
  • [52] A top-down approach to enhance the power of predicting human protein subcellular localization: Hum-mPLoc 2.0
    Shen, Hong-Bin
    Chou, Kuo-Chen
    [J]. ANALYTICAL BIOCHEMISTRY, 2009, 394 (02) : 269 - 274
  • [53] iNuc-STNC: a sequence-based predictor for identification of nucleosome positioning in genomes by extending the concept of SAAC and Chou's PseAAC
    Tahir, Muhammad
    Hayat, Maqsood
    [J]. MOLECULAR BIOSYSTEMS, 2016, 12 (08) : 2587 - 2593
  • [54] GOASVM: A subcellular location predictor by incorporating term-frequency gene ontology into the general form of Chou's pseudo-amino acid composition
    Wan, Shibiao
    Mak, Man-Wai
    Kung, Sun-Yuan
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2013, 323 : 40 - 48
  • [55] Predicting membrane protein types by the LLDA algorithm
    Wang, Tong
    Yang, Jie
    Shen, Hong-Bin
    Chou, Kuo-Chen
    [J]. PROTEIN AND PEPTIDE LETTERS, 2008, 15 (09) : 915 - 921
  • [56] iLoc-Plant: a multi-label classifier for predicting the subcellular localization of plant proteins with both single and multiple sites
    Wu, Zhi-Cheng
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. MOLECULAR BIOSYSTEMS, 2011, 7 (12) : 3287 - 3297
  • [57] iAMP-2L: A two-level multi-label classifier for identifying antimicrobial peptides and their functional types
    Xiao, Xuan
    Wang, Pu
    Lin, Wei-Zhong
    Jia, Jian-Hua
    Chou, Kuo-Chen
    [J]. ANALYTICAL BIOCHEMISTRY, 2013, 436 (02) : 168 - 177
  • [58] iLoc-Virus: A multi-label learning classifier for identifying the subcellular localization of virus proteins with both single and multiple sites
    Xiao, Xuan
    Wu, Zhi-Cheng
    Chou, Kuo-Chen
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2011, 284 (01) : 42 - 51
  • [59] A Multi-Label Classifier for Predicting the Subcellular Localization of Gram-Negative Bacterial Proteins with Both Single and Multiple Sites
    Xiao, Xuan
    Wu, Zhi-Cheng
    Chou, Kuo-Chen
    [J]. PLOS ONE, 2011, 6 (06):
  • [60] iPreny-PseAAC: Identify C-terminal Cysteine Prenylation Sites in Proteins by Incorporating Two Tiers of Sequence Couplings into PseAAC
    Xu, Yan
    Wang, Zu
    Li, Chunhui
    Chou, Kuo-Chen
    [J]. MEDICINAL CHEMISTRY, 2017, 13 (06) : 544 - 551