pLoc-mGneg: Predict subcellular localization of Gram-negative bacterial proteins by deep gene ontology learning via general PseAAC

被引:117
作者
Cheng, Xiang [1 ,2 ]
Xiao, Xuan [1 ,2 ]
Chou, Kuo-Chen [2 ,3 ,4 ]
机构
[1] Jingdezhen Ceram Inst, Comp Dept, Jingdezhen, Peoples R China
[2] Gordon Life Sci Inst, Boston, MA 02478 USA
[3] Univ Elect Sci & Technol China, Ctr Informat Biol, Chengdu 610054, Sichuan, Peoples R China
[4] King Abdulaziz Univ, Fac Comp & Informat Technol Rabigh, Jeddah, Saudi Arabia
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Multi-label system; Chou's general PseAAC; Gene ontology; ML-GKR; Chou's metrics; AMINO-ACID-COMPOSITION; SEQUENCE-BASED PREDICTOR; MULTI-LABEL CLASSIFIER; LYSINE SUCCINYLATION SITES; ENSEMBLE CLASSIFIER; LOCATION PREDICTION; VIRUS PROTEINS; WEB SERVER; PHYSICOCHEMICAL PROPERTIES; DIPEPTIDE COMPOSITION;
D O I
10.1016/j.ygeno.2017.10.002
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Information of the proteins' subcellular localization is crucially important for revealing their biological functions in a cell, the basic unit of life. With the avalanche of protein sequences generated in the postgenomic age, it is highly desired to develop computational tools for timely identifying their subcellular locations based on the sequence information alone. The current study is focused on the Gram-negative bacterial proteins. Although considerable efforts have been made in protein subcellular prediction, the problem is far from being solved yet. This is because mounting evidences have indicated that many Gram-negative bacterial proteins exist in two or more location sites. Unfortunately, most existing methods can be used to deal with single-location proteins only. Actually, proteins with multi-locations may have some special biological functions important for both basic research and drug design. In this study, by using the multi-label theory, we developed a new predictor called "pLoc-mGneg" for predicting the subcellular localization of Gram-negative bacterial proteins with both single and multiple locations. Rigorous cross-validation on a high quality benchmark dataset indicated that the proposed predictor is remarkably superior to "iLoc-Gneg", the state-of-the-art predictor for the same purpose. For the convenience of most experimental scientists, a user-friendly web-server for the novel predictor has been established at http://www.jci-bioinfo.cn/pLoc-mGneg/, by which users can easily get their desired results without the need to go through the complicated mathematics involved.
引用
收藏
页码:231 / 239
页数:9
相关论文
共 124 条
  • [81] Chou's pseudo amino acid composition improves sequence-based antifreeze protein prediction
    Mondal, Sukanta
    Pai, Priyadarshini P.
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2014, 356 : 30 - 35
  • [82] Protein sorting signals and prediction of subcellular localization
    Nakai, K
    [J]. ADVANCES IN PROTEIN CHEMISTRY, VOL 54: ANALYSIS OF AMINO ACID SEQUENCES, 2000, 54 : 277 - 344
  • [83] Genetic programming for creating Chou's pseudo amino acid based features for submitochondria localization
    Nanni, Loris
    Lumini, Alessandra
    [J]. AMINO ACIDS, 2008, 34 (04) : 653 - 660
  • [84] Identifying Bacterial Virulent Proteins by Fusing a Set of Classifiers Based on Variants of Chou's Pseudo Amino Acid Composition and on Evolutionary Information
    Nanni, Loris
    Lumini, Alessandra
    Gupta, Dinesh
    Garg, Aarti
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (02) : 467 - 475
  • [85] Small molecular floribundiquinone B derived from medicinal plants inhibits acetylcholinesterase activity
    Niu, Bing
    Zhang, Mengying
    Du, Pu
    Jiang, Li
    Qin, Rui
    Su, Qiang
    Chen, Fuxue
    Du, Dongshu
    Shu, Yilai
    Chou, Kuo-Chen
    [J]. ONCOTARGET, 2017, 8 (34) : 57149 - 57162
  • [86] Predict Subcellular Locations of Singleplex and Multiplex Proteins by Semi-Supervised Learning and Dimension-Reducing General Mode of Chou's PseAAC
    Pacharawongsakda, Eakasit
    Theeramunkong, Thanaruk
    [J]. IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2013, 12 (04) : 311 - 320
  • [87] iRNA-2methyl: Identify RNA 2'-O-methylation Sites by Incorporating Sequence-Coupled Effects into General PseKNC and Ensemble Classifier
    Qiu, Wang-Ren
    Jiang, Shi-Yu
    Sun, Bi-Qian
    Xiao, Xuan
    Cheng, Xiang
    Chou, Kuo-Chen
    [J]. MEDICINAL CHEMISTRY, 2017, 13 (08) : 734 - 743
  • [88] iPhos-PseEvo: Identifying Human Phosphorylated Proteins by Incorporating Evolutionary Information into General PseAAC via Grey System Theory
    Qiu, Wang-Ren
    Sun, Bi-Qian
    Xiao, Xuan
    Xu, Dong
    Chou, Kuo-Chen
    [J]. MOLECULAR INFORMATICS, 2017, 36 (5-6)
  • [89] iRNAm5C-PseDNC: identifying RNA 5-methylcytosine sites by incorporating physical-chemical properties into pseudo dinucleotide composition
    Qiu, Wang-Ren
    Jiang, Shi-Yu
    Xu, Zhao-Chun
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. ONCOTARGET, 2017, 8 (25) : 41178 - 41188
  • [90] iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier
    Qiu, Wang-Ren
    Xiao, Xuan
    Xu, Zhao-Chun
    Chou, Kuo-Chen
    [J]. ONCOTARGET, 2016, 7 (32) : 51270 - 51283