Identification of Multiple Subcellular Locations for Proteins in Budding Yeast

被引:22
作者
Wan, Si-Bao [2 ]
Hu, Le-Le [1 ,3 ]
Niu, Sheng [4 ]
Wang, Kai [1 ]
Cai, Yu-Dong [1 ,5 ]
Lu, Wen-Cong [3 ]
Chou, Kuo-Chen [5 ]
机构
[1] Shanghai Univ, Inst Syst Biol, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Sch Life Sci, Shanghai Key Lab Bioenergy Crops, Shanghai 200444, Peoples R China
[3] Shanghai Univ, Coll Sci, Dept Chem, Shanghai 200444, Peoples R China
[4] Chinese Acad Sci, Key Lab Syst Biol, Shanghai Inst Biol Sci, Shanghai 200031, Peoples R China
[5] Gordon Life Sci Inst, San Diego, CA 92130 USA
基金
中国国家自然科学基金;
关键词
Multi subcellular locations; incremental feature selection; sort-PLoc; AMINO-ACID-COMPOSITION; SUPPORT VECTOR MACHINE; OUTER-MEMBRANE PROTEINS; STRUCTURAL CLASS; DRUG-METABOLISM; TOPOLOGICAL INDEXES; FEATURE-SELECTION; COMPLEX NETWORKS; GENE ONTOLOGY; PHARMACEUTICAL DESIGN;
D O I
10.2174/157489311795222374
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Knowing the subcellular locations of a protein helps to explore its functions in vivo since a protein can only play its roles properly if and only if it is located at certain subcellular compartments. Since it is both time-consuming and costly to determine protein subcellular localization purely by means of the conventional biotechnology experiments, computational methods play an important complementary role in this regard. Although a number of computational methods have been developed for predicting protein subcellular localization, it remains a challenge to deal with the multiplex proteins that may simultaneously exist at, or move between, two or more different locations. Here, a new predictor called Sort-PLoc was developed to tackle such a difficult and challenging problem. The key step was to select protein domains to code the protein samples by Incremental Feature Selection method. In each prediction, a series of subcellular locations were sorted descendingly according to their likelihood to be the site where the query protein may reside. Based on the selected domain set, the importance of Gene Ontology (GO) terms and domains in the contribution to the prediction was analyzed that may provide useful insights to the relevant areas. For the convenience of the broad experimental scientists, a user-friendly web-server for Sort-PLoc was established that is freely accessible to the public at http://yscl.biosino.org/.
引用
收藏
页码:71 / 80
页数:10
相关论文
共 50 条
  • [1] Using Protein-protein Interaction Network Information to Predict the Subcellular Locations of Proteins in Budding Yeast
    Hu, Le-Le
    Feng, Kai-Yan
    Cai, Yu-Dong
    Chou, Kuo-Chen
    PROTEIN AND PEPTIDE LETTERS, 2012, 19 (06) : 644 - 651
  • [2] A multiple information fusion method for predicting subcellular locations of two different types of bacterial protein simultaneously
    Chen, Jing
    Xu, Huimin
    He, Ping-an
    Dai, Qi
    Yao, Yuhua
    BIOSYSTEMS, 2016, 139 : 37 - 45
  • [3] Prediction of subcellular locations of proteins: Where to proceed?
    Imai, Kenichiro
    Nakai, Kenta
    PROTEOMICS, 2010, 10 (22) : 3970 - 3983
  • [4] Feature Combination Methods for Prediction of Subcellular Locations of Proteins with Both Single and Multiple Sites
    Wang, Luyao
    Wang, Dong
    Chen, Yuehui
    Qiao, Shanping
    Zhao, Yaou
    Cong, Hanhan
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2016, PT I, 2016, 9771 : 192 - 201
  • [5] Predict Subcellular Locations of Singleplex and Multiplex Proteins by Semi-Supervised Learning and Dimension-Reducing General Mode of Chou's PseAAC
    Pacharawongsakda, Eakasit
    Theeramunkong, Thanaruk
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2013, 12 (04) : 311 - 320
  • [6] iLoc-Hum: using the accumulation-label scale to predict subcellular locations of human proteins with both single and multiple sites
    Chou, Kuo-Chen
    Wu, Zhi-Cheng
    Xiao, Xuan
    MOLECULAR BIOSYSTEMS, 2012, 8 (02) : 629 - 641
  • [7] A New Ensemble Scheme for Predicting Human Proteins Subcellular Locations
    Majid, Abdul
    Choi, Tae-Sun
    SIGNAL PROCESSING, IMAGE PROCESSING, AND PATTERN RECOGNITION, 2009, 61 : 185 - 192
  • [8] Identification of the subcellular localization of mycobacterial proteins using localization motifs
    Tang, Sheng-Nan
    Sun, Jiang-Ming
    Xiong, Wen-Wei
    Cong, Pei-Sheng
    Li, Tong-Hua
    BIOCHIMIE, 2012, 94 (03) : 847 - 853
  • [9] A graphic representation of protein sequence and predicting the subcellular locations of prokaryotic proteins
    Feng, ZP
    Zhang, CT
    INTERNATIONAL JOURNAL OF BIOCHEMISTRY & CELL BIOLOGY, 2002, 34 (03) : 298 - 307
  • [10] DBMLoc: a Database of proteins with multiple subcellular localizations
    Song Zhang
    Xuefeng Xia
    Jincheng Shen
    Yun Zhou
    Zhirong Sun
    BMC Bioinformatics, 9