WegoLoc: accurate prediction of protein subcellular localization using weighted Gene Ontology terms

被引:45
作者
Chi, Sang-Mun [1 ]
Dougu Nam [2 ]
机构
[1] Kyungsung Univ, Sch Comp Sci & Engn, Pusan 608736, South Korea
[2] UNIST, Sch Nanobiosci & Chem Engn, Ulsan, South Korea
关键词
SEQUENCE;
D O I
10.1093/bioinformatics/bts062
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We present an accurate and fast web server, WegoLoc for predicting subcellular localization of proteins based on sequence similarity and weighted Gene Ontology (GO) information. A term weighting method in the text categorization process is applied to GO terms for a support vector machine classifier. As a result, WegoLoc surpasses the state-of-the-art methods for previously used test datasets. WegoLoc supports three eukaryotic kingdoms (animals, fungi and plants) and provides human-specific analysis, and covers several sets of cellular locations. In addition, WegoLoc provides (i) multiple possible localizations of input protein(s) as well as their corresponding probability scores, (ii) weights of GO terms representing the contribution of each GO term in the prediction, and (iii) a BLAST E-value for the best hit with GO terms. If the similarity score does not meet a given threshold, an amino acid composition-based prediction is applied as a backup method.
引用
收藏
页码:1028 / 1030
页数:3
相关论文
共 21 条
[1]   MultiLoc2: integrating phylogeny and Gene Ontology terms improves subcellular protein localization prediction [J].
Blum, Torsten ;
Briesemeister, Sebastian ;
Kohlbacher, Oliver .
BMC BIOINFORMATICS, 2009, 10 :274
[2]  
Brady Scott, 2008, Pac Symp Biocomput, P604
[3]   SherLoc2: A High-Accuracy Hybrid Method for Predicting Subcellular Localization of Proteins [J].
Briesemeister, Sebastian ;
Blum, Torsten ;
Brady, Scott ;
Lam, Yin ;
Kohlbacher, Oliver ;
Shatkay, Hagit .
JOURNAL OF PROTEOME RESEARCH, 2009, 8 (11) :5363-5366
[4]  
Casadio Rita, 2008, Briefings in Functional Genomics & Proteomics, V7, P63, DOI 10.1093/bfgp/eln003
[5]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[7]   Locating proteins in the cell using TargetP, SignalP and related tools [J].
Emanuelsson, Olof ;
Brunak, Soren ;
von Heijne, Gunnar ;
Nielsen, Henrik .
NATURE PROTOCOLS, 2007, 2 (04) :953-971
[8]   Improving subcellular localization prediction using text classification and the gene ontology [J].
Fyshe, Alona ;
Liu, Yifeng ;
Szafron, Duane ;
Greiner, Russ ;
Lu, Paul .
BIOINFORMATICS, 2008, 24 (21) :2512-2517
[9]   Automated subcellular location determination and high-throughput microscopy [J].
Glory, Estelle ;
Murphy, Robert F. .
DEVELOPMENTAL CELL, 2007, 12 (01) :7-16
[10]   MultiLoc:: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs and amino acid composition [J].
Höglund, A ;
Dönnes, P ;
Blum, T ;
Adolph, HW ;
Kohlbacher, O .
BIOINFORMATICS, 2006, 22 (10) :1158-1165