Fusion of cleavage site detection and pairwise alignment for fast subcellular localization

被引:0
作者
Mak, Man-Wai [1 ]
Kung, Sun-Yuan [2 ]
机构
[1] Hong Kong Polytech Univ, Dept Elect Elect & Informat Engn, Hong Kong, Hong Kong, Peoples R China
[2] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
来源
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008年
关键词
pairwise alignment; subcellular localization; cleavage sites; TargetP; profile; protein sequences;
D O I
10.1109/ICASSP.2008.4517674
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In recent years, homology-based and signal-based methods have been proposed for predicting the subcellular localization of proteins. While it has been known that homology-based methods can detect more subcellular locations than signal-based methods, the former generally requires a lot more computational resources during both training and prediction. The problem will become intractable for annotating large databases. One possible solution is to reduce the sequence length. This paper proposes to use the cleavage sites detected by signal-based methods (e.g., TargetP) to extract the sequence or profile segments that contain the most localization information for alignment. It was found that the method can reduce computation time of full-length alignment by 27-fold at a cost of only 8% reduction in prediction accuracy. Moreover, the method can increase the accuracy by 0.8% and at the same time reduce the computation time by 41%. Results also show that cutting the sequences at the cleavage sites detected by TargetP is better than cutting them at a fixed position.
引用
收藏
页码:573 / +
页数:2
相关论文
共 14 条
[1]   Predicting subcellular localization of proteins based on their N-terminal amino acid sequence [J].
Emanuelsson, O ;
Nielsen, H ;
Brunak, S ;
von Heijne, G .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 300 (04) :1005-1016
[2]   ChloroP, a neural network-based method for predicting chloroplast transit peptides and their cleavage sites [J].
Emanuelsson, O ;
Nielsen, H ;
Von Heijne, G .
PROTEIN SCIENCE, 1999, 8 (05) :978-984
[3]   Locating proteins in the cell using TargetP, SignalP and related tools [J].
Emanuelsson, Olof ;
Brunak, Soren ;
von Heijne, Gunnar ;
Nielsen, Henrik .
NATURE PROTOCOLS, 2007, 2 (04) :953-971
[4]   Methods for predicting bacterial protein subcellular localization [J].
Gardy, Jennifer L. ;
Brinkman, Fiona S. L. .
NATURE REVIEWS MICROBIOLOGY, 2006, 4 (10) :741-751
[5]  
GUO J, 2006, 2006 IEEE INT WORKSH, P391
[6]   Prediction of protein subcellular locations using fuzzy k-NN method [J].
Huang, Y ;
Li, Y .
BIOINFORMATICS, 2004, 20 (01) :21-28
[7]   Combining pairwise-sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships [J].
Liao, L ;
Noble, WS .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2003, 10 (06) :857-868
[8]  
Lodish H, 2008, MOL CELL BIOL
[9]  
MAK MW, IEEE ACM T IN PRESS
[10]   Sequence conserved for subcellular localization [J].
Nair, R ;
Rost, B .
PROTEIN SCIENCE, 2002, 11 (12) :2836-2847