Prediction of subcellular localization of proteins using pairwise sequence alignment and support vector machine

Cited by: 23
Authors
Kim, Jong Kyoung
Raghava, G. P. S.
Bang, Sung-Yang
Choi, Seungjin
Affiliations
[1] Pohang Univ Sci & Technol, Dept Comp Sci, Pohang 790784, South Korea
[2] Inst Microbial Technol, Bioinformat Ctr, Chandigarh, India
Keywords
subcellular localization; pairwise sequence alignment; support vector machine
DOI
10.1016/j.patrec.2005.11.014
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Predicting the destination of a protein in a cell is important for annotating the function of the protein. Recent advances have allowed us to develop more accurate methods for predicting the subcellular localization of proteins. One of the most important factors for improving the accuracy of these methods is related to the introduction of new useful features for protein sequences. In this paper we present a new method for extracting appropriate features from the sequence data by computing pairwise sequence alignment scores. As a classifier, support vector machine (SVM) is used. The overall prediction accuracy evaluated by the jackknife validation technique reached 94.70% for the eukaryotic non-plant data set and 92.10% for the eukaryotic plant data set, which is the highest prediction accuracy among the methods reported so far with such data sets. Our experimental results confirm that our feature extraction method based on pairwise sequence alignment is useful for this classification problem. (c) 2006 Elsevier B.V. All rights reserved.
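The feature-extraction idea described in the abstract can be illustrated with a minimal sketch: each protein is represented by its vector of alignment scores against a set of reference sequences. The abstract does not state which alignment algorithm or scoring scheme the authors used, so the Smith-Waterman local alignment, the match/mismatch/gap values, the function names, and the toy sequences below are all assumptions made for illustration only.

```python
from typing import List


def local_alignment_score(a: str, b: str, match: int = 2,
                          mismatch: int = -1, gap: int = -2) -> int:
    """Smith-Waterman local alignment score with a linear gap penalty.

    The scoring values are illustrative assumptions; a real pipeline would
    more likely use an amino-acid substitution matrix.
    """
    prev = [0] * (len(b) + 1)  # previous row of the dynamic-programming table
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            curr[j] = max(
                0,                  # start a new local alignment
                prev[j - 1] + sub,  # align a[i-1] with b[j-1]
                prev[j] + gap,      # gap in b
                curr[j - 1] + gap,  # gap in a
            )
            best = max(best, curr[j])
        prev = curr
    return best


def alignment_features(query: str, references: List[str]) -> List[int]:
    """Represent a sequence by its alignment scores against every reference."""
    return [local_alignment_score(query, ref) for ref in references]


# Hypothetical toy sequences standing in for a labeled training set.
references = ["MKTAYIAK", "MKLVINGK", "GGSSGGSS"]
features = [alignment_features(seq, references) for seq in references]
```

Each row of `features` is one protein's feature vector. Vectors of this kind could then be passed to any SVM implementation for training and prediction; the choice of kernel and parameters is not given in the abstract and is left open here.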
Pages: 996-1001
Page count: 6
Related papers
50 in total
[31]   Using support vector machine to predict β- and γ-turns in proteins [J].
Hu, Xiuzhen ;
Li, Qianzhong .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2008, 29 (12) :1867-1875
[32]   Identification of the subcellular localization of mycobacterial proteins using localization motifs [J].
Tang, Sheng-Nan ;
Sun, Jiang-Ming ;
Xiong, Wen-Wei ;
Cong, Pei-Sheng ;
Li, Tong-Hua .
BIOCHIMIE, 2012, 94 (03) :847-853
[33]   Rockburst prediction using evolutionary support vector machine [J].
Zhao, HB .
PROGRESS IN SAFETY SCIENCE AND TECHNOLOGY, VOL V, PTS A AND B, 2005, 5 :494-498
[34]   WLAN Traffic Prediction Using Support Vector Machine [J].
Feng, Huifang ;
Shu, Yantai ;
Ma, Maode .
IEICE TRANSACTIONS ON COMMUNICATIONS, 2009, E92B (09) :2915-2921
[35]   AutoMotif Server for prediction of phosphorylation sites in proteins using support vector machine: 2007 update [J].
Plewczynski, Dariusz ;
Tkacz, Adrian ;
Wyrwicz, Lucjan S. ;
Rychlewski, Leszek ;
Ginalski, Krzysztof .
JOURNAL OF MOLECULAR MODELING, 2008, 14 (01) :69-76
[36]   AutoMotif Server for prediction of phosphorylation sites in proteins using support vector machine: 2007 update [J].
Dariusz Plewczynski ;
Adrian Tkacz ;
Lucjan S. Wyrwicz ;
Leszek Rychlewski ;
Krzysztof Ginalski .
Journal of Molecular Modeling, 2008, 14 :69-76
[37]   LipoSVM: Prediction of Lysine Lipoylation in Proteins based on the Support Vector Machine [J].
Wu, Meiqi ;
Lu, Pengchao ;
Yang, Yingxi ;
Liu, Liwen ;
Wang, Hui ;
Xu, Yan ;
Chu, Jixun .
CURRENT GENOMICS, 2019, 20 (05) :362-370
[38]   AOPs-SVM: A Sequence-Based Classifier of Antioxidant Proteins Using a Support Vector Machine [J].
Meng, Chaolu ;
Jin, Shunshan ;
Wang, Lei ;
Guo, Fei ;
Zou, Quan .
FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2019, 7
[39]   Prediction of protein subcellular localization using machine learning with novel use of generic feature set [J].
Upama, Paramita Basak ;
Tanny, Nawshin Tabassum ;
Akhter, Shahin .
PROCEEDINGS OF 2020 6TH IEEE INTERNATIONAL WOMEN IN ENGINEERING (WIE) CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (WIECON-ECE 2020), 2020, :98-101
[40]   Sequence-Based Prediction of Protein-Peptide Binding Sites Using Support Vector Machine [J].
Taherzadeh, Ghazaleh ;
Yang, Yuedong ;
Zhang, Tuo ;
Liew, Alan Wee-Chung ;
Zhou, Yaoqi .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2016, 37 (13) :1223-1229