Prediction of subcellular localization of proteins using pairwise sequence alignment and support vector machine

被引:23
作者
Kim, Jong Kyoung
Raghava, G. P. S.
Bang, Sung-Yang
Choi, Seungjin
机构
[1] Pohang Univ Sci & Technol, Dept Comp Sci, Pohang 790784, South Korea
[2] Inst Microbial Technol, Bioinformat Ctr, Chandigarh, India
关键词
subcellular localization; pairwise sequence alignment; support vector machine;
D O I
10.1016/j.patrec.2005.11.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting the destination of a protein in a cell is important for annotating the function of the protein. Recent advances have allowed us to develop more accurate methods for predicting the subcellular localization of proteins. One of the most important factors for improving the accuracy of these methods is related to the introduction of new useful features for protein sequences. In this paper we present a new method for extracting appropriate features from the sequence data by computing pairwise sequence alignment scores. As a classifier, support vector machine (SVM) is used. The overall prediction accuracy evaluated by the jackknife validation technique reached 94.70% for the eukaryotic non-plant data set and 92.10% for the eukaryotic plant data set, which is the highest prediction accuracy among the methods reported so far with such data sets. Our experimental results confirm that our feature extraction method based on pairwise sequence alignment is useful for this classification problem. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:996 / 1001
页数:6
相关论文
共 50 条
[41]   LyFor:Prediction of lysine formylation sites from sequence based features using support vector machine [J].
Sohrawordi, Md ;
Hasan, Md Al Mehedi .
2020 IEEE REGION 10 SYMPOSIUM (TENSYMP) - TECHNOLOGY FOR IMPACTFUL SUSTAINABLE DEVELOPMENT, 2020, :250-253
[42]   Prediction of transporter family from protein sequence by support vector machine approach [J].
Lin, HH ;
Han, LY ;
Cai, CZ ;
Ji, ZL ;
Chen, YZ .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 62 (01) :218-231
[43]   Sequence/structure similarity and support vector machine for protein secondary structure prediction [J].
Lin, JH ;
Tsai, CL ;
Lin, MR .
8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XIII, PROCEEDINGS: INDUSTRIAL SYSTEMS, 2004, :71-76
[44]   Support vector machine with hypergraph-based pairwise constraints [J].
Hou, Qiuling ;
Lv, Meng ;
Zhen, Ling ;
Jing, Ling .
SPRINGERPLUS, 2016, 5
[45]   Prediction of endoplasmic reticulum resident proteins using fragmented amino acid composition and support vector machine [J].
Kumar, Ravindra ;
Kumari, Bandana ;
Kumar, Manish .
PEERJ, 2017, 5
[46]   Prediction of pile bearing capacity using support vector machine [J].
Samui, Pijush .
INTERNATIONAL JOURNAL OF GEOTECHNICAL ENGINEERING, 2011, 5 (01) :95-102
[47]   Prediction of Cotton Yarn Properties Using Support Vector Machine [J].
Ghosh, Anindya ;
Chatterjee, Pritam .
FIBERS AND POLYMERS, 2010, 11 (01) :84-88
[48]   Prediction of unusual plasma discharge by using Support Vector Machine [J].
Nakagawa, Shota ;
Hochin, Teruhisa ;
Nomiya, Hiroki ;
Nakanishi, Hideya ;
Shoji, Mamoru .
FUSION ENGINEERING AND DESIGN, 2021, 167 (167)
[49]   Prediction of Epileptic Seizures using Support Vector Machine and Regularization [J].
Ahmad, Shaikh Rezwan Rafid ;
Sayeed, Samee Mohammad ;
Ahmed, Zaziba ;
Siddique, Nusayer Masud ;
Parvez, Mohammad Zavid .
2020 IEEE REGION 10 SYMPOSIUM (TENSYMP) - TECHNOLOGY FOR IMPACTFUL SUSTAINABLE DEVELOPMENT, 2020, :1217-1220
[50]   Indonesian Stock Prediction using Support Vector Machine (SVM) [J].
Santoso, Murtiyanto ;
Sutjiadi, Raymond ;
Lim, Resmana .
3RD INTERNATIONAL CONFERENCE ON ELECTRICAL SYSTEMS, TECHNOLOGY AND INFORMATION (ICESTI 2017), 2018, 164