Prediction of subcellular localization of proteins using pairwise sequence alignment and support vector machine

被引:23
作者
Kim, Jong Kyoung
Raghava, G. P. S.
Bang, Sung-Yang
Choi, Seungjin
机构
[1] Pohang Univ Sci & Technol, Dept Comp Sci, Pohang 790784, South Korea
[2] Inst Microbial Technol, Bioinformat Ctr, Chandigarh, India
关键词
subcellular localization; pairwise sequence alignment; support vector machine;
D O I
10.1016/j.patrec.2005.11.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting the destination of a protein in a cell is important for annotating the function of the protein. Recent advances have allowed us to develop more accurate methods for predicting the subcellular localization of proteins. One of the most important factors for improving the accuracy of these methods is related to the introduction of new useful features for protein sequences. In this paper we present a new method for extracting appropriate features from the sequence data by computing pairwise sequence alignment scores. As a classifier, support vector machine (SVM) is used. The overall prediction accuracy evaluated by the jackknife validation technique reached 94.70% for the eukaryotic non-plant data set and 92.10% for the eukaryotic plant data set, which is the highest prediction accuracy among the methods reported so far with such data sets. Our experimental results confirm that our feature extraction method based on pairwise sequence alignment is useful for this classification problem. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:996 / 1001
页数:6
相关论文
共 50 条
  • [21] Improving secretory proteins prediction in Mycobacterium tuberculosis using the unbiased dipeptide composition with support vector machine
    Ahmed, Saeed
    Kabir, Muhammad
    Arif, Muhammad
    Ali, Zakir
    Ali, Farman
    Swati, Zar Nawab Khan
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2018, 21 (03) : 212 - 229
  • [22] Prediction in marketing using the support vector machine
    Cui, DP
    Curry, D
    MARKETING SCIENCE, 2005, 24 (04) : 595 - 615
  • [23] Prediction using online support vector machine
    Zhang, ZL
    Guo, CG
    Yu, S
    Qi, DY
    Long, SQ
    ICTAI 2005: 17TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2005, : 451 - 456
  • [24] A novel method for apoptosis protein subcellular localization prediction combining encoding based on grouped weight and support vector machine
    Zhang, Zhen-Hui
    Wang, Zheng-Hua
    Zhang, Zhen-Rong
    Wang, Yong-Xian
    FEBS LETTERS, 2006, 580 (26) : 6169 - 6174
  • [25] Better prediction of the location of α-turns in proteins with support vector machine
    Wang, Yan
    Xue, Zhidong
    Xu, Jin
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 65 (01) : 49 - 54
  • [26] Prediction of Eukaryotic Protein Subcellular Location Using a Novel Feature Extraction Method and Support Vector Machine
    Zhang ShaowuPan QuanWu YonghongCheng YongmeiSchool of Life SciencesNorthwestern Polytechnic UniversityXian ChinaSchool of Automatic ControlNorthwestern Polytechnic UniversityXian Chin
    西北工业大学学报, 2005, (06) : 798 - 803
  • [27] Understanding the prediction of transmembrane proteins by support vector machine using association rule mining
    Hu, Hae-Jin
    Wang, Hao
    Harrison, Robert
    Tai, Phang C.
    Pan, Yi
    2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2007, : 418 - +
  • [28] Predicting Protein Subcellular Localization using PsePSSM and Support Vector Machines
    Juan, Eric Y. T.
    Jhang, J. H.
    Li, W. J.
    PROCEEDINGS OF THE 11TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2008,
  • [29] Identification of functionally diverse lipocalin proteins from sequence information using support vector machine
    Ganesan Pugalenthi
    Krishna Kumar Kandaswamy
    P. N. Suganthan
    G. Archunan
    R. Sowdhamini
    Amino Acids, 2010, 39 : 777 - 783
  • [30] Identification of functionally diverse lipocalin proteins from sequence information using support vector machine
    Pugalenthi, Ganesan
    Kandaswamy, Krishna Kumar
    Suganthan, P. N.
    Archunan, G.
    Sowdhamini, R.
    AMINO ACIDS, 2010, 39 (03) : 777 - 783