Prediction of cis/trans isomerization using feature selection and support vector machines

被引:16
|
作者
Exarchos, Konstantinos P. [1 ,2 ,3 ]
Papaloukas, Costas [1 ,4 ]
Exarchos, Themis P. [1 ,2 ,3 ]
Troganis, Anastassios N. [4 ]
Fotiadis, Dimitrios I. [1 ,2 ]
机构
[1] Univ Ioannina, Unit Med Technol & Intelligent Informat Syst, Dept Comp Sci, GR-45110 Ioannina, Greece
[2] CERETETH, Inst Biomed Technol, Larisa, Greece
[3] Univ Ioannina, Dept Med Phys, Sch Med, GR-45110 Ioannina, Greece
[4] Univ Ioannina, Dept Biol Applicat & Technol, GR-45110 Ioannina, Greece
关键词
Peptide bond; cis/trans Isomerization; Support vector machines; CIS-TRANS ISOMERIZATION; PEPTIDE-BOND CONFORMATION; SECONDARY STRUCTURE; PROTEINS; SEQUENCE; RESIDUES;
D O I
10.1016/j.jbi.2008.05.006
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In protein structures the peptide bond is found to be in trans conformation in the majority of the cases. Only a small fraction of peptide bonds in proteins is reported to be in cis conformation. Most of these instances (>90%) occur when the peptide bond is an imide (X-Pro) rather than an amide bond (X-nonPro). Due to the implication of cis/trans isomerization in many biologically significant processes, the accurate prediction of the peptide bond conformation is of high interest. In this study, we evaluate the effect of a wide range of features, towards the reliable prediction of both proline and non-proline cis/trans isomerization. We use evolutionary profiles, secondary structure information, real-valued solvent accessibility predictions for each amino acid and the physicochemical properties of the surrounding residues. We also explore the predictive impact of a modified feature vector, which consists of condensed position-specific scoring matrices (PSSMX), secondary structure and solvent accessibility. The best discriminating ability is achieved using the first feature vector combined with a wrapper feature selection algorithm and a support vector machine (SVM). The proposed method results in 70% accuracy, 75% sensitivity and 71% positive predictive value (PPV) in the prediction of the peptide bond conformation between any two amino acids. The output of the feature selection stage is investigated in order to identify discriminatory features as well as the contribution of each neighboring residue in the formation of the peptide bond, thus, advancing our knowledge towards cis/trans isomerization. (C) 2008 Elsevier Inc. All rights reserved.
引用
收藏
页码:140 / 149
页数:10
相关论文
共 50 条
  • [41] FEATURE SELECTION AND PARAMETER OPTIMIZATION FOR SUPPORT VECTOR MACHINES USING PARTICLE SWARM OPTIMIZATION AND HARMONY SEARCH
    Han, Jihee
    Seo, Yoonho
    INTERNATIONAL JOURNAL OF INDUSTRIAL ENGINEERING-THEORY APPLICATIONS AND PRACTICE, 2021, 28 (01): : 1 - 13
  • [42] Data selection using SASH trees for support vector machines
    Sun, Chaofan
    Vilalta, Ricardo
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS, 2007, 4571 : 286 - +
  • [43] Gene selection for cancer classification using support vector machines
    Guyon, I
    Weston, J
    Barnhill, S
    Vapnik, V
    MACHINE LEARNING, 2002, 46 (1-3) : 389 - 422
  • [44] Gene Selection for Cancer Classification using Support Vector Machines
    Isabelle Guyon
    Jason Weston
    Stephen Barnhill
    Vladimir Vapnik
    Machine Learning, 2002, 46 : 389 - 422
  • [45] Prediction of human disease-associated phosphorylation sites with combined feature selection approach and support vector machine
    Xu, Xiaoyi
    Li, Ao
    Wang, Minghui
    IET SYSTEMS BIOLOGY, 2015, 9 (04) : 155 - 163
  • [46] Simultaneous Support Vector Selection and Parameter Optimization Using Support Vector Machines for Sentiment Classification
    Fei, Ye
    PROCEEDINGS OF 2016 IEEE 7TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2016), 2016, : 59 - 62
  • [47] Protein secondary structure prediction using genetic neural support vector machines
    Reyaz-Ahmed, Anjum
    Zhang, Yan-Qing
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 1355 - 1359
  • [48] Prediction of protein secondary structure using Bayesian method and support vector machines
    Nguyen, MN
    Rajapakse, JC
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 616 - 620
  • [49] Prediction of Active Site Cleft Using Support Vector Machines
    Sonavane, Shrihari
    Chakrabarti, Pinak
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (12) : 2266 - 2273
  • [50] A New Promoter Prediction Method using Support Vector Machines
    Arslan, Hilal
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,