Prediction of cis/trans isomerization using feature selection and support vector machines

被引:16
|
作者
Exarchos, Konstantinos P. [1 ,2 ,3 ]
Papaloukas, Costas [1 ,4 ]
Exarchos, Themis P. [1 ,2 ,3 ]
Troganis, Anastassios N. [4 ]
Fotiadis, Dimitrios I. [1 ,2 ]
机构
[1] Univ Ioannina, Unit Med Technol & Intelligent Informat Syst, Dept Comp Sci, GR-45110 Ioannina, Greece
[2] CERETETH, Inst Biomed Technol, Larisa, Greece
[3] Univ Ioannina, Dept Med Phys, Sch Med, GR-45110 Ioannina, Greece
[4] Univ Ioannina, Dept Biol Applicat & Technol, GR-45110 Ioannina, Greece
关键词
Peptide bond; cis/trans Isomerization; Support vector machines; CIS-TRANS ISOMERIZATION; PEPTIDE-BOND CONFORMATION; SECONDARY STRUCTURE; PROTEINS; SEQUENCE; RESIDUES;
D O I
10.1016/j.jbi.2008.05.006
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In protein structures the peptide bond is found to be in trans conformation in the majority of the cases. Only a small fraction of peptide bonds in proteins is reported to be in cis conformation. Most of these instances (>90%) occur when the peptide bond is an imide (X-Pro) rather than an amide bond (X-nonPro). Due to the implication of cis/trans isomerization in many biologically significant processes, the accurate prediction of the peptide bond conformation is of high interest. In this study, we evaluate the effect of a wide range of features, towards the reliable prediction of both proline and non-proline cis/trans isomerization. We use evolutionary profiles, secondary structure information, real-valued solvent accessibility predictions for each amino acid and the physicochemical properties of the surrounding residues. We also explore the predictive impact of a modified feature vector, which consists of condensed position-specific scoring matrices (PSSMX), secondary structure and solvent accessibility. The best discriminating ability is achieved using the first feature vector combined with a wrapper feature selection algorithm and a support vector machine (SVM). The proposed method results in 70% accuracy, 75% sensitivity and 71% positive predictive value (PPV) in the prediction of the peptide bond conformation between any two amino acids. The output of the feature selection stage is investigated in order to identify discriminatory features as well as the contribution of each neighboring residue in the formation of the peptide bond, thus, advancing our knowledge towards cis/trans isomerization. (C) 2008 Elsevier Inc. All rights reserved.
引用
收藏
页码:140 / 149
页数:10
相关论文
共 50 条
  • [21] Kernel/feature selection for support vector machines applied to materials design
    Martin, S
    Kirby, M
    Miranda, R
    ARTIFICIAL INTELLIGENCE IN REAL-TIME CONTROL 2000, 2001, : 25 - 30
  • [22] Support vector machines combined with feature selection for breast cancer diagnosis
    Akay, Mehmet Fatih
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 3240 - 3247
  • [23] Stock Market Trend Prediction Using Support Vector Machines and Variable Selection Methods
    Grigoryan, Hakob
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, MODELLING AND STATISTICS APPLICATION (AMMSA 2017), 2017, 141 : 210 - 213
  • [24] Gene selection and prediction for cancer classification using support vector machines with a reject option
    Choi, Hosik
    Yeo, Donghwa
    Kwon, Sunghoon
    Kim, Yongdai
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (05) : 1897 - 1908
  • [25] Feature selection using binary particle swarm optimization and support vector machines for medical diagnosis
    Daliri, Mohammad Reza
    BIOMEDICAL ENGINEERING-BIOMEDIZINISCHE TECHNIK, 2012, 57 (05): : 395 - 402
  • [26] PBOND:Web Server for the Prediction of Proline and Non-Proline cis/trans Isomerization
    Konstantinos P. Exarchos
    Themis P. Exarchos
    Costas Papaloukas
    Anastassios N. Troganis
    Dimitrios I. Fotiadis
    Genomics,Proteomics & Bioinformatics, 2009, (03) : 138 - 142
  • [27] Cardiovascular disease prediction using support vector machines
    Alty, SR
    Millasseau, SC
    Chowienczyk, PJ
    Jakobsson, A
    Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 376 - 379
  • [28] Credit risk prediction using support vector machines
    Trustorff J.-H.
    Konrad P.M.
    Leker J.
    Review of Quantitative Finance and Accounting, 2011, 36 (4) : 565 - 581
  • [29] Soil moisture prediction using support vector machines
    Gill, M. Kashif
    Asefa, Tirusew
    Kemblowski, Mariush W.
    McKee, Mae
    JOURNAL OF THE AMERICAN WATER RESOURCES ASSOCIATION, 2006, 42 (04): : 1033 - 1046
  • [30] Pressure Ulcers Prediction Using Support Vector Machines
    Chen, Yan-Cheng
    Wang, Pa-Chun
    Su, Chao-Ton
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11685 - +