Prediction of cis/trans isomerization using feature selection and support vector machines

被引:16
|
作者
Exarchos, Konstantinos P. [1 ,2 ,3 ]
Papaloukas, Costas [1 ,4 ]
Exarchos, Themis P. [1 ,2 ,3 ]
Troganis, Anastassios N. [4 ]
Fotiadis, Dimitrios I. [1 ,2 ]
机构
[1] Univ Ioannina, Unit Med Technol & Intelligent Informat Syst, Dept Comp Sci, GR-45110 Ioannina, Greece
[2] CERETETH, Inst Biomed Technol, Larisa, Greece
[3] Univ Ioannina, Dept Med Phys, Sch Med, GR-45110 Ioannina, Greece
[4] Univ Ioannina, Dept Biol Applicat & Technol, GR-45110 Ioannina, Greece
关键词
Peptide bond; cis/trans Isomerization; Support vector machines; CIS-TRANS ISOMERIZATION; PEPTIDE-BOND CONFORMATION; SECONDARY STRUCTURE; PROTEINS; SEQUENCE; RESIDUES;
D O I
10.1016/j.jbi.2008.05.006
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In protein structures the peptide bond is found to be in trans conformation in the majority of the cases. Only a small fraction of peptide bonds in proteins is reported to be in cis conformation. Most of these instances (>90%) occur when the peptide bond is an imide (X-Pro) rather than an amide bond (X-nonPro). Due to the implication of cis/trans isomerization in many biologically significant processes, the accurate prediction of the peptide bond conformation is of high interest. In this study, we evaluate the effect of a wide range of features, towards the reliable prediction of both proline and non-proline cis/trans isomerization. We use evolutionary profiles, secondary structure information, real-valued solvent accessibility predictions for each amino acid and the physicochemical properties of the surrounding residues. We also explore the predictive impact of a modified feature vector, which consists of condensed position-specific scoring matrices (PSSMX), secondary structure and solvent accessibility. The best discriminating ability is achieved using the first feature vector combined with a wrapper feature selection algorithm and a support vector machine (SVM). The proposed method results in 70% accuracy, 75% sensitivity and 71% positive predictive value (PPV) in the prediction of the peptide bond conformation between any two amino acids. The output of the feature selection stage is investigated in order to identify discriminatory features as well as the contribution of each neighboring residue in the formation of the peptide bond, thus, advancing our knowledge towards cis/trans isomerization. (C) 2008 Elsevier Inc. All rights reserved.
引用
收藏
页码:140 / 149
页数:10
相关论文
共 50 条
  • [31] Prediction of contact maps using support vector machines
    Zhao, Y
    Karypis, G
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2005, 14 (05) : 849 - 865
  • [32] Prediction of arterial hypertension using support vector machines
    Seijas, Cesar
    Caralli, Antonino
    Villasana, Sergio
    Saenz, Laura
    Arteaga, Francisco
    INGENIERIA UC, 2006, 13 (03): : 13 - 18
  • [33] Prediction of repeat visitation using support vector machines
    Department of Logistics Engineering and Management, National Taichung Institute of Technology, No. 129, Sec. 3, Sanmin Rd., Taichung 404, Taiwan
    不详
    WSEAS Trans. Inf. Sci. Appl., 2007, 2 (369-376):
  • [34] Facial image feature extraction using support vector machines
    Moghaddam, H. Abrishami
    Ghayoumi, M.
    VISAPP 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2006, : 480 - +
  • [35] Multi-View Scaling Support Vector Machines for Classification and Feature Selection
    Xu, Jinglin
    Han, Junwei
    Nie, Feiping
    Li, Xuelong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (07) : 1419 - 1430
  • [36] A GA-based feature selection and parameters optimization for support vector machines
    Huang, Cheng-Lung
    Wang, Chieh-Jen
    EXPERT SYSTEMS WITH APPLICATIONS, 2006, 31 (02) : 231 - 240
  • [37] Geographical Classification of Tannat Wines Based on Support Vector Machines and Feature Selection
    Costa, Nattane Luiza
    Garcia Llobodanin, Laura Andrea
    Castro, Inar Alves
    Barbosa, Rommel
    BEVERAGES, 2018, 4 (04):
  • [38] Intelligent Consensus Modeling for Proline Cis-Trans Isomerization Prediction
    Yoo, Paul D.
    Muhaidat, Sami
    Taha, Kamal
    Bentahar, Jamal
    Shami, Abdallah
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (01) : 26 - 32
  • [39] Robust feature selection for multiclass Support Vector Machines using second-order cone programming
    Lopez, Julio
    Maldonado, Sebastian
    INTELLIGENT DATA ANALYSIS, 2015, 19 : S117 - S133
  • [40] Relevance Feedback for Content-Based Image Retrieval Using Support Vector Machines and Feature Selection
    Marakakis, Apostolos
    Galatsanos, Nikolaos
    Likas, Aristidis
    Stafylopatis, Andreas
    ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT I, 2009, 5768 : 942 - +