Analysis and Prediction of Nitrated Tyrosine Sites with the mRMR Method and Support Vector Machine Algorithm

被引:61
作者
Wang, Shao Peng [1 ]
Zhang, Qing [1 ]
Lu, Jing [2 ]
Cai, Yu-Dong [1 ]
机构
[1] Shanghai Univ, Sch Life Sci, Shanghai 200444, Peoples R China
[2] Yantai Univ, Minist Educ, Sch Pharm,Collaborative Innovat Ctr Adv Drug Deli, Key Lab Mol Pharmacol & Drug Evaluat, Yantai 264005, Peoples R China
基金
中国国家自然科学基金;
关键词
Post-translational modification; tyrosine nitration prediction; minimum redundancy maximum relevance; support vector machine; incremental feature selection; MANGANESE-SUPEROXIDE-DISMUTASE; PROTEIN SECONDARY STRUCTURE; AMINO-ACID-COMPOSITION; RIBOSOMAL-RNA-BINDING; FEATURE-SELECTION; NITRIC-OXIDE; NITROTYROSINE FORMATION; METABOLIC PATHWAY; PLASMA-PROTEINS; PEROXYNITRITE;
D O I
10.2174/1574893611666160608075753
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The post-translational modification of tyrosine nitration is an important covalently substituted process in many biochemical processes that are closely related to several human diseases. Objective: Therefore, the correct recognition of nitration sites is useful for diseases diagnosis and the design of effective treatments. However, traditional experimental techniques and methods for the identification of nitration sites are time-consuming, labor-intensive and expensive. Alternatively, effective computational methods can be designed to tackle this problem. Method: In this study, we proposed a computational workflow to identify and analyze nitrated tyrosine residues in proteins. Specifically, each nitrated tyrosine was represented by features derived from a segment of an amino acid sequence of the protein containing the nitrated tyrosine site. A reliable feature selection method, minimum redundancy maximum relevance, was adopted to analyze these features, and incremental feature selection and a type of support vector machine, SMO (sequential minimal optimization), were employed to extract core features and build an optimal prediction classifier. Results: 223 features were extracted and used to build the optimal prediction classifier, with which the Matthew's correlation coefficient (MCC) of the training set was 0.717. The nitration sites in the testing set were extracted from the UniProt database based on the sequence similarity technique and were all denoted as positive samples. The sensitivity of the optimal classifier was 0.950 for the testing set. The results demonstrate the effectiveness and importance of optimal features and the classifier for the recognition of nitration sites. In addition, three other methods, the nearest neighbor algorithm (NNA), Dagging and random forest (RF) methods were also applied to the training and testing set, and the results were compared with those of the SMO. Conclusion: 61 core features of the 223 total features were analyzed, and this analysis revealed the essential residue types and conserved sites proximal to the central tyrosine residue.
引用
收藏
页码:3 / 13
页数:11
相关论文
共 85 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
[Anonymous], J SENS
[3]  
[Anonymous], 1997, P 14 INT C MACH LEAR
[4]  
Aoyama K, 2000, ANN NEUROL, V47, P524, DOI 10.1002/1531-8249(200004)47:4<524::AID-ANA19>3.0.CO
[5]  
2-5
[6]   Solving the protein sequence metric problem [J].
Atchley, WR ;
Zhao, JP ;
Fernandes, AD ;
Drüke, T .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (18) :6395-6400
[7]   The Universal Protein Resource (UniProt) 2009 [J].
Bairoch, Amos ;
Consortium, UniProt ;
Bougueleret, Lydie ;
Altairac, Severine ;
Amendolia, Valeria ;
Auchincloss, Andrea ;
Argoud-Puy, Ghislaine ;
Axelsen, Kristian ;
Baratin, Delphine ;
Blatter, Marie-Claude ;
Boeckmann, Brigitte ;
Bolleman, Jerven ;
Bollondi, Laurent ;
Boutet, Emmanuel ;
Quintaje, Silvia Braconi ;
Breuza, Lionel ;
Bridge, Alan ;
deCastro, Edouard ;
Ciapina, Luciane ;
Coral, Danielle ;
Coudert, Elisabeth ;
Cusin, Isabelle ;
Delbard, Gwennaelle ;
Dornevil, Dolnide ;
Roggli, Paula Duek ;
Duvaud, Severine ;
Estreicher, Anne ;
Famiglietti, Livia ;
Feuermann, Marc ;
Gehant, Sebastian ;
Farriol-Mathis, Nathalie ;
Ferro, Serenella ;
Gasteiger, Elisabeth ;
Gateau, Alain ;
Gerritsen, Vivienne ;
Gos, Arnaud ;
Gruaz-Gumowski, Nadine ;
Hinz, Ursula ;
Hulo, Chantal ;
Hulo, Nicolas ;
James, Janet ;
Jimenez, Silvia ;
Jungo, Florence ;
Junker, Vivien ;
Kappler, Thomas ;
Keller, Guillaume ;
Lachaize, Corinne ;
Lane-Guermonprez, Lydie ;
Langendijk-Genevaux, Petra ;
Lara, Vicente .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D169-D174
[8]   Protein tyrosine nitration in hydrophilic and hydrophobic environments [J].
Bartesaghi, S. ;
Ferrer-Sueta, G. ;
Peluffo, G. ;
Valez, V. ;
Zhang, H. ;
Kalyanaraman, B. ;
Radi, R. .
AMINO ACIDS, 2007, 32 (04) :501-515
[9]   Mechanistic studies of peroxynitrite-mediated tyrosine nitration in membranes using the hydrophobic probe N-t-BOC-L-tyrosine tert-butyl ester [J].
Bartesaghi, Silvina ;
Valez, Valeria ;
Trujillo, Madia ;
Peluffo, Gonzalo ;
Romero, Natalia ;
Zhang, Hao ;
Kalyanaraman, Balaraman ;
Radi, Rafael .
BIOCHEMISTRY, 2006, 45 (22) :6813-6825
[10]   Factors influencing protein tyrosine nitration-structure-based predictive models [J].
Bayden, Alexander S. ;
Yakovlev, Vasily A. ;
Graves, Paul R. ;
Mikkelsen, Ross B. ;
Kellogg, Glen E. .
FREE RADICAL BIOLOGY AND MEDICINE, 2011, 50 (06) :749-762