Predicting structural classes of proteins by incorporating their global and local physicochemical and conformational properties into general Chou's PseAAC

被引:34
作者
Contreras-Torres, Ernesto [1 ,2 ]
机构
[1] Univ Ciencias Informat, Fac Ciencias & Tecnol Computacionales, Dept Programac & Sistemas Digitales, Havana, Cuba
[2] Univ Ciencias Informat, CEMC, Grp Invest Bioinformat, Havana, Cuba
关键词
OD-protein descriptor; Total sum of squares; Amino acid-type; Amino acid-group; Protein structural classes; Nearest Neighbor; AMINO-ACID-COMPOSITION; LOW-SIMILARITY SEQUENCES; SUPPORT VECTOR MACHINE; EVOLUTIONARY INFORMATION; ACCURATE PREDICTION; FOLDING TYPES; WEB SERVER; BIOINFORMATICS; DESCRIPTORS; CLASSIFIER;
D O I
10.1016/j.jtbi.2018.05.033
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this study, I introduce novel global and local OD-protein descriptors based on a statistical quantity named Total Sum of Squares (TSS). This quantity represents the sum of the squares differences of amino acid properties from the arithmetic mean property. As an extension, the amino acid-types and amino acid groups formalisms are used for describing zones of interest in proteins. To assess the effectiveness of the proposed descriptors, a Nearest Neighbor model for predicting the major four protein structural classes was built. This model has a success rate of 98.53% on the jackknife cross-validation test; this performance being superior to other reported methods despite the simplicity of the predictor. Additionally, this predictor has an average success rate of 98.35% in different cross-validation tests performed. A value of 0.98 for the Kappa statistic clearly discriminates this model from a random predictor. The results obtained by the Nearest Neighbor model demonstrated the ability of the proposed descriptors not only to reflect relevant biochemical information related to the structural classes of proteins but also to allow appropriate interpretability. It can thus be expected that the current method may play a supplementary role to other existing approaches for protein structural class prediction and other protein attributes. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:139 / 145
页数:7
相关论文
共 89 条
[1]   iMem-2LSAAC: A two-level model for discrimination of membrane proteins and their types by extending the notion of SAAC into chou's pseudo amino acid composition [J].
Arif, Muhammad ;
Hayat, Maqsood ;
Jan, Zahoor .
JOURNAL OF THEORETICAL BIOLOGY, 2018, 442 :11-21
[2]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[3]  
Cai Y.-D., 2002, J MOL DES, V1, P332
[4]   Predicting membrane protein type by functional domain composition and pseudo-amino acid composition [J].
Cai, YD ;
Chou, KC .
JOURNAL OF THEORETICAL BIOLOGY, 2006, 238 (02) :395-400
[5]   Using LogitBoost classifier to predict protein structural classes [J].
Cai, YD ;
Feng, KY ;
Lu, WC ;
Chou, KC .
JOURNAL OF THEORETICAL BIOLOGY, 2006, 238 (01) :172-176
[6]   Prediction of protein structural classes by support vector machines [J].
Cai, YD ;
Liu, XJ ;
Xu, XB ;
Chou, KC .
COMPUTERS & CHEMISTRY, 2002, 26 (03) :293-296
[7]   The Role of Hydrophobic Amino Acids in the Structure and Function of the Rhodopsin Family of G Protein-Coupled Receptors [J].
Caltabiano, Gianluigi ;
Gonzalez, Angel ;
Cordomi, Arnau ;
Campillo, Mercedes ;
Pardo, Leonardo .
G PROTEIN COUPLED RECEPTORS: STRUCTURE, 2013, 520 :99-115
[8]   Predicting protein structural class based on multi-features fusion [J].
Chen, Chao ;
Chen, Li-Xuan ;
Zou, Xiao-Yong ;
Cai, Pei-Xiang .
JOURNAL OF THEORETICAL BIOLOGY, 2008, 253 (02) :388-392
[9]   Using pseudo-amino acid composition and support vector machine to predict protein structural class [J].
Chen, Chao ;
Tian, Yuan-Xin ;
Zou, Xiao-Yong ;
Cai, Pei-Xiang ;
Mo, Jin-Yuan .
JOURNAL OF THEORETICAL BIOLOGY, 2006, 243 (03) :444-448
[10]   Dual-Layer Wavelet SVM for Predicting Protein Structural Class Via the General Form of Chou's Pseudo Amino Acid Composition [J].
Chen, Chao ;
Shen, Zhi-Bin ;
Zou, Xiao-Yong .
PROTEIN AND PEPTIDE LETTERS, 2012, 19 (04) :422-429