Improving succinylation prediction accuracy by incorporating the secondary structure via helix, strand and coil, and evolutionary information from profile bigrams

被引:32
作者
Dehzangi, Abdollah [1 ]
Lopez, Yosvany [2 ,3 ]
Lal, Sunil Pranit [4 ]
Taherzadeh, Ghazaleh [5 ]
Sattar, Abdul [5 ,6 ]
Tsunoda, Tatsuhiko [2 ,3 ,7 ]
Sharma, Alok [3 ,6 ,8 ]
机构
[1] Morgan State Univ, Dept Comp Sci, Baltimore, MD 21239 USA
[2] Tokyo Med & Dent Univ, Med Res Inst, Dept Med Sci Math, Tokyo, Japan
[3] RIKEN, Ctr Integrat Med Sci, Lab Med Sci Math, Yokohama, Kanagawa, Japan
[4] Massey Univ, Sch Engn & Adv Technol, Palmerston North, New Zealand
[5] Griffith Univ, Sch Informat & Commun Technol, Nathan, Qld, Australia
[6] Griffith Univ, Inst Integrated & Intelligent Syst, Nathan, Qld, Australia
[7] JST, CREST, Tokyo, Japan
[8] Univ South Pacific, Sch Phys & Engn, Suva, Fiji
关键词
ACCESSIBLE SURFACE-AREA; AMINO-ACID-COMPOSITION; LYSINE SUCCINYLATION; SUBCELLULAR-LOCALIZATION; SCORING MATRIX; POSTTRANSLATIONAL MODIFICATION; PSEUDO COMPONENTS; WEB SERVER; K-TUPLE; SITES;
D O I
10.1371/journal.pone.0191900
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Post-translational modification refers to the biological mechanism involved in the enzymatic modification of proteins after being translated in the ribosome. This mechanism comprises a wide range of structural modifications, which bring dramatic variations to the biological function of proteins. One of the recently discovered modifications is succinylation. Although succinylation can be detected through mass spectrometry, its current experimental detection turns out to be a timely process unable to meet the exponential growth of sequenced proteins. Therefore, the implementation of fast and accurate computational methods has emerged as a feasible solution. This paper proposes a novel classification approach, which effectively incorporates the secondary structure and evolutionary information of proteins through profile bigrams for succinylation prediction. The proposed predictor, abbreviated as SSEvol-Suc, made use of the above features for training an AdaBoost classifier and consequently predicting succinylated lysine residues. When SSEvol-Suc was compared with four benchmark predictors, it outperformed them in metrics such as sensitivity (0.909), accuracy (0.875) and Matthews correlation coefficient (0.75).
引用
收藏
页数:16
相关论文
共 87 条
[11]   pLoc-mEuk: Predict subcellular localization of multi-label eukaryotic proteins by extracting the key GO information into general PseAAC [J].
Cheng, Xiang ;
Xiao, Xuan ;
Chou, Kuo-Chen .
GENOMICS, 2018, 110 (01) :50-58
[12]   Prediction of protein cellular attributes using pseudo-amino acid composition [J].
Chou, KC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 43 (03) :246-255
[13]   Cell-PLoc: a package of Web servers for predicting subcellular localization of proteins in various organisms [J].
Chou, Kuo-Chen ;
Shen, Hong-Bin .
NATURE PROTOCOLS, 2008, 3 (02) :153-162
[14]   An Unprecedented Revolution in Medicinal Chemistry Driven by the Progress of Biological Science [J].
Chou, Kuo-Chen .
CURRENT TOPICS IN MEDICINAL CHEMISTRY, 2017, 17 (21) :2337-2358
[15]   Impacts of Bioinformatics to Medicinal Chemistry [J].
Chou, Kuo-Chen .
MEDICINAL CHEMISTRY, 2015, 11 (03) :218-234
[16]   Some remarks on protein attribute prediction and pseudo amino acid composition [J].
Chou, Kuo-Chen .
JOURNAL OF THEORETICAL BIOLOGY, 2011, 273 (01) :236-247
[17]  
Chou Kuo-Chen., 2009, Natural Science, V1, P63
[18]  
COMB DG, 1966, J BIOL CHEM, V241, P1857
[19]  
Dehzangi Abdollah, 2013, Pattern Recognition in Bioinformatics. 8th IAPR International Conference, PRIB 2013. Proceedings: LNCS 7986, P208, DOI 10.1007/978-3-642-39159-0_19
[20]  
Dehzangi Abdollah, 2013, Pattern Recognition in Bioinformatics. 8th IAPR International Conference, PRIB 2013. Proceedings: LNCS 7986, P196, DOI 10.1007/978-3-642-39159-0_18