NLScore: a novel quantitative algorithm based on 3 dimensional structural determinants to predict the probability of nuclear localization in proteins containing classical nuclear localization signals

Cited by: 0
Authors
P. S. Hari
T. S. Sridhar
R. Pravin Kumar
Affiliations
[1] St. John’s Research Institute,
Source
Journal of Molecular Modeling | 2017 / Vol. 23
Keywords
Sub-cellular location; Tertiary structure; In-silico model; Multi-parameter; Linear-regression;
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
The presence of a nuclear localization signal (NLS) in proteins can be inferred from the presence of a stretch of basic amino acids (KRKK). These NLSs are termed classical NLSs (cNLSs). However, only a fraction of proteins containing the cNLS pattern are transported into the nucleus by binding to importin α. Hence, there must exist additional structural determinants that guide the appropriate interaction between putative NLS-containing cargo and importin α. Using 52 protein structures containing cNLSs obtained from the RCSB PDB, we assembled a training set and a validation set such that each set comprised a combination of proteins with proven nuclear localization and proteins that were non-nuclear. We modeled the interface between cNLS-containing cargoes and importin α. We conducted rigid-body docking and produced induced-fit modes by allowing both the side chains and the backbone to be flexible. The output of these studies and additional determinants such as energy of interaction, atomic contacts, hydrophilic interaction, cationic interaction, and penetration of the cargo protein were used to derive a 26-parameter regression equation based on a quantitative structure-activity relationship (QSAR). This was further optimized by a step-wise backward elimination approach to derive a 15-parameter score. This NLScore not only correctly classified confirmed nuclear and non-nuclear localized proteins but also performed better than currently implemented algorithms such as NucPred, Euk-mPLoc 2.0, cNLS Mapper, and NLStradamus. Leave-one-out cross-validation (LOOCV) showed that NLScore correctly predicted 78.6% of non-nuclear proteins and 81.6% of nuclear proteins.
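The abstract does not state the elimination criterion or the regression details, so the following is only a minimal sketch of how step-wise backward elimination and LOOCV can be combined for a linear-regression score. Everything in it is an assumption made for illustration: the synthetic data, the 0.5 classification threshold, the use of LOOCV accuracy as the criterion for dropping a parameter, and the helper names `fit_ols`, `loocv_accuracy`, and `backward_eliminate`. It is not the authors' implementation and does not use their descriptors.

```python
import numpy as np

def fit_ols(X, y):
    """Ordinary least squares for y ~ intercept + X; returns [b0, b1, ...]."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef

def loocv_accuracy(X, y, threshold=0.5):
    """Leave-one-out CV: fit on n-1 samples, classify the held-out sample."""
    n = len(y)
    hits = 0
    for i in range(n):
        train = np.arange(n) != i
        coef = fit_ols(X[train], y[train])
        score = coef[0] + X[i] @ coef[1:]
        # Assumed rule: score >= threshold means "nuclear" (label 1).
        hits += int((score >= threshold) == (y[i] >= threshold))
    return hits / n

def backward_eliminate(X, y, n_keep):
    """Repeatedly drop the parameter whose removal gives the best LOOCV accuracy."""
    kept = list(range(X.shape[1]))
    while len(kept) > n_keep:
        candidates = []
        for j in kept:
            trial = [k for k in kept if k != j]
            candidates.append((loocv_accuracy(X[:, trial], y), j))
        _, drop = max(candidates)
        kept.remove(drop)
    return kept

# Synthetic stand-in data (hypothetical): 40 "proteins", 6 descriptors,
# label 1 = nuclear, 0 = non-nuclear.
rng = np.random.default_rng(0)
X = rng.normal(size=(40, 6))
y = (X[:, 0] + X[:, 1] > 0).astype(float)

kept = backward_eliminate(X, y, n_keep=3)
acc = loocv_accuracy(X[:, kept], y)
print("kept descriptor indices:", kept)
print("LOOCV accuracy:", acc)
```

In the study the same idea would run from 26 parameters down to 15; per-class accuracy, as reported in the abstract, could be obtained by counting hits separately for the nuclear and non-nuclear samples.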