A high-throughput predictive method for sequence-similar fold switchers

Cited by: 11
Authors
Kim, Allen K. [1, 2]
Looger, Loren L. [3]
Porter, Lauren L. [1, 2]
Affiliations
[1] NIH, Natl Lib Med, Bethesda, MD 20894 USA
[2] NHLBI, NIH, Bldg 10, Bethesda, MD 20892 USA
[3] Howard Hughes Med Inst, Janelia Res Campus, Ashburn, VA USA
Funding
National Institutes of Health (USA)
Keywords
bioinformatics; metamorphic proteins; protein fold switching; protein secondary structure prediction; protein structure prediction; PROTEIN SECONDARY STRUCTURE; COMPUTATIONAL DESIGN; CRO; GENERATION;
DOI
10.1002/bip.23416
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Although most experimentally characterized proteins with similar sequences assume the same folds and perform similar functions, an increasing number of exceptions is emerging. One class of exceptions comprises sequence-similar fold switchers, whose secondary structures shift between α-helix and β-sheet through a small number of mutations, a sequence insertion, or a deletion. Predictive methods for identifying sequence-similar fold switchers are desirable because some are associated with disease and/or can perform different functions in cells. Here, we use homology-based secondary structure predictions to identify sequence-similar fold switchers from their amino acid sequences alone. To do this, we predicted the secondary structures of sequence-similar fold switchers using three different homology-based secondary structure predictors: PSIPRED, JPred4, and SPIDER3. We found that α-helix ↔ β-strand prediction discrepancies from JPred4 discriminated between the different conformations of sequence-similar fold switchers with high statistical significance (P < 1.8 × 10⁻¹⁹). Thus, we used these discrepancies as a classifier and found that they can often robustly discriminate between sequence-similar fold switchers and sequence-similar proteins that maintain the same folds (Matthews Correlation Coefficient of 0.82). We found that JPred4 is a more robust predictor of sequence-similar fold switchers because of (a) the curated sequence database it uses to produce multiple sequence alignments and (b) its use of sequence profiles based on Hidden Markov Models. Our results indicate that inconsistencies between JPred4 secondary structure predictions can be used to identify some sequence-similar fold switchers from their sequences alone. Thus, the negative information from inconsistent secondary structure predictions can potentially be leveraged to identify sequence-similar fold switchers from the broad base of genomic sequences.
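The idea of treating α-helix ↔ β-strand prediction discrepancies as a classifier, scored with a Matthews Correlation Coefficient, might be sketched as follows. This is only an illustration under stated assumptions, not the authors' pipeline: the three-state strings (`H`, `E`, `C`), the pre-aligned inputs, the function names, and the `min_discrepancies` threshold are all hypothetical, and the abstract does not specify how discrepancies were actually counted or thresholded.

```python
from math import sqrt


def helix_strand_discrepancies(pred_a: str, pred_b: str) -> int:
    """Count aligned positions where one prediction is helix (H) and the
    other is strand (E). Assumes the two strings are already aligned."""
    if len(pred_a) != len(pred_b):
        raise ValueError("predictions must be aligned to the same length")
    return sum(1 for a, b in zip(pred_a, pred_b) if {a, b} == {"H", "E"})


def is_putative_fold_switcher(pred_a: str, pred_b: str,
                              min_discrepancies: int = 5) -> bool:
    """Hypothetical decision rule: flag a sequence pair when the number of
    H/E discrepancies reaches an assumed threshold."""
    return helix_strand_discrepancies(pred_a, pred_b) >= min_discrepancies


def matthews_corrcoef(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews Correlation Coefficient from confusion-matrix counts.
    Returns 0.0 when the denominator is zero (a common convention)."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom


if __name__ == "__main__":
    # Toy predictions, invented for illustration only.
    variant_1 = "CHHHHHHHHCC"
    variant_2 = "CEEEEECCHCC"
    print(helix_strand_discrepancies(variant_1, variant_2))
    print(is_putative_fold_switcher(variant_1, variant_2))
```

In practice the predicted strings would come from a secondary structure predictor run on each sequence variant, and the threshold would be chosen on labelled fold-switching and non-switching pairs before computing the MCC.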
Pages: 13