A sequence-based method for predicting extant fold switchers that undergo α-helix ⇆ β-strand transitions

被引:10
|
作者
Mishra, Soumya [1 ,3 ]
Looger, Loren L. [3 ]
Porter, Lauren L. [1 ,2 ]
机构
[1] NLM, NIH, Bethesda, MD 20894 USA
[2] NHLBI, NIH, Bldg 10, Bethesda, MD 20892 USA
[3] Janelia Res Campus, Howard Hughes Med Inst, Ashburn, VA USA
基金
美国国家卫生研究院;
关键词
fold-switching proteins; metamorphic proteins; protein folding; bioinformatics; PROTEIN SECONDARY STRUCTURE; CONFORMATIONAL-CHANGES; COMPUTATIONAL DESIGN; CRYSTAL-STRUCTURE; OSCILLATOR; OVALBUMIN; DATABASE; NMR;
D O I
10.1002/bip.23471
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Extant fold-switching proteins remodel their secondary structures and change their functions in response to cellular stimuli, regulating biological processes and affecting human health. Despite their biological importance, these proteins remain understudied. Predictive methods are needed to expedite the process of discovering and characterizing more of these shapeshifting proteins. Most previous approaches require a solved structure or all-atom simulations, greatly constraining their use. Here, we propose a high-throughput sequence-based method for predicting extant fold switchers that transition from alpha-helix in one conformation to beta-strand in the other. This method leverages two previous observations: (a) alpha-helix <-> beta-strand prediction discrepancies from JPred4 are a robust predictor of fold switching, and (b) the fold-switching regions (FSRs) of some extant fold switchers have different secondary structure propensities when expressed by themselves (isolated FSRs) than when expressed within the context of their parent protein (contextualized FSRs). Combining these two observations, we ran JPred4 on 99-fold-switching proteins and found strong correspondence between predicted and experimentally observed alpha-helix <-> beta-strand discrepancies. To test the overall robustness of this finding, we randomly selected regions of proteins not expected to switch folds (single-fold proteins) and found significantly fewer predicted alpha-helix <-> beta-strand discrepancies. Combining these discrepancies with the overall percentage of predicted secondary structure, we developed a classifier to identify extant fold switchers (Matthews correlation coefficient of .71). Although this classifier had a high false-negative rate (7/17), its false-positive rate was very low (2/136), suggesting that it can be used to predict a subset of extant fold switchers from a multitude of available genomic sequences.
引用
收藏
页数:10
相关论文
共 21 条
  • [1] A high-throughput predictive method for sequence-similar fold switchers
    Kim, Allen K.
    Looger, Loren L.
    Porter, Lauren L.
    BIOPOLYMERS, 2021, 112 (10)
  • [2] Predicting the beta-helix fold from protein sequence data
    Cowen, L
    Bradley, P
    Menke, M
    King, J
    Berger, B
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (02) : 261 - 276
  • [3] A Sequence-based Approach for Predicting Protein Disordered Regions
    Huang, Tao
    He, Zhi-Song
    Cui, Wei-Ren
    Cai, Yu-Dong
    Shi, Xiao-He
    Hu, Le-Le
    Chou, Kuo-Chen
    PROTEIN AND PEPTIDE LETTERS, 2013, 20 (03) : 243 - 248
  • [4] Sequence-Based Predicting Bacterial Essential ncRNAs Algorithm by Machine Learning
    Ye, Yuan-Nong
    Liang, Ding-Fa
    Labena, Abraham Alemayehu
    Zeng, Zhu
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (03) : 2731 - 2741
  • [5] A Novel Sequence-Based Method of Predicting Protein DNA-Binding Residues, Using a Machine Learning Approach
    Cai, Yudong
    He, ZhiSong
    Shi, Xiaohe
    Kong, Xiangying
    Gu, Lei
    Xie, Lu
    MOLECULES AND CELLS, 2010, 30 (02) : 99 - 105
  • [6] Sequence-based prediction of protein interaction sites with an integrative method
    Chen, Xue-Wen
    Jeong, Jong Cheol
    BIOINFORMATICS, 2009, 25 (05) : 585 - 591
  • [7] Analyses of the folding sites of irregular β-trefoil fold proteins through sequence-based techniques and Go-model simulations
    Kimura, Risako
    Aumpuchin, Panyavut
    Hamaue, Shoya
    Shimomura, Takumi
    Kikuchi, Takeshi
    BMC MOLECULAR AND CELL BIOLOGY, 2020, 21 (01)
  • [8] OOgenesis_Pred: A sequence-based method for predicting oogenesis proteins by six different modes of Chou's pseudo amino acid composition
    Rahimi, Maryam
    Bakhtiarizadeh, Mohammad Reza
    Mohammadi-Sangcheshmeh, Abdollah
    JOURNAL OF THEORETICAL BIOLOGY, 2017, 414 : 128 - 136
  • [9] The s2D Method: Simultaneous Sequence-Based Prediction of the Statistical Populations of Ordered and Disordered Regions in Proteins
    Sormanni, Pietro
    Camilloni, Carlo
    Fariselli, Piero
    Vendruscolo, Michele
    JOURNAL OF MOLECULAR BIOLOGY, 2015, 427 (04) : 982 - 996
  • [10] GoFDR: A sequence alignment based method for predicting protein functions
    Gong, Qingtian
    Ning, Wei
    Tian, Weidong
    METHODS, 2016, 93 : 3 - 14