A method for partitioning the information contained in a protein sequence between its structure and function

被引:5
|
作者
Possenti, Andrea [1 ,2 ,3 ,4 ]
Vendruscolo, Michele [4 ]
Camilloni, Carlo [5 ]
Tiana, Guido [1 ,2 ,3 ]
机构
[1] Univ Milan, Ctr Complex & Biosyst, Via Celoria 16, I-20133 Milan, Italy
[2] Univ Milan, Dept Phys, Via Celoria 16, I-20133 Milan, Italy
[3] INFN, Via Celoria 16, I-20133 Milan, Italy
[4] Univ Cambridge, Dept Chem, Lensfield Rd, Cambridge CB2 1EW, England
[5] Univ Milan, Dipartimento Biosci, Via Celoria 26, I-20133 Milan, Italy
关键词
designed proteins; information content; intrinsically disordered proteins; protein folding/function; structure prediction; TRANSITION-STATE; PREDICTION; RESIDUES; ENTROPY; AGGREGATION; FRUSTRATION; PRINCIPLES; STABILITY; MECHANISM; DATABASE;
D O I
10.1002/prot.25527
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Proteins employ the information stored in the genetic code and translated into their sequences to carry out well-defined functions in the cellular environment. The possibility to encode for such functions is controlled by the balance between the amount of information supplied by the sequence and that left after that the protein has folded into its structure. We study the amount of information necessary to specify the protein structure, providing an estimate that keeps into account the thermodynamic properties of protein folding. We thus show that the information remaining in the protein sequence after encoding for its structure (the information gap') is very close to what needed to encode for its function and interactions. Then, by predicting the information gap directly from the protein sequence, we show that it may be possible to use these insights from information theory to discriminate between ordered and disordered proteins, to identify unknown functions, and to optimize artificially-designed protein sequences.
引用
收藏
页码:956 / 964
页数:9
相关论文
共 50 条
  • [1] A homology identification method that combines protein sequence and structure information
    Yu, LH
    White, JV
    Smith, TF
    PROTEIN SCIENCE, 1998, 7 (12) : 2499 - 2510
  • [2] AlphaFold 2: Why It Works and Its Implications for Understanding the Relationships of Protein Sequence, Structure, and Function
    Skolnick, Jeffrey
    Gao, Mu
    Zhou, Hongyi
    Singh, Suresh
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (10) : 4827 - 4831
  • [3] COFACTOR: improved protein function prediction by combining structure, sequence and protein-protein interaction information
    Zhang, Chengxin
    Freddolino, Peter L.
    Zhang, Yang
    NUCLEIC ACIDS RESEARCH, 2017, 45 (W1) : W291 - W299
  • [4] An Improved Protein Structural Classes Prediction Method by Incorporating Both Sequence and Structure Information
    Wei, Leyi
    Liao, Minghong
    Gao, Xing
    Zou, Quan
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2015, 14 (04) : 339 - 349
  • [5] Analysis of protein function and its prediction from amino acid sequence
    Clark, Wyatt T.
    Radivojac, Predrag
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 (07) : 2086 - 2096
  • [6] Exploring the relationships between protein sequence, structure and solubility
    Trainor, Kyle
    Broom, Aron
    Meiering, Elizabeth M.
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2017, 42 : 136 - 146
  • [7] Multiple sequence alignments as tools for protein structure and function prediction
    Valencia, A
    COMPARATIVE AND FUNCTIONAL GENOMICS, 2003, 4 (04): : 424 - 427
  • [8] Internal organization of large protein families: Relationship between the sequence, structure, and function-based clustering
    Cai, Xiao-Hui
    Jaroszewski, Lukasz
    Wooley, John
    Godzik, Adam
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 (08) : 2389 - 2402
  • [9] Guiding discovery of protein sequence-structure-function modeling
    Hussain, Azam
    Brooks, Charles L., III
    BIOINFORMATICS, 2024, 40 (01)
  • [10] A New Hidden Markov Model for Protein Quality Assessment Using Compatibility Between Protein Sequence and Structure
    He, Zhiquan
    Ma, Wenji
    Zhang, Jingfen
    Xu, Dong
    TSINGHUA SCIENCE AND TECHNOLOGY, 2014, 19 (06) : 559 - 567