A method for partitioning the information contained in a protein sequence between its structure and function

被引:5
|
作者
Possenti, Andrea [1 ,2 ,3 ,4 ]
Vendruscolo, Michele [4 ]
Camilloni, Carlo [5 ]
Tiana, Guido [1 ,2 ,3 ]
机构
[1] Univ Milan, Ctr Complex & Biosyst, Via Celoria 16, I-20133 Milan, Italy
[2] Univ Milan, Dept Phys, Via Celoria 16, I-20133 Milan, Italy
[3] INFN, Via Celoria 16, I-20133 Milan, Italy
[4] Univ Cambridge, Dept Chem, Lensfield Rd, Cambridge CB2 1EW, England
[5] Univ Milan, Dipartimento Biosci, Via Celoria 26, I-20133 Milan, Italy
关键词
designed proteins; information content; intrinsically disordered proteins; protein folding/function; structure prediction; TRANSITION-STATE; PREDICTION; RESIDUES; ENTROPY; AGGREGATION; FRUSTRATION; PRINCIPLES; STABILITY; MECHANISM; DATABASE;
D O I
10.1002/prot.25527
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Proteins employ the information stored in the genetic code and translated into their sequences to carry out well-defined functions in the cellular environment. The possibility to encode for such functions is controlled by the balance between the amount of information supplied by the sequence and that left after that the protein has folded into its structure. We study the amount of information necessary to specify the protein structure, providing an estimate that keeps into account the thermodynamic properties of protein folding. We thus show that the information remaining in the protein sequence after encoding for its structure (the information gap') is very close to what needed to encode for its function and interactions. Then, by predicting the information gap directly from the protein sequence, we show that it may be possible to use these insights from information theory to discriminate between ordered and disordered proteins, to identify unknown functions, and to optimize artificially-designed protein sequences.
引用
收藏
页码:956 / 964
页数:9
相关论文
共 50 条
  • [31] RELATIONS BETWEEN PROTEIN-SEQUENCE AND STRUCTURE AND THEIR SIGNIFICANCE
    ROOMAN, MJ
    RODRIGUEZ, J
    WODAK, SJ
    JOURNAL OF MOLECULAR BIOLOGY, 1990, 213 (02) : 337 - 350
  • [32] Exploring the relationships between protein sequence, structure and solubility
    Trainor, Kyle
    Broom, Aron
    Meiering, Elizabeth M.
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2017, 42 : 136 - 146
  • [33] Study on relationship between protein sequence pattern and protein secondary structure
    Li, Minghui
    Lin, Lei
    Wang, Xiaolong
    Dong, Qiwen
    Liu, Tao
    2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 4751 - 4754
  • [34] The characteristics of protein as a function of its detailed structure
    von Przylecki, SJ
    Grynberg, MZ
    BIOCHEMISCHE ZEITSCHRIFT, 1934, 270 : 203 - 218
  • [36] New methods for the prediction of protein structure and function from sequence
    Skolnick, J
    Fetrow, J
    Ortiz, AR
    Kolinski, A
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1999, 217 : U594 - U594
  • [37] Guiding discovery of protein sequence-structure-function modeling
    Hussain, Azam
    Brooks, Charles L., III
    BIOINFORMATICS, 2024, 40 (01)
  • [38] Internal organization of large protein families: Relationship between the sequence, structure, and function-based clustering
    Cai, Xiao-Hui
    Jaroszewski, Lukasz
    Wooley, John
    Godzik, Adam
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 (08) : 2389 - 2402
  • [39] Multiple sequence alignments as tools for protein structure and function prediction
    Valencia, A
    COMPARATIVE AND FUNCTIONAL GENOMICS, 2003, 4 (04): : 424 - 427
  • [40] DNA-protein interactions: Integrating structure, sequence, and function
    Bulyk, Martha L.
    Hartemink, Alexander J.
    Fraenkel, Ernest
    Stormo, Gary
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2007, 2007, : 470 - +