A method for partitioning the information contained in a protein sequence between its structure and function

Cited by: 5
Authors
Possenti, Andrea [1, 2, 3, 4]
Vendruscolo, Michele [4]
Camilloni, Carlo [5]
Tiana, Guido [1, 2, 3]
Affiliations
[1] Univ Milan, Ctr Complex & Biosyst, Via Celoria 16, I-20133 Milan, Italy
[2] Univ Milan, Dept Phys, Via Celoria 16, I-20133 Milan, Italy
[3] INFN, Via Celoria 16, I-20133 Milan, Italy
[4] Univ Cambridge, Dept Chem, Lensfield Rd, Cambridge CB2 1EW, England
[5] Univ Milan, Dipartimento Biosci, Via Celoria 26, I-20133 Milan, Italy
Keywords
designed proteins; information content; intrinsically disordered proteins; protein folding/function; structure prediction; TRANSITION-STATE; PREDICTION; RESIDUES; ENTROPY; AGGREGATION; FRUSTRATION; PRINCIPLES; STABILITY; MECHANISM; DATABASE
DOI
10.1002/prot.25527
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Proteins use the information stored in the genetic code and translated into their sequences to carry out well-defined functions in the cellular environment. The possibility of encoding such functions is controlled by the balance between the amount of information supplied by the sequence and the amount left after the protein has folded into its structure. We study the amount of information necessary to specify the protein structure, providing an estimate that takes into account the thermodynamic properties of protein folding. We thus show that the information remaining in the protein sequence after encoding for its structure (the 'information gap') is very close to what is needed to encode for its function and interactions. Then, by predicting the information gap directly from the protein sequence, we show that it may be possible to use these insights from information theory to discriminate between ordered and disordered proteins, to identify unknown functions, and to optimize artificially designed protein sequences.
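The bookkeeping behind the 'information gap' can be illustrated with a minimal sketch. It is not the authors' estimator: the abstract states that their structural estimate accounts for folding thermodynamics, which is not reproduced here. The function names and the `structure_bits` input are hypothetical, chosen only for illustration; the one firm fact used is that a sequence of length L over a 20-letter alphabet can carry at most L * log2(20) bits.

```python
import math
from collections import Counter


def max_sequence_information(length: int, alphabet_size: int = 20) -> float:
    """Upper bound, in bits, on the information a sequence of this length can carry."""
    return length * math.log2(alphabet_size)


def composition_entropy(sequence: str) -> float:
    """Shannon entropy (bits per residue) of the residue composition of one sequence."""
    counts = Counter(sequence)
    total = len(sequence)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def information_gap(sequence_bits: float, structure_bits: float) -> float:
    """Information left in the sequence after specifying the structure.

    `structure_bits` is a hypothetical, externally supplied estimate; this
    sketch does not compute it.
    """
    return sequence_bits - structure_bits
```

Here `max_sequence_information(100)` gives the ceiling for a 100-residue protein, and `information_gap` simply subtracts whatever structural estimate is supplied, so the sketch shows the accounting rather than the estimation itself.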
Pages: 956-964
Number of pages: 9