A method for partitioning the information contained in a protein sequence between its structure and function

被引:5
|
作者
Possenti, Andrea [1 ,2 ,3 ,4 ]
Vendruscolo, Michele [4 ]
Camilloni, Carlo [5 ]
Tiana, Guido [1 ,2 ,3 ]
机构
[1] Univ Milan, Ctr Complex & Biosyst, Via Celoria 16, I-20133 Milan, Italy
[2] Univ Milan, Dept Phys, Via Celoria 16, I-20133 Milan, Italy
[3] INFN, Via Celoria 16, I-20133 Milan, Italy
[4] Univ Cambridge, Dept Chem, Lensfield Rd, Cambridge CB2 1EW, England
[5] Univ Milan, Dipartimento Biosci, Via Celoria 26, I-20133 Milan, Italy
关键词
designed proteins; information content; intrinsically disordered proteins; protein folding/function; structure prediction; TRANSITION-STATE; PREDICTION; RESIDUES; ENTROPY; AGGREGATION; FRUSTRATION; PRINCIPLES; STABILITY; MECHANISM; DATABASE;
D O I
10.1002/prot.25527
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Proteins employ the information stored in the genetic code and translated into their sequences to carry out well-defined functions in the cellular environment. The possibility to encode for such functions is controlled by the balance between the amount of information supplied by the sequence and that left after that the protein has folded into its structure. We study the amount of information necessary to specify the protein structure, providing an estimate that keeps into account the thermodynamic properties of protein folding. We thus show that the information remaining in the protein sequence after encoding for its structure (the information gap') is very close to what needed to encode for its function and interactions. Then, by predicting the information gap directly from the protein sequence, we show that it may be possible to use these insights from information theory to discriminate between ordered and disordered proteins, to identify unknown functions, and to optimize artificially-designed protein sequences.
引用
收藏
页码:956 / 964
页数:9
相关论文
共 50 条
  • [31] Hybrid Protein Model (HPM): a method to compact protein 3D-structure information and physicochemical properties
    de Brevern, AG
    Hazout, SA
    SPIRE 2000: SEVENTH INTERNATIONAL SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL - PROCEEDINGS, 2000, : 49 - 54
  • [32] On the Potential of Machine Learning to Examine the Relationship Between Sequence, Structure, Dynamics and Function of Intrinsically Disordered Proteins
    Lindorff-Larsen, Kresten
    Kragelund, Birthe B.
    JOURNAL OF MOLECULAR BIOLOGY, 2021, 433 (20)
  • [33] Ab initio protein folding simulations using atomic burials as informational intermediates between sequence and structure
    van der Linden, Marx Gomes
    Ferreira, Diogo Cesar
    de Oliveira, Leandro Cristante
    Onuchic, Jose N.
    Pereira de Araujo, Antonio F.
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2014, 82 (07) : 1186 - 1199
  • [34] The crystal structure of the Leishmania infantum Silent Information Regulator 2 related protein 1: Implications to protein function and drug design
    Ronin, Celine
    Costa, David Mendes
    Tavares, Joana
    Faria, Joana
    Ciesielski, Fabrice
    Ciapetti, Paola
    Smith, Terry K.
    MacDougall, Jane
    Cordeiro-da-Silva, Anabela
    Pemberton, Lain K.
    PLOS ONE, 2018, 13 (03):
  • [35] DPFunc: accurately predicting protein function via deep learning with domain-guided structure information
    Wang, Wenkang
    Shuai, Yunyan
    Zeng, Min
    Fan, Wei
    Li, Min
    NATURE COMMUNICATIONS, 2025, 16 (01)
  • [36] Granular multiple kernel learning for identifying RNA-binding protein residues via integrating sequence and structure information
    Yang, Chao
    Ding, Yijie
    Meng, Qiaozhen
    Tang, Jijun
    Guo, Fei
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (17) : 11387 - 11399
  • [37] PPI-Miner: A Structure and Sequence Motif Co-Driven Protein-Protein Interaction Mining and Modeling Computational Method
    Wang, Lin
    Li, Feng-lei
    Ma, Xin-yue
    Cang, Yong
    Bai, Fang
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (23) : 6160 - 6171
  • [38] CovET: A covariation-evolutionary trace method that identifies protein structure-function modules
    Konecki, Daniel M.
    Hamrick, Spencer
    Wang, Chen
    Agosto, Melina A.
    Wensel, Theodore G.
    Lichtarge, Olivier
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2023, 299 (07)
  • [39] SFAPS: an R package for Structure/Function Analysis of Protein Sequences based on Informational Spectrum Method
    Deng, Suping
    Yuan, Jinghua
    Huang, Deshuang
    Wang, Zhen
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [40] SFAPS: An R package for structure/function analysis of protein sequences based on informational spectrum method
    Deng, Su-Ping
    Huang, De-Shuang
    METHODS, 2014, 69 (03) : 207 - 212