Graphormer supervised de novo protein design method and function validation

被引:2
|
作者
Mu, Junxi [1 ]
Li, Zhengxin [2 ]
Zhang, Bo [2 ]
Zhang, Qi [3 ]
Iqbal, Jamshed [4 ]
Wadood, Abdul [5 ]
Wei, Ting [2 ,7 ]
Feng, Yan [6 ,7 ]
Chen, Hai-Feng [2 ,7 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Bioinformat, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, Microbiol, Shanghai, Peoples R China
[4] COMSATS Univ Islamabad, Chem, Islamabad, Pakistan
[5] Abdul Wali Khan Univ Mardan, Bioinformat, Mardan, Pakistan
[6] Shanghai Jiao Tong Univ, Biochem, Shanghai, Peoples R China
[7] Shanghai Jiao Tong Univ, Natl Expt Teaching Ctr Life Sci & Biotechnol, Sch Life Sci & Biotechnol, Dept Bioinformat & Biostat,State Key Lab Microbial, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
protein sequence design; Graphormer architecture; GPD model; function validation; SEQUENCE DESIGN; MODEL;
D O I
10.1093/bib/bbae135
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein design is central to nearly all protein engineering problems, as it can enable the creation of proteins with new biological functions, such as improving the catalytic efficiency of enzymes. One key facet of protein design, fixed-backbone protein sequence design, seeks to design new sequences that will conform to a prescribed protein backbone structure. Nonetheless, existing sequence design methods present limitations, such as low sequence diversity and shortcomings in experimental validation of the designed functional proteins. These inadequacies obstruct the goal of functional protein design. To improve these limitations, we initially developed the Graphormer-based Protein Design (GPD) model. This model utilizes the Transformer on a graph-based representation of three-dimensional protein structures and incorporates Gaussian noise and a sequence random masks to node features, thereby enhancing sequence recovery and diversity. The performance of the GPD model was significantly better than that of the state-of-the-art ProteinMPNN model on multiple independent tests, especially for sequence diversity. We employed GPD to design CalB hydrolase and generated nine artificially designed CalB proteins. The results show a 1.7-fold increase in catalytic activity compared to that of the wild-type CalB and strong substrate selectivity on p-nitrophenyl acetate with different carbon chain lengths (C2-C16). Thus, the GPD method could be used for the de novo design of industrial enzymes and protein drugs. The code was released at https://github.com/decodermu/GPD.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Sparks of function by de novo protein design
    Chu, Alexander E.
    Lu, Tianyu
    Huang, Po-Ssu
    NATURE BIOTECHNOLOGY, 2024, 42 (02) : 203 - 215
  • [2] Sparks of function by de novo protein design
    Alexander E. Chu
    Tianyu Lu
    Po-Ssu Huang
    Nature Biotechnology, 2024, 42 : 203 - 215
  • [3] De novo design of protein structure and function with RFdiffusion
    Joseph L. Watson
    David Juergens
    Nathaniel R. Bennett
    Brian L. Trippe
    Jason Yim
    Helen E. Eisenach
    Woody Ahern
    Andrew J. Borst
    Robert J. Ragotte
    Lukas F. Milles
    Basile I. M. Wicky
    Nikita Hanikel
    Samuel J. Pellock
    Alexis Courbet
    William Sheffler
    Jue Wang
    Preetham Venkatesh
    Isaac Sappington
    Susana Vázquez Torres
    Anna Lauko
    Valentin De Bortoli
    Emile Mathieu
    Sergey Ovchinnikov
    Regina Barzilay
    Tommi S. Jaakkola
    Frank DiMaio
    Minkyung Baek
    David Baker
    Nature, 2023, 620 : 1089 - 1100
  • [4] De novo design of protein structure and function with RFdiffusion
    Watson, Joseph L.
    Juergens, David
    Bennett, Nathaniel R.
    Trippe, Brian L.
    Yim, Jason
    Eisenach, Helen E.
    Ahern, Woody
    Borst, Andrew J.
    Ragotte, Robert J.
    Milles, Lukas F.
    Wicky, Basile I. M.
    Hanikel, Nikita
    Pellock, Samuel J.
    Courbet, Alexis
    Sheffler, William
    Wang, Jue
    Venkatesh, Preetham
    Sappington, Isaac
    Torres, Susana Vazquez
    Lauko, Anna
    De Bortoli, Valentin
    Mathieu, Emile
    Ovchinnikov, Sergey
    Barzilay, Regina
    Jaakkola, Tommi S.
    Dimaio, Frank
    Baek, Minkyung
    Baker, David
    NATURE, 2023, 620 (7976) : 1089 - 1100
  • [5] De novo protein design
    O'Driscoll, Cath
    CHEMISTRY & INDUSTRY, 2020, 84 (03) : 8 - 8
  • [6] De novo protein design
    Degrado, William
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 257
  • [7] De novo protein design
    Degrado, William
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [8] De novo protein design
    Koehl, P
    Levitt, M
    DYNAMICS, STRUCTURE AND FUNCTION OF BIOLOGICAL MACROMOLECULES, 2001, 315 : 57 - 75
  • [9] Protein de novo design
    Tuchscherer, G
    Dumy, P
    Mutter, M
    CHIMIA, 1996, 50 (12) : 644 - 648
  • [10] Validation of the SPROUT de novo design program
    Law, JMS
    Fung, DYK
    Zsoldos, Z
    Simon, A
    Szabo, Z
    Csizmadia, IG
    Johnson, AP
    JOURNAL OF MOLECULAR STRUCTURE-THEOCHEM, 2003, 666 : 651 - 657