Integration of Machine Learning Improves The Prediction Accuracy of Molecular Modelling for M. jannaschii Tyrosyl-tRNA Synthetase Substrate Specificity

被引:0
作者
Duan Bing-Ya [1 ]
Sun Ying-Fei [1 ]
机构
[1] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
tyrosyl-tRNA synthetase; genetic code expansion; enzyme substrate specificity; Rosetta; molecular modelling; machine learning;
D O I
10.16476/j.pibb.2020.0425
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Design of enzyme binding pocket to accommodate substrates with different chemical structure is a great challenge. Traditionally, thousands even millions of mutants have to be screened in wet-lab experiments to find a ligand-specific mutant and large amount of time and resources are consumed. To accelerate the screening process, we propose a novel workflow through integration of molecular modeling and data-driven machine learning method to generate mutant libraries with high enrichment ratio for recognition of specific substrate. We collected all the M. janonschii tyrosyl-tRNA synthetase (Mj. TyrRS) mutants reported in the literature to compare and analyze the sequence and structural feature and difference between mutant and wild type Mj. TyrRS. Mj. TyrRS is used as an example since the sequences and structures of many unnatural amino acid specific Mj. TyrRS mutants have been reported. Based on the crystal structures of different Mj. TyrRS mutants and Rosetta modeling result, we found D158G/P is the critical residue which influences the backbone disruption of helix with residue 158-163. Our results showed that compared with random mutation, Rosetta modeling and score function calculation can elevate the enrichment ratio of desired mutants by 2-fold in a test library having 687 mutants, while after calibration by machine learning model trained using known data of Mj. TyrRS mutants and ligand, the enrichment ratio can be elevated by 11-fold. This molecular modeling and machine learning-integrated workflow is anticipated to significantly benefit to the Mj. tyrRS mutant screening and substantially reduce the time and cost of wet-lab experiments. Besides, this novel process will have broad application in the field of computational protein design.
引用
收藏
页码:1214 / 1232
页数:19
相关论文
共 67 条
[1]  
Chin J W., Expanding and reprogramming the genetic code of cells and animals, Annu Rev Biochem, 83, pp. 379-408, (2014)
[2]  
Yang F, Yu X, Liu C, Et al., Phospho-selective mechanisms of arrestin conformations and functions revealed by unnatural amino acid incorporation and (19)F-NMR, Nat Commun, 6, (2015)
[3]  
Li F, Shi P, Li J, Et al., A genetically encoded 19F NMR probe for tyrosine phosphorylation, Angew Chem Int Ed Engl, 52, 14, pp. 3958-3962, (2013)
[4]  
Yokoyama K, Uhlin U, Stubbe J., Site-specific incorporation of 3-nitrotyrosine as a probe of pk(a) perturbation of redox-active tyrosines in ribonucleotide reductase, J Am Chem Soc, 132, 24, pp. 8385-8397, (2010)
[5]  
Ugwumba I N, Ozawa K, Xu Z Q, Et al., Improving a natural enzyme activity through incorporation of unnatural amino acids, J Am Chem Soc, 133, 2, pp. 326-333, (2011)
[6]  
Drienovska I, Roelfes G., Expanding the enzyme universe with genetically encoded unnatural amino acids, Nat Catal, 3, pp. 193-202, (2020)
[7]  
Liu X H, Kang F Y, Hu C, Et al., A genetically encoded photosensitizerproteinfacilitatestherationaldesignofaminiature photocatalytic CO<sub>2</sub>-reducing enzyme, Nat Chem, 10, 12, pp. 1201-1206, (2018)
[8]  
Drienovska I, Alonso-Cotchico L, Vidossich P, Et al., Design of an enantioselective artificial metallo-hydratase enzyme containing an unnatural metal-binding amino acid, Chem Sci, 8, 10, pp. 7228-7235, (2017)
[9]  
Li Q, Chen Q, Klauser P C, Et al., Developing covalent protein drugs via proximity-enabled reactive therapeutics, Cell, 182, 1, pp. 85-97, (2020)
[10]  
Vyas V K, Ukawala R D, Ghate M, Et al., Homology modeling a fast tool for drug discovery: current perspectives, Indian J Pharm Sci, 74, 1, pp. 1-17, (2012)