Evaluation of 244,000 synthetic sequences reveals design principles to optimize translation in Escherichia coli

被引:154
作者
Cambray, Guillaume [1 ,2 ]
Guimaraes, Joao C. [1 ,3 ]
Arkin, Adam Paul [3 ,4 ]
机构
[1] Univ Calif Berkeley, Calif Inst Quantitat Biosci, Berkeley, CA 94720 USA
[2] Univ Montpellier, INRA, DGIMI, Montpellier, France
[3] Univ Calif Berkeley, Dept Bioengn, Berkeley, CA 94720 USA
[4] Lawrence Berkeley Natl Lab, Environm Genom & Syst Biol Div, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
MESSENGER-RNA; GENE-EXPRESSION; CODON USAGE; PROTEIN-SYNTHESIS; TRANSCRIPTION; INITIATION; GROWTH; DETERMINANTS; PREFERENCES; EFFICIENCY;
D O I
10.1038/nbt.4238
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Comparative analyses of natural and mutated sequences have been used to probe mechanisms of gene expression, but small sample sizes may produce biased outcomes. We applied an unbiased design-of-experiments approach to disentangle factors suspected to affect translation efficiency in E.coli. We precisely designed 244,000 DNA sequences implementing 56 replicates of a full factorial design to evaluate nucleotide, secondary structure, codon and amino acid properties in combination. For each sequence, we measured reporter transcript abundance and decay, polysome profiles, protein production and growth rates. Associations between designed sequences properties and these consequent phenotypes were dominated by secondary structures and their interactions within transcripts. We confirmed that transcript structure generally limits translation initiation and demonstrated its physiological cost using an epigenetic assay. Codon composition has a sizable impact on translatability, but only in comparatively rare elongation-limited transcripts. We propose a set of design principles to improve translation efficiency that would benefit from more accurate prediction of secondary structures in vivo.
引用
收藏
页码:1005 / +
页数:17
相关论文
共 76 条
  • [1] SCANNING MODEL FOR TRANSLATIONAL REINITIATION IN EUBACTERIA
    ADHIN, MR
    VANDUIN, J
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 213 (04) : 811 - 818
  • [2] Multifactorial Determinants of Protein Expression in Prokaryotic Open Reading Frames
    Allert, Malin
    Cox, J. Colin
    Hellinga, Homme W.
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2010, 402 (05) : 905 - 918
  • [3] CODON PREFERENCES IN FREE-LIVING MICROORGANISMS
    ANDERSSON, SGE
    KURLAND, CG
    [J]. MICROBIOLOGICAL REVIEWS, 1990, 54 (02) : 198 - 210
  • [4] [Anonymous], FAST AGNOSTIC METHOD
  • [5] [Anonymous], 2017, R LANG ENV STAT COMP
  • [6] Codon influence on protein expression in E. coli correlates with mRNA levels
    Boel, Gregory
    Letso, Reka
    Neely, Helen
    Price, W. Nicholson
    Wong, Kam-Ho
    Su, Min
    Luff, Jon D.
    Valecha, Mayank
    Everett, John K.
    Acton, Thomas B.
    Xiao, Rong
    Montelione, Gaetano T.
    Aalberts, Daniel P.
    Hunt, John F.
    [J]. NATURE, 2016, 529 (7586) : 358 - +
  • [7] A comparison of normalization methods for high density oligonucleotide array data based on variance and bias
    Bolstad, BM
    Irizarry, RA
    Åstrand, M
    Speed, TP
    [J]. BIOINFORMATICS, 2003, 19 (02) : 185 - 193
  • [8] Translation rate is controlled by coupled trade-offs between site accessibility, selective RNA unfolding and sliding at upstream standby sites
    Borujeni, Amin Espah
    Channarasappa, Anirudh S.
    Salis, Howard M.
    [J]. NUCLEIC ACIDS RESEARCH, 2014, 42 (04) : 2646 - 2659
  • [9] BULMER M, 1991, GENETICS, V129, P897
  • [10] Measurement and modeling of intrinsic transcription terminators
    Cambray, Guillaume
    Guimaraes, Joao C.
    Mutalik, Vivek K.
    Lam, Colin
    Quynh-Anh Mai
    Thimmaiah, Tim
    Carothers, James M.
    Arkin, Adam P.
    Endy, Drew
    [J]. NUCLEIC ACIDS RESEARCH, 2013, 41 (09) : 5139 - 5148