Evidence for widespread translation of 5′ untranslated regions

Cited by: 0
Authors
Rodriguez, Jose Manuel [1,2]
Abascal, Federico [3 ]
Cerdan-Velez, Daniel [4 ]
Gomez, Laura Martinez [4 ]
Vazquez, Jesus [1,2]
Tress, Michael L. [4 ]
Affiliations
[1] Ctr Nacl Invest Cardiovasc Carlos III CN, Cardiovasc Prote Lab, Madrid 28029, Spain
[2] CIBER Enfermedades Cardiovasc CIBERCV, Madrid 28029, Spain
[3] Wellcome Sanger Inst, Somat Evolut Grp, Wellcome Genome Campus, Hinxton CB10 1SA, England
[4] Spanish Natl Canc Res Ctr CNIO, Bioinformat Unit, 28029 AC, Madrid, Spain
Funding
US National Institutes of Health (NIH)
Keywords
INITIATION; SEQUENCE; DATABASE; IDENTIFICATION; ANNOTATION; PROTEINS; PROTEOME;
DOI
10.1093/nar/gkae571
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification codes
071010 ; 081704 ;
Abstract
Ribosome profiling experiments support the translation of a range of novel human open reading frames. By contrast, most peptides from large-scale proteomics experiments derive from just one source, 5′ untranslated regions. Across the human genome we find evidence for 192 translated upstream regions, most of which would produce protein isoforms with extended N-terminal ends. Almost all of these N-terminal extensions are from highly abundant genes, which suggests that the novel regions we detect are just the tip of the iceberg. These upstream regions have characteristics that are not typical of coding exons. Their GC-content is remarkably high, even higher than 5′ regions in other genes, and a large majority have non-canonical start codons. Although some novel upstream regions have cross-species conservation (five have orthologues in invertebrates, for example), the reading frames of two thirds are not conserved beyond simians. These non-conserved regions also have no evidence of purifying selection, which suggests that much of this translation is not functional. In addition, non-conserved upstream regions have significantly more peptides in cancer cell lines than would be expected, a strong indication that an aberrant or noisy translation initiation process may play an important role in translation from upstream regions.
Pages: 8112-8126
Page count: 15