Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures

Cited by: 42
Authors
Leong, Alyssa Zi-Xin [1]
Lee, Pey Yee [1]
Mohtar, M. Aiman [1]
Syafruddin, Saiful Effendi [1]
Pung, Yuh-Fen [2]
Low, Teck Yew [1]
Affiliations
[1] Univ Kebangsaan Malaysia, UKM Med Mol Biol Inst UMBI, Kuala Lumpur 56000, Malaysia
[2] Univ Nottingham Malaysia, Sch Pharm, Div Biomed Sci, Semenyih 43500, Selangor, Malaysia
Keywords
Short open reading frame (sORF); Small open reading frame (smORF); Microproteins; Ribosome profiling (RIBO-Seq); Mass spectrometry; Proteogenomics; RIBOSOME PROFILING REVEALS; MESSENGER-RNA; PROTEIN IDENTIFICATION; FUNCTIONAL ANNOTATION; ENCODED PEPTIDES; UPSTREAM ORFS; IN-VIVO; TRANSLATION; PROTEOMICS; DISCOVERY
DOI
10.1186/s12929-022-00802-5
Chinese Library Classification (CLC) number
Q2 [Cell biology]
Subject classification codes
071009; 090102
Abstract
A short open reading frame (sORF) comprises <= 300 bases and encodes a microprotein, or sORF-encoded protein (SEP), of <= 100 amino acids. Long dismissed by genome annotation pipelines as meaningless noise, sORFs were shown to have coding potential by ribosome profiling (RIBO-Seq), which revealed sORF-based transcripts at various genomic locations. Initially, however, there was little experimental evidence that the corresponding microproteins are stable and functional. With recent advances in multi-omics, the identification, validation, and functional characterisation of sORFs and microproteins have become feasible. In this review, we discuss the history and development of the emerging research field of sORFs and microproteins. In particular, we focus on an array of bioinformatics and omics approaches used for predicting, sequencing, validating, and characterising these recently discovered entities. These strategies include RIBO-Seq, which detects sORF transcripts via ribosome footprints, and mass spectrometry (MS)-based proteomics for sequencing the resultant microproteins. Our discussion then extends to the functional characterisation of microproteins through CRISPR/Cas9 screens and protein-protein interaction (PPI) studies. Beyond detection methodologies, we also highlight the challenges and potential solutions in identifying and validating sORFs and their microproteins. The novelty of this review lies in its emphasis on validating the functional roles of microproteins, which could contribute to the future landscape of microproteomics.
Pages: 15