A bioinformatic-assisted workflow for genome-wide identification of ncRNAs

被引:2
|
作者
Schmal, Matthias [1 ]
Girod, Crystal [2 ,4 ]
Yaver, Debbie [2 ,5 ]
Mach, Robert L. [3 ]
Mach-Aigner, Astrid R. [1 ,3 ]
机构
[1] TU Wien, Christian Doppler Lab Optimized Express Carbohydr, Inst Chem Environm & Biosci Engn, Gumpendorfer Str 1A, A-1060 Vienna, Austria
[2] Novozymes Inc, Prod Strain Technol, Davis, CA USA
[3] TU Wien, Inst Chem Environm & Biosci Engn, Gumpendorfer Str 1A, A-1060 Vienna, Austria
[4] Beckman Coulter Life Sci, San Jose, CA 95134 USA
[5] Nat Fynd, Chicago, IL USA
关键词
TRICHODERMA-REESEI; RFAM;
D O I
10.1093/nargab/lqac059
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
With the upcoming of affordable Next-Generation Sequencing technologies, the number of known non-protein coding RNAs increased drastically in recent years. Different types of non-coding RNAs (ncRNAs) emerged as key players in the regulation of gene expression on the RNA-RNA, RNA-DNA as well as RNA-protein level, ranging from involvement in chromatin remodeling and transcription regulation to post-transcriptional modifications. Prediction of ncRNAs involves the use of several bioinformatics tools and can be a daunting task for researchers. This led to the development of analysis pipelines such as UClncR and lncpipe. However, these pipelines are limited to datasets from human, mouse, zebrafish or fruit fly and are not able to analyze RNA sequencing data from other organisms. In this study, we developed the analysis pipeline Pinc (Pipeline for prediction of ncRNA) as an enhanced tool to predict ncRNAs based on sequencing data by removing transcripts that show protein-coding potential. Additionally, a feature for differential expression analysis of annotated genes as well as for identification of novel ncRNAs is implemented. Pinc uses Nextflow as a framework and is built with robust and well-established analysis tools. This will allow researchers to utilize sequencing data from every organism in order to reliably identify ncRNAs.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] A comparative genome-wide study of ncRNAs in trypanosomatids
    Tirza Doniger
    Rodolfo Katz
    Chaim Wachtel
    Shulamit Michaeli
    Ron Unger
    BMC Genomics, 11
  • [2] A comparative genome-wide study of ncRNAs in trypanosomatids
    Doniger, Tirza
    Katz, Rodolfo
    Wachtel, Chaim
    Michaeli, Shulamit
    Unger, Ron
    BMC GENOMICS, 2010, 11
  • [3] FactorY, a bioinformatic resource for genome-wide promoter analysis
    Guruceaga, Elizabeth
    Segura, Victor
    Corrales, Fernando J.
    Rubio, Angel
    COMPUTERS IN BIOLOGY AND MEDICINE, 2009, 39 (04) : 385 - 387
  • [4] Genome-wide target identification
    Charlotte Harrison
    Nature Reviews Drug Discovery, 2014, 13 (2) : 104 - 104
  • [5] Genome-Wide Identification, Transcript Profiling and Bioinformatic Analyses of GRAS Transcription Factor Genes in Rice
    Dutta, Mouboni
    Saha, Anusree
    Moin, Mazahar
    Kirti, Pulugurtha Bharadwaja
    FRONTIERS IN PLANT SCIENCE, 2021, 12
  • [6] Genome-wide bioinformatic and molecular analysis of introns in Saccharomyces cerevisiae
    Spingola, M
    Grate, L
    Haussler, D
    Ares, M
    RNA, 1999, 5 (02) : 221 - 234
  • [7] Genome-wide identification of enhancer elements
    Tulin, Sarah
    Barsi, Julius C.
    Bocconcelli, Carlo
    Smith, Joel
    INTERNATIONAL JOURNAL OF DEVELOPMENTAL BIOLOGY, 2016, 60 (4-6): : 141 - 150
  • [8] Towards genome-wide marker assisted breeding: genome-wide association study and genomic selection
    Iwata, Hiroyashi
    GENES & GENETIC SYSTEMS, 2011, 86 (06) : 393 - 393
  • [9] Genome-wide bioinformatic prediction and experimental evaluation of potential RNA thermometers
    Torsten Waldminghaus
    Lena C. Gaubig
    Franz Narberhaus
    Molecular Genetics and Genomics, 2007, 278 : 555 - 564
  • [10] Genome-wide bioinformatic prediction and experimental evaluation of potential RNA thermometers
    Waldminghaus, Torsten
    Gaubig, Lena C.
    Narberhaus, Franz
    MOLECULAR GENETICS AND GENOMICS, 2007, 278 (05) : 555 - 564