De novo assembly of transcriptomes and differential gene expression analysis using short-read data from emerging model organisms - a brief guide

Authors
Jackson, Daniel J. [1 ]
Cerveau, Nicolas [1 ]
Posnien, Nico [2 ]
Affiliations
[1] Univ Gottingen, Dept Geobiol, Goldschmidtstr 3, D-37077 Gottingen, Germany
[2] Univ Gottingen, GZMB, Dept Dev Biochem, Justus Von Liebig Weg 11, D-37077 Gottingen, Germany
Source
FRONTIERS IN ZOOLOGY | 2024 / Vol. 21 / Issue 01
Keywords
Transcriptome assembly; De novo assembly; RNA-seq; Short reads; Emerging model system; Genome; Annotation; Differential gene expression; RNA-SEQ; QUALITY ASSESSMENT; GENOME; TOOL; QUANTIFICATION; IDENTIFICATION; SELECTION; ALIGNMENT; SAMPLES; MISUSE;
DOI
10.1186/s12983-024-00538-y
Chinese Library Classification number
Q95 [Zoology];
Discipline classification code
071002 ;
Abstract
Many questions in biology benefit greatly from the use of a variety of model systems. High-throughput sequencing methods have been a triumph in the democratization of diverse model systems. They allow for the economical sequencing of an entire genome or transcriptome of interest, and with technical variations can even provide insight into genome organization and the expression and regulation of genes. The analysis and biological interpretation of such large datasets can present significant challenges that depend on the 'scientific status' of the model system. While high-quality genome and transcriptome references are readily available for well-established model systems, the establishment of such references for an emerging model system often requires extensive resources such as finances, expertise and computation capabilities. The de novo assembly of a transcriptome represents an excellent entry point for genetic and molecular studies in emerging model systems as it can efficiently assess gene content while also serving as a reference for differential gene expression studies. However, the process of de novo transcriptome assembly is non-trivial, and as a rule must be empirically optimized for every dataset. For the researcher working with an emerging model system, and with little to no experience with assembling and quantifying short-read data from the Illumina platform, these processes can be daunting. In this guide we outline the major challenges faced when establishing a reference transcriptome de novo and we provide advice on how to approach such an endeavor. We describe the major experimental and bioinformatic steps, and provide some broad recommendations and cautions for the newcomer to de novo transcriptome assembly and differential gene expression analyses. Moreover, we provide an initial selection of tools that can assist in the journey from raw short-read data to assembled transcriptome and lists of differentially expressed genes.
Pages: 18
Related papers
34 in total
  • [1] Optimizing de novo assembly of short-read RNA-seq data for phylogenomics
    Yang, Ya
    Smith, Stephen A.
    BMC GENOMICS, 2013, 14
  • [2] Optimizing de novo transcriptome assembly from short-read RNA-Seq data: a comparative study
    Zhao, Qiong-Yi
    Wang, Yi
    Kong, Yi-Meng
    Luo, Da
    Li, Xuan
    Hao, Pei
    BMC BIOINFORMATICS, 2011, 12 : S2
  • [3] Optimizing de novo assembly of short-read RNA-seq data for phylogenomics
    Ya Yang
    Stephen A Smith
    BMC Genomics, 14
  • [4] Analysis of gene expression for microminipig liver transcriptomes using parallel long-read technology and short-read sequencing
    Sakai, Chizuka
    Iwano, Shunsuke
    Shimizu, Makiko
    Onodera, Jun
    Uchida, Masashi
    Sakurada, Eri
    Yamazaki, Yuri
    Asaoka, Yoshiji
    Imura, Naoko
    Uno, Yasuhiro
    Murayama, Norie
    Hayashi, Ryoji
    Yamazaki, Hiroshi
    Miyamoto, Yohei
    BIOPHARMACEUTICS & DRUG DISPOSITION, 2016, 37 (04) : 220 - 232
  • [5] Qualitative De Novo Analysis of Full Length cDNA and Quantitative Analysis of Gene Expression for Common Marmoset (Callithrix jacchus) Transcriptomes Using Parallel Long-Read Technology and Short-Read Sequencing
    Shimizu, Makiko
    Iwano, Shunsuke
    Uno, Yasuhiro
    Uehara, Shotaro
    Inoue, Takashi
    Murayama, Norie
    Onodera, Jun
    Sasaki, Erika
    Yamazaki, Hiroshi
    PLOS ONE, 2014, 9 (06):
  • [6] Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data
    Duan, Jialei
    Xia, Chuan
    Zhao, Guangyao
    Jia, Jizeng
    Kong, Xiuying
    BMC GENOMICS, 2012, 13
  • [7] Optimizing de novo transcriptome assembly from short-read RNA-Seq data: a comparative study
    Qiong-Yi Zhao
    Yi Wang
    Yi-Meng Kong
    Da Luo
    Xuan Li
    Pei Hao
    BMC Bioinformatics, 12
  • [8] De novo transcriptome assembly and analysis of differential gene expression following peptidoglycan (PGN) challenge in Antheraea pernyi
    Liu, Yu
    Xin, Zhao-Zhe
    Zhang, Dai-Zhen
    Zhu, Xiao-Yu
    Wang, Ying
    Chen, Li
    Tang, Bo-Ping
    Zhou, Chun-Lin
    Chai, Xin-Yue
    Tian, Ji-Wu
    Liu, Qiu-Ning
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2018, 112 : 1199 - 1207
  • [9] Common workflow language (CWL)-based software pipeline for de novo genome assembly from long- and short-read data
    Korhonen, Pasi K.
    Hall, Ross S.
    Young, Neil D.
    Gasser, Robin B.
    GIGASCIENCE, 2019, 8 (04):
  • [10] Corset: enabling differential gene expression analysis for de novo assembled transcriptomes
    Davidson, Nadia M.
    Oshlack, Alicia
    GENOME BIOLOGY, 2014, 15 (07):