Impact of human gene annotations on RNA-seq differential expression analysis

被引:6
|
作者
Hamaguchi, Yu [1 ]
Zeng, Chao [1 ,2 ]
Hamada, Michiaki [1 ,2 ,3 ,4 ]
机构
[1] Waseda Univ, Fac Sci & Engn, Shinjuku Ku, 55N-06-10,3-4-1 Okubo, Tokyo 1698555, Japan
[2] Waseda Univ, AIST, Computat Bio Big Data Open Innovat Lab CBBD OIL, Shinjuku Ku, 3-4-1 Okubo, Tokyo 1698555, Japan
[3] Waseda Univ, Inst Med Oriented Struct Biol, Shinjuku Ku, 2-2 Wakamatsu Cho, Tokyo 1628480, Japan
[4] Nippon Med Sch, Grad Sch Med, Bunkyo Ku, 1-1-5 Sendagi, Tokyo 1138602, Japan
关键词
RNA-seq; Differential expression analysis; Benchmarking; Gene annotation; QUANTIFICATION; TRANSCRIPTOME; DISCOVERY; ALIGNMENT; HISAT;
D O I
10.1186/s12864-021-08038-7
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background Differential expression (DE) analysis of RNA-seq data typically depends on gene annotations. Different sets of gene annotations are available for the human genome and are continually updated-a process complicated with the development and application of high-throughput sequencing technologies. However, the impact of the complexity of gene annotations on DE analysis remains unclear. Results Using "mappability", a metric of the complexity of gene annotation, we compared three distinct human gene annotations, GENCODE, RefSeq, and NONCODE, and evaluated how mappability affected DE analysis. We found that mappability was significantly different among the human gene annotations. We also found that increasing mappability improved the performance of DE analysis, and the impact of mappability mainly evident in the quantification step and propagated downstream of DE analysis systematically. Conclusions We assessed how the complexity of gene annotations affects DE analysis using mappability. Our findings indicate that the growth and complexity of gene annotations negatively impact the performance of DE analysis, suggesting that an approach that excludes unnecessary gene models from gene annotations improves the performance of DE analysis.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Impact of RNA-seq data analysis algorithms on gene expression estimation and downstream prediction
    Li Tong
    Po-Yen Wu
    John H. Phan
    Hamid R. Hassazadeh
    Weida Tong
    May D. Wang
    Scientific Reports, 10
  • [42] Differential expression in RNA-seq: A matter of depth
    Tarazona, Sonia
    Garcia-Alcalde, Fernando
    Dopazo, Joaquin
    Ferrer, Alberto
    Conesa, Ana
    GENOME RESEARCH, 2011, 21 (12) : 2213 - 2223
  • [43] Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates
    Sahar Al Seesi
    Yvette Temate Tiagueu
    Alexander Zelikovsky
    Ion I Măndoiu
    BMC Genomics, 15
  • [44] Differential gene expression analysis using RNA-seq in the blood of goats exposed to transportation stress
    Naldurtiker, Aditya
    Batchu, Phaneendra
    Kouakou, Brou
    Terrill, Thomas H.
    McCommon, George W.
    Kannan, Govind
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [45] Analysis of differential gene expression by RNA-seq data in ABCG1 knockout mice
    Shen, Si-Qi
    Yan, Xiao-Wei
    Li, Peng-Tao
    Ji, Xiao-Hui
    GENE, 2019, 689 : 24 - 33
  • [46] Differential gene expression analysis using RNA-seq in the blood of goats exposed to transportation stress
    Aditya Naldurtiker
    Phaneendra Batchu
    Brou Kouakou
    Thomas H. Terrill
    George W. McCommon
    Govind Kannan
    Scientific Reports, 13
  • [47] Correction: Corrigendum: Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks
    Cole Trapnell
    Adam Roberts
    Loyal Goff
    Geo Pertea
    Daehwan Kim
    David R Kelley
    Harold Pimentel
    Steven L Salzberg
    John L Rinn
    Lior Pachter
    Nature Protocols, 2014, 9 : 2513 - 2513
  • [48] Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates
    Al Seesi, Sahar
    Tiagueu, Yvette Temate
    Zelikovsky, Alexander
    Mandoiu, Ion I.
    BMC GENOMICS, 2014, 15
  • [49] Differential Gene Expression Analysis of RNA-Seq Data for Detecting Internal Targets of Antimicrobial Peptides
    Mohammadi, Salimeh
    Prokopczuk, Federico
    Li, Xintian
    Taheri-Araghi, Sattar
    BIOPHYSICAL JOURNAL, 2020, 118 (03) : 383A - 383A
  • [50] Differential gene network analysis from single cell RNA-seq
    Wang, Yikai
    Wu, Hao
    Yu, Tianwei
    JOURNAL OF GENETICS AND GENOMICS, 2017, 44 (06) : 331 - 334