An Integrated Approach for RNA-seq Data Normalization

被引:5
|
作者
Yang, Shengping [1 ,2 ]
Mercante, Donald E. [2 ]
Zhang, Kun [3 ]
Fang, Zhide [2 ]
机构
[1] Texas Tech Univ, Hlth Sci Ctr, Sch Med, Dept Pathol, Lubbock, TX 79430 USA
[2] LSU Hlth Sci Ctr, Sch Publ Hlth, Biostat Program, New Orleans, LA 70112 USA
[3] Xavier Univ Louisiana, Dept Comp Sci, New Orleans, LA 70125 USA
来源
CANCER INFORMATICS | 2016年 / 15卷
基金
美国国家卫生研究院;
关键词
DNA copy number alterations; RNA-seq; normalization;
D O I
10.4137/CIN.S39781
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background: DNA copy number alteration is common in many cancers. Studies have shown that insertion or deletion of DNA sequences can directly alter gene expression, and significant correlation exists between DNA copy number and gene expression. Data normalization is a critical step in the analysis of gene expression generated by RNA-seq technology. Successful normalization reduces/removes unwanted nonbiological variations in the data, while keeping meaningful information intact. However, as far as we know, no attempt has been made to adjust for the variation due to DNA copy number changes in RNA-seq data normalization. Results: In this article, we propose an integrated approach for RNA-seq data normalization. Comparisons show that the proposed normalization can improve power for downstream differentially expressed gene detection and generate more biologically meaningful results in gene profiling. In addition, our findings show that due to the effects of copy number changes, some housekeeping genes are not always suitable internal controls for studying gene expression. Conclusions: Using information from DNA copy number, integrated approach is successful in reducing noises due to both biological and nonbiological causes in RNA-seq data, thus increasing the accuracy of gene profiling.
引用
收藏
页码:129 / 141
页数:13
相关论文
共 50 条
  • [31] Comparing the normalization methods for the differential analysis of Illumina high-throughput RNA-Seq data
    Li, Peipei
    Piao, Yongjun
    Shon, Ho Sun
    Ryu, Keun Ho
    BMC BIOINFORMATICS, 2015, 16
  • [32] LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data
    Bingqing Lin
    Li-Feng Zhang
    Xin Chen
    BMC Genomics, 15
  • [33] LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data
    Lin, Bingqing
    Zhang, Li-Feng
    Chen, Xin
    BMC GENOMICS, 2014, 15
  • [34] Selecting between-sample RNA-Seq normalization methods from the perspective of their assumptions
    Evans, Ciaran
    Hardin, Johanna
    Stoebel, Daniel M.
    BRIEFINGS IN BIOINFORMATICS, 2018, 19 (05) : 776 - 792
  • [35] Using normalization to resolve RNA-Seq biases caused by amplification from minimal input
    Ager-Wick, Eirill
    Henkel, Christiaan V.
    Haug, Trude M.
    Weltzien, Finn-Arne
    PHYSIOLOGICAL GENOMICS, 2014, 46 (21) : 808 - 820
  • [36] cdev: a ground-truth based measure to evaluate RNA-seq normalization performance
    Tran, Diem-Trang
    Might, Matthew
    PEERJ, 2021, 9
  • [37] Gene Ontology based housekeeping gene selection for RNA-seq normalization
    Chen, Chien-Ming
    Lu, Yu-Lun
    Sio, Chi-Pong
    Wu, Guan-Chung
    Tzou, Wen-Shyong
    Pai, Tun-Wen
    METHODS, 2014, 67 (03) : 354 - 363
  • [38] Impact of gene annotation choice on the quantification of RNA-seq data
    Chisanga, David
    Liao, Yang
    Shi, Wei
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [39] A comparison of methods for differential expression analysis of RNA-seq data
    Soneson, Charlotte
    Delorenzi, Mauro
    BMC BIOINFORMATICS, 2013, 14
  • [40] Impact of gene annotation choice on the quantification of RNA-seq data
    David Chisanga
    Yang Liao
    Wei Shi
    BMC Bioinformatics, 23