An Integrated Approach for RNA-seq Data Normalization

被引:5
|
作者
Yang, Shengping [1 ,2 ]
Mercante, Donald E. [2 ]
Zhang, Kun [3 ]
Fang, Zhide [2 ]
机构
[1] Texas Tech Univ, Hlth Sci Ctr, Sch Med, Dept Pathol, Lubbock, TX 79430 USA
[2] LSU Hlth Sci Ctr, Sch Publ Hlth, Biostat Program, New Orleans, LA 70112 USA
[3] Xavier Univ Louisiana, Dept Comp Sci, New Orleans, LA 70125 USA
来源
CANCER INFORMATICS | 2016年 / 15卷
基金
美国国家卫生研究院;
关键词
DNA copy number alterations; RNA-seq; normalization;
D O I
10.4137/CIN.S39781
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background: DNA copy number alteration is common in many cancers. Studies have shown that insertion or deletion of DNA sequences can directly alter gene expression, and significant correlation exists between DNA copy number and gene expression. Data normalization is a critical step in the analysis of gene expression generated by RNA-seq technology. Successful normalization reduces/removes unwanted nonbiological variations in the data, while keeping meaningful information intact. However, as far as we know, no attempt has been made to adjust for the variation due to DNA copy number changes in RNA-seq data normalization. Results: In this article, we propose an integrated approach for RNA-seq data normalization. Comparisons show that the proposed normalization can improve power for downstream differentially expressed gene detection and generate more biologically meaningful results in gene profiling. In addition, our findings show that due to the effects of copy number changes, some housekeeping genes are not always suitable internal controls for studying gene expression. Conclusions: Using information from DNA copy number, integrated approach is successful in reducing noises due to both biological and nonbiological causes in RNA-seq data, thus increasing the accuracy of gene profiling.
引用
收藏
页码:129 / 141
页数:13
相关论文
共 50 条
  • [21] Cross-platform normalization of microarray and RNA-seq data for machine learning applications
    Thompson, Jeffrey A.
    Tan, Jie
    Greene, Casey S.
    PEERJ, 2016, 4
  • [22] DegNorm: normalization of generalized transcript degradation improves accuracy in RNA-seq analysis
    Xiong, Bin
    Yang, Yiben
    Fineis, Frank R.
    Wang, Ji-Ping
    GENOME BIOLOGY, 2019, 20 (1)
  • [23] DegNorm: normalization of generalized transcript degradation improves accuracy in RNA-seq analysis
    Bin Xiong
    Yiben Yang
    Frank R. Fineis
    Ji-Ping Wang
    Genome Biology, 20
  • [24] Dimensionality Reduction of RNA-Seq Data
    Al-Turaiki, Isra
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (03): : 31 - 36
  • [25] Analysis of clustered RNA-seq data
    Park, Hyunjin
    Lee, Seungyeoun
    Kim, Ye Jin
    Choi, Myung-Sook
    Park, Taesung
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 19 (01) : 19 - 31
  • [26] RNA-Seq Data: A Complexity Journey
    Capobianco, Enrico
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2014, 11 (19): : 123 - 130
  • [27] A Semi-parametric Bayesian Approach for Differential Expression Analysis of RNA-seq Data
    Liu, Fangfang
    Wang, Chong
    Liu, Peng
    JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2015, 20 (04) : 555 - 576
  • [28] Finding consistent patterns: A nonparametric approach for identifying differential expression in RNA-Seq data
    Li, Jun
    Tibshirani, Robert
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2013, 22 (05) : 519 - 536
  • [29] Systematic Selection of Reference Genes for the Normalization of Circulating RNA Transcripts in Pregnant Women Based on RNA-Seq Data
    Chim, Stephen S. C.
    Wong, Karen K. W.
    Chung, Claire Y. L.
    Lam, Stephanie K. W.
    Kwok, Jamie S. L.
    Lai, Chit-Ying
    Cheng, Yvonne K. Y.
    Hui, Annie S. Y.
    Meng, Meng
    Chan, Oi-Ka
    Tsui, Stephen K. W.
    Lee, Keun-Young
    Chan, Ting-Fung
    Leung, Tak-Yeung
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2017, 18 (08)
  • [30] Comparing the normalization methods for the differential analysis of Illumina high-throughput RNA-Seq data
    Peipei Li
    Yongjun Piao
    Ho Sun Shon
    Keun Ho Ryu
    BMC Bioinformatics, 16