Evaluation of microarray data normalization procedures using spike-in experiments

被引:14
|
作者
Ryden, Patrik [1 ]
Andersson, Henrik
Landfors, Mattias
Naslund, Linda
Hartmanova, Blanka
Noppa, Laila
Sjostedt, Anders
机构
[1] Umea Univ, Dept Clin Microbiol, Div Clin Bacteriol, SE-90187 Umea, Sweden
[2] Umea Univ, Dept Mat & Math Stat, SE-90187 Umea, Sweden
[3] Univ Def, Proteome Ctr Study Intracellular Parasitism Bacte, Fac Mil Hlth Sci, Hradec Kralove 50001, Czech Republic
[4] Swedish Def Res Agcy, Div NBC Def, Dept Med Countermeasures, SE-90182 Umea, Sweden
关键词
D O I
10.1186/1471-2105-7-300
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Recently, a large number of methods for the analysis of microarray data have been proposed but there are few comparisons of their relative performances. By using so-called spike-in experiments, it is possible to characterize the analyzed data and thereby enable comparisons of different analysis methods. Results: A spike-in experiment using eight in-house produced arrays was used to evaluate established and novel methods for filtration, background adjustment, scanning, channel adjustment, and censoring. The S-plus package EDMA, a stand-alone tool providing characterization of analyzed cDNA-microarray data obtained from spike-in experiments, was developed and used to evaluate 252 normalization methods. For all analyses, the sensitivities at low false positive rates were observed together with estimates of the overall bias and the standard deviation. In general, there was a trade-off between the ability of the analyses to identify differentially expressed genes (i.e. the analyses' sensitivities) and their ability to provide unbiased estimators of the desired ratios. Virtually all analysis underestimated the magnitude of the regulations; often less than 50% of the true regulations were observed. Moreover, the bias depended on the underlying mRNA-concentration; low concentration resulted in high bias. Many of the analyses had relatively low sensitivities, but analyses that used either the constrained model (i.e. a procedure that combines data from several scans) or partial filtration (a novel method for treating data from so-called not-found spots) had with few exceptions high sensitivities. These methods gave considerable higher sensitivities than some commonly used analysis methods. Conclusion: The use of spike-in experiments is a powerful approach for evaluating microarray preprocessing procedures. Analyzed data are characterized by properties of the observed log-ratios and the analysis' ability to detect differentially expressed genes. If bias is not a major problem; we recommend the use of either the CM-procedure or partial filtration.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Evaluation of microarray data normalization procedures using spike-in experiments
    Patrik Rydén
    Henrik Andersson
    Mattias Landfors
    Linda Näslund
    Blanka Hartmanová
    Laila Noppa
    Anders Sjöstedt
    BMC Bioinformatics, 7
  • [2] Consolidated strategy for the analysis of microarray spike-in data
    McCall, Matthew N.
    Irizarry, Rafael A.
    NUCLEIC ACIDS RESEARCH, 2008, 36 (17) : e108
  • [3] The Wild West of spike-in normalization
    Patel, Lauren A.
    Cao, Yuwei
    Mendenhall, Eric M.
    Benner, Christopher
    Goren, Alon
    NATURE BIOTECHNOLOGY, 2024, 42 (09) : 1343 - 1349
  • [4] Normalization of human RNA-seq experiments using chimpanzee RNA as a spike-in standard
    Yu, Hannah
    Hahn, Yoonsoo
    Park, Sang-Ryoul
    Chung, Sun-Ku
    Jeong, Sangkyun
    Yang, Inchul
    SCIENTIFIC REPORTS, 2016, 6
  • [5] Normalization of human RNA-seq experiments using chimpanzee RNA as a spike-in standard
    Hannah Yu
    Yoonsoo Hahn
    Sang-Ryoul Park
    Sun-Ku Chung
    Sangkyun Jeong
    Inchul Yang
    Scientific Reports, 6
  • [6] Normalization of low-density microarray using external spike-in controls: analysis of macrophage cell lines expression profile
    Paolo Fardin
    Stefano Moretti
    Barbara Biasotti
    Annamaria Ricciardi
    Stefano Bonassi
    Luigi Varesio
    BMC Genomics, 8
  • [7] Normalization of low-density microarray using external spike-in controls: analysis of macrophage cell lines expression profile
    Fardin, Paolo
    Moretti, Stefano
    Biasotti, Barbara
    Ricciardi, Annamaria
    Bonassi, Stefano
    Varesio, Luigi
    BMC GENOMICS, 2007, 8 (1)
  • [8] A wholly defined Agilent microarray spike-in dataset
    Zhu, Qianqian
    Miecznikowski, Jeffrey C.
    Halfon, Marc S.
    BIOINFORMATICS, 2011, 27 (09) : 1284 - 1289
  • [9] Performance assessment of copy number microarray platforms using a spike-in experiment
    Halper-Stromberg, Eitan
    Frelin, Laurence
    Ruczinski, Ingo
    Scharpf, Robert
    Jie, Chunfa
    Carvalho, Benilton
    Hao, Haiping
    Hetrick, Kurt
    Jedlicka, Anne
    Dziedzic, Amanda
    Doheny, Kim
    Scott, Alan F.
    Baylin, Steve
    Pevsner, Jonathan
    Spencer, Forrest
    Irizarry, Rafael A.
    BIOINFORMATICS, 2011, 27 (08) : 1052 - 1060
  • [10] Assessing the reliability of spike-in normalization for analyses of single-cell RNA sequencing data
    Lun, Aaron T. L.
    Calero-Nieto, Fernando J.
    Haim-Vilmovsky, Liora
    Gottgens, Berthold
    Marioni, John C.
    GENOME RESEARCH, 2017, 27 (11) : 1795 - 1806