Evaluation of microarray data normalization procedures using spike-in experiments

被引:14
|
作者
Ryden, Patrik [1 ]
Andersson, Henrik
Landfors, Mattias
Naslund, Linda
Hartmanova, Blanka
Noppa, Laila
Sjostedt, Anders
机构
[1] Umea Univ, Dept Clin Microbiol, Div Clin Bacteriol, SE-90187 Umea, Sweden
[2] Umea Univ, Dept Mat & Math Stat, SE-90187 Umea, Sweden
[3] Univ Def, Proteome Ctr Study Intracellular Parasitism Bacte, Fac Mil Hlth Sci, Hradec Kralove 50001, Czech Republic
[4] Swedish Def Res Agcy, Div NBC Def, Dept Med Countermeasures, SE-90182 Umea, Sweden
关键词
D O I
10.1186/1471-2105-7-300
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Recently, a large number of methods for the analysis of microarray data have been proposed but there are few comparisons of their relative performances. By using so-called spike-in experiments, it is possible to characterize the analyzed data and thereby enable comparisons of different analysis methods. Results: A spike-in experiment using eight in-house produced arrays was used to evaluate established and novel methods for filtration, background adjustment, scanning, channel adjustment, and censoring. The S-plus package EDMA, a stand-alone tool providing characterization of analyzed cDNA-microarray data obtained from spike-in experiments, was developed and used to evaluate 252 normalization methods. For all analyses, the sensitivities at low false positive rates were observed together with estimates of the overall bias and the standard deviation. In general, there was a trade-off between the ability of the analyses to identify differentially expressed genes (i.e. the analyses' sensitivities) and their ability to provide unbiased estimators of the desired ratios. Virtually all analysis underestimated the magnitude of the regulations; often less than 50% of the true regulations were observed. Moreover, the bias depended on the underlying mRNA-concentration; low concentration resulted in high bias. Many of the analyses had relatively low sensitivities, but analyses that used either the constrained model (i.e. a procedure that combines data from several scans) or partial filtration (a novel method for treating data from so-called not-found spots) had with few exceptions high sensitivities. These methods gave considerable higher sensitivities than some commonly used analysis methods. Conclusion: The use of spike-in experiments is a powerful approach for evaluating microarray preprocessing procedures. Analyzed data are characterized by properties of the observed log-ratios and the analysis' ability to detect differentially expressed genes. If bias is not a major problem; we recommend the use of either the CM-procedure or partial filtration.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Systematic Comparison of False-Discovery-Rate-Controlling Strategies for Proteogenomic Search Using Spike-in Experiments
    Li, Honglan
    Park, Jonghun
    Kim, Hyunwoo
    Hwang, Kyu-Baek
    Paek, Eunok
    JOURNAL OF PROTEOME RESEARCH, 2017, 16 (06) : 2231 - 2239
  • [32] Normalization for triple-target microarray experiments
    Martin-Magniette, Marie-Laure
    Aubert, Julie
    Bar-Hen, Avner
    Elftieh, Samira
    Magniette, Frederic
    Renou, Jean-Pierre
    Daudin, Jean-Jacques
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [33] Normalization for triple-target microarray experiments
    Marie-Laure Martin-Magniette
    Julie Aubert
    Avner Bar-Hen
    Samira Elftieh
    Frederic Magniette
    Jean-Pierre Renou
    Jean-Jacques Daudin
    BMC Bioinformatics, 9
  • [34] Experiments using microarray technology: limitations and standard operating procedures
    Forster, T
    Roy, D
    Ghazal, P
    JOURNAL OF ENDOCRINOLOGY, 2003, 178 (02) : 195 - 204
  • [35] Understanding the physics of oligonucleotide microarrays: the Affymetrix spike-in data reanalysed
    Burden, Conrad J.
    PHYSICAL BIOLOGY, 2008, 5 (01)
  • [36] SILAC Surrogates: Rescue of Quantitative Information for Orphan Analytes in Spike-In SILAC Experiments
    Gilmore, Jason M.
    Milloy, Jeffrey A.
    Gerber, Scott A.
    ANALYTICAL CHEMISTRY, 2013, 85 (22) : 10812 - 10819
  • [37] Benchmarking RNA-seq differential expression analysis methods using spike-in and simulation data
    Baik, Bukyung
    Yoon, Sora
    Nam, Dougu
    PLOS ONE, 2020, 15 (04):
  • [38] Evaluation of BCR spike-in controls using DriverMap adaptive immune receptor (AIR) profiling technology
    Chenchik, Alex
    Liu, Tianbing
    Makhanov, Mikhail
    Hu, Dongfang
    Ghias, Khadija
    Diehl, Paul
    CANCER RESEARCH, 2024, 84 (06)
  • [39] Transformation and normalization of oligonucleotide microarray data
    Geller, SC
    Gregg, JP
    Hagerman, P
    Rocke, DM
    BIOINFORMATICS, 2003, 19 (14) : 1817 - 1823
  • [40] Iterative normalization of cDNA microarray data
    Wang, Y
    Lu, JP
    Lee, R
    Gu, ZP
    Clarke, R
    IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2002, 6 (01): : 29 - 37