Impact of the spotted microarray preprocessing method on fold-change compression and variance stability

被引:26
作者
Ambroise, Jerome [1 ]
Bearzatto, Bertrand [2 ]
Robert, Annie [3 ]
Govaerts, Bernadette [4 ]
Macq, Benoit [1 ]
Gala, Jean-Luc [2 ]
机构
[1] Catholic Univ Louvain, ICTEAM Inst, ELEN Dept, B-1348 Louvain, Belgium
[2] Catholic Univ Louvain, IREC Inst, Ctr Appl Mol Technol, B-1200 Brussels, Belgium
[3] Catholic Univ Louvain, IREC Inst, Dept Epidemiol & Biostat, B-1200 Brussels, Belgium
[4] Catholic Univ Louvain, Inst Stat Biotstat & Actuarial Sci, B-1348 Louvain, Belgium
关键词
BACKGROUND CORRECTION; NORMALIZATION; ACCURACY; REPRODUCIBILITY; TRANSFORMATION; PERFORMANCE; PLATFORMS;
D O I
10.1186/1471-2105-12-413
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The standard approach for preprocessing spotted microarray data is to subtract the local background intensity from the spot foreground intensity, to perform a log2 transformation and to normalize the data with a global median or a lowess normalization. Although well motivated, standard approaches for background correction and for transformation have been widely criticized because they produce high variance at low intensities. Whereas various alternatives to the standard background correction methods and to log2 transformation were proposed, impacts of both successive preprocessing steps were not compared in an objective way. Results: In this study, we assessed the impact of eight preprocessing methods combining four background correction methods and two transformations (the log2 and the glog), by using data from the MAQC study. The current results indicate that most preprocessing methods produce fold-change compression at low intensities. Fold-change compression was minimized using the Standard and the Edwards background correction methods coupled with a log2 transformation. The drawback of both methods is a high variance at low intensities which consequently produced poor estimations of the p-values. On the other hand, effective stabilization of the variance as well as better estimations of the p-values were observed after the glog transformation. Conclusion: As both fold-change magnitudes and p-values are important in the context of microarray class comparison studies, we therefore recommend to combine the Edwards correction with a hybrid transformation method that uses the log2 transformation to estimate fold-change magnitudes and the glog transformation to estimate p-values.
引用
收藏
页数:12
相关论文
共 31 条
[1]   Microarray data analysis: from disarray to consolidation and consensus [J].
Allison, DB ;
Cui, XQ ;
Page, GP ;
Sabripour, M .
NATURE REVIEWS GENETICS, 2006, 7 (01) :55-65
[2]   Gene expression omnibus: Microarray data storage, submission, retrieval, and analysis [J].
Barrett, Tanya ;
Edgar, Ron .
DNA MICROARRAYS, PART B: DATABASES AND STATISTICS, 2006, 411 :352-369
[3]  
Cui X, 2003, STAT APPL GENET MOL, V2, P1009
[4]   Importance of pre-analytical steps for transcriptome and RT-qPCR analyses in the context of the phase II randomised multicentre trial REMAGUS02 of neoadjuvant chemotherapy in breast cancer patients [J].
de Cremoux, Patricia ;
Valet, Fabien ;
Gentien, David ;
Lehmann-Che, Jacqueline ;
Scott, Veronique ;
Tran-Perennou, Carine ;
Barbaroux, Catherine ;
Servant, Nicolas ;
Vacher, Sophie ;
Sigal-Zafrani, Brigitte ;
Mathieu, Marie-Christine ;
Bertheau, Philippe ;
Guinebretiere, Jean-Marc ;
Asselain, Bernard ;
Marty, Michel ;
Spyratos, Frederique .
BMC CANCER, 2011, 11
[5]  
Dudoit S, 2002, STAT SINICA, V12, P111
[6]   Estimation of transformation parameters for microarray data [J].
Durbin, B ;
Rocke, DM .
BIOINFORMATICS, 2003, 19 (11) :1360-1367
[7]  
Durbin B P, 2002, Bioinformatics, V18 Suppl 1, pS105
[8]   Non-linear normalization and background correction in one-channel cDNA microarray studies [J].
Edwards, D .
BIOINFORMATICS, 2003, 19 (07) :825-833
[9]   Bioconductor: open software development for computational biology and bioinformatics [J].
Gentleman, RC ;
Carey, VJ ;
Bates, DM ;
Bolstad, B ;
Dettling, M ;
Dudoit, S ;
Ellis, B ;
Gautier, L ;
Ge, YC ;
Gentry, J ;
Hornik, K ;
Hothorn, T ;
Huber, W ;
Iacus, S ;
Irizarry, R ;
Leisch, F ;
Li, C ;
Maechler, M ;
Rossini, AJ ;
Sawitzki, G ;
Smith, C ;
Smyth, G ;
Tierney, L ;
Yang, JYH ;
Zhang, JH .
GENOME BIOLOGY, 2004, 5 (10)
[10]   Improvement in the reproducibility and accuracy of DNA microarray quantification by optimizing hybridization conditions [J].
Han, Tao ;
Melvin, Cathy D. ;
Shi, Leming ;
Branham, William S. ;
Moland, Carrie L. ;
Pine, P. Scott ;
Thompson, Karol L. ;
Fuscoe, James C. .
BMC BIOINFORMATICS, 2006, 7 (Suppl 2)