Normalization of oligonucleotide arrays based on the least-variant set of genes

被引:41
作者
Calza, Stefano [1 ,2 ]
Valentini, Davide [1 ]
Pawitan, Yudi [1 ]
机构
[1] Karolinska Inst, Dept Med Epidemiol & Biostat, Stockholm, Sweden
[2] Univ Brescia, Dept Biomed Sci & Biotechnol, I-25121 Brescia, Italy
关键词
D O I
10.1186/1471-2105-9-140
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: It is well known that the normalization step of microarray data makes a difference in the downstream analysis. All normalization methods rely on certain assumptions, so differences in results can be traced to different sensitivities to violation of the assumptions. Illustrating the lack of robustness, in a striking spike-in experiment all existing normalization methods fail because of an imbalance between up- and down-regulated genes. This means it is still important to develop a normalization method that is robust against violation of the standard assumptions Results: We develop a new algorithm based on identification of the least-variant set (LVS) of genes across the arrays. The array-to-array variation is evaluated in the robust linear model fit of prenormalized probe-level data. The genes are then used as a reference set for a non-linear normalization. The method is applicable to any existing expression summaries, such as MAS5 or RMA. Conclusion: We show that LVS normalization outperforms other normalization methods when the standard assumptions are not satisfied. In the complex spike-in study, LVS performs similarly to the ideal (in practice unknown) housekeeping-gene normalization. An R package called Ivs is available in http://www.meb.ki.se/similar to yudpaw.
引用
收藏
页数:11
相关论文
共 34 条
  • [1] *AFF, 2002, STAT ALG DESCR DOC
  • [2] [Anonymous], SPIE BIOS
  • [3] A comparison of normalization methods for high density oligonucleotide array data based on variance and bias
    Bolstad, BM
    Irizarry, RA
    Åstrand, M
    Speed, TP
    [J]. BIOINFORMATICS, 2003, 19 (02) : 185 - 193
  • [4] BOLSTAD BM, 2006, AFFYPLM METHODS FITT
  • [5] Quantification of mRNA using real-time reverse transcription PCR (RT-PCR): trends and problems
    Bustin, SA
    [J]. JOURNAL OF MOLECULAR ENDOCRINOLOGY, 2002, 29 (01) : 23 - 39
  • [6] Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset
    Choe, SE
    Boutros, M
    Michelson, AM
    Church, GM
    Halfon, MS
    [J]. GENOME BIOLOGY, 2005, 6 (02)
  • [7] A benchmark for affymetrix GeneChip expression measures
    Cope, LM
    Irizarry, RA
    Jaffee, HA
    Wu, ZJ
    Speed, TP
    [J]. BIOINFORMATICS, 2004, 20 (03) : 323 - 331
  • [8] Data management and analysis for gene expression arrays
    Ermolaeva, O
    Rastogi, M
    Pruitt, KD
    Schuler, GD
    Bittner, ML
    Chen, YD
    Simon, R
    Meltzer, P
    Trent, JM
    Boguski, MS
    [J]. NATURE GENETICS, 1998, 20 (01) : 19 - 23
  • [9] β-actin and GAPDH housekeeping gene expression in asthmatic airways is variable and not suitable for normalising mRNA levels
    Glare, EM
    Divjak, M
    Bailey, MJ
    Walters, EH
    [J]. THORAX, 2002, 57 (09) : 765 - 770
  • [10] Gene expression profiling of Duchenne muscular dystrophy skeletal muscle
    Haslett, JN
    Sanoudou, D
    Kho, AT
    Han, M
    Bennett, RR
    Kohane, IS
    Beggs, AH
    Kunkel, LM
    [J]. NEUROGENETICS, 2003, 4 (04) : 163 - 171