Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data

Cited by: 167
Authors
Carvalho, Benilton
Bengtsson, Henrik
Speed, Terence P.
Irizarry, Rafael A. [1]
Affiliations
[1] Johns Hopkins Univ, Dept Biostat, Baltimore, MD 21205 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[3] Walter & Eliza Hall Inst Med Res, Div Genet & Bioinformat, Melbourne, Vic, Australia
Keywords
Affymetrix; genotyping; high-throughput; microarrays
DOI
10.1093/biostatistics/kxl042
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
In most microarray technologies, a number of critical steps are required to convert raw intensity measurements into the data relied upon by data analysts, biologists, and clinicians. These data manipulations, referred to as preprocessing, can influence the quality of the ultimate measurements. In the last few years, the high-throughput measurement of gene expression has been the most popular application of microarray technology. For this application, various groups have demonstrated that the use of modern statistical methodology can substantially improve the accuracy and precision of gene expression measurements, relative to ad hoc procedures introduced by designers and manufacturers of the technology. Other applications of microarrays are now becoming increasingly popular. In this paper, we describe a preprocessing methodology for a technology designed to identify DNA sequence variants in specific genes or regions of the human genome that are associated with phenotypes of interest, such as disease. In particular, we describe a methodology for preprocessing Affymetrix single-nucleotide polymorphism chips and obtaining genotype calls from the preprocessed data. We demonstrate how our procedure improves on existing approaches using data from 3 relatively large studies, including one in which large numbers of independent calls are available. The proposed methods are implemented in the package oligo, available from Bioconductor.
Pages: 485-499
Number of pages: 15