An Analysis Pipeline for Genome-wide Association Studies

被引:0
作者
Stefanov, Stefan [1 ]
Lautenberger, James [2 ]
Gold, Bert [1 ]
机构
[1] Natl Canc Inst Frederick, Expt Immunol Lab, Human Genet Sect, Frederick, MD 21702 USA
[2] Natl Canc Inst Frederick, Lab Genom Divers, Frederick, MD 21702 USA
基金
美国国家卫生研究院;
关键词
single nucleotide polymorphism; SNP; genetic association; GWAS; genetic epidemiology;
D O I
暂无
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
We developed an efficient pipeline to analyze genome-wide association study single nucleotide polymorphism scan results. Perl scripts were used to convert genotypes called using the BRLMM algorithm into a modified PB format. We computed summary statistics characteristic of our case and control populations including allele counts, missing values, heterozygosity, measures of compliance with Hardy-Weinberg equilibrium, and several population difference statistics. In addition, we computed association tests, including exact tests of association for genotypes, alleles, the Cochran-Armitage linear trend test, and dominant, recessive, and overdominant models at every single nucleotide polymorphism (SNP). In addition, pairwise linkage disequilbrium statistics were elaborated, using the command line version of Haplo View, which was possible by writing a reformatting script. Additional Perl scripts permit loading the results into a MySQL database conjoined with a Generic Genome Browser (gbrowse) for comprehensive visualization. This browser incorporates a download feature that provides actual case and control genotypes to users in associated genomic regions. Thus, re-analysis "on the fly" is possible for casual browser users from anywhere on the Internet.
引用
收藏
页码:455 / +
页数:7
相关论文
共 17 条
[1]  
Affymetrix, 2008, AFF POW TOOLS APT RE
[2]  
Cover T. M., 2006, ELEMENTS INFORM THEO, DOI [DOI 10.1002/047174882X, DOI 10.1002/047174882X.CH5]
[3]  
Genome.gov, 2008, CATALOG PUBLISHED GE
[4]   Genome-wide association study provides evidence for a breast cancer risk locus at 6q22-33 [J].
Gold, Bert ;
Kirchhoff, Tomas ;
Stefanov, Stefan ;
Lautenberger, James ;
Viale, Agnes ;
Garber, Judy ;
Friedman, Eitan ;
Narod, Steven ;
Olshen, Adam B. ;
Gregersen, Peter ;
Kosarin, Kristi ;
Olsh, Adam ;
Bergeron, Julie ;
Ellis, Nathan A. ;
Klein, Robert J. ;
Clark, Andrew G. ;
Norton, Larry ;
Dean, Michael ;
Boyd, Jeff ;
Offit, Kenneth .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (11) :4340-4345
[5]   Information-theoretic analysis of neural coding [J].
Johnson, DH ;
Gruner, CM ;
Baggerly, K ;
Seshagiri, C .
JOURNAL OF COMPUTATIONAL NEUROSCIENCE, 2001, 10 (01) :47-69
[6]   GENETIC DISTANCE BETWEEN POPULATIONS [J].
NEI, M .
AMERICAN NATURALIST, 1972, 106 (949) :283-+
[7]  
NEI M, 1978, GENETICS, V89, P583
[8]   Analysis of genetic variation in Ashkenazi Jews by high density SNP genotyping [J].
Olshen, Adam B. ;
Gold, Bert ;
Lohmueller, Kirk E. ;
Struewing, Jeffery P. ;
Satagopan, Jaya ;
Stefanov, Stefan A. ;
Eskin, Eleazar ;
Kirchhoff, Tomas ;
Lautenberger, James A. ;
Klein, Robert J. ;
Friedman, Eitan ;
Norton, Larry ;
Ellis, Nathan A. ;
Viale, Agnes ;
Lee, Catherine S. ;
Borgen, Patrick I. ;
Clark, Andrew G. ;
Offit, Kenneth ;
Boyd, Jeff .
BMC GENETICS, 2008, 9 (1)
[9]   PLINK: A tool set for whole-genome association and population-based linkage analyses [J].
Purcell, Shaun ;
Neale, Benjamin ;
Todd-Brown, Kathe ;
Thomas, Lori ;
Ferreira, Manuel A. R. ;
Bender, David ;
Maller, Julian ;
Sklar, Pamela ;
de Bakker, Paul I. W. ;
Daly, Mark J. ;
Sham, Pak C. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (03) :559-575
[10]   A genotype calling algorithm for affymetrix SNP arrays [J].
Rabbee, N ;
Speed, TP .
BIOINFORMATICS, 2006, 22 (01) :7-12