GWASTools: an R/Bioconductor package for quality control and analysis of genome-wide association studies

被引:144
作者
Gogarten, Stephanie M. [1 ]
Bhangale, Tushar [1 ,2 ]
Conomos, Matthew P. [1 ]
Laurie, Cecelia A. [1 ]
McHugh, Caitlin P. [1 ]
Painter, Ian [3 ]
Zheng, Xiuwen [1 ]
Crosslin, David R. [1 ]
Levine, David [1 ]
Lumley, Thomas [1 ,4 ]
Nelson, Sarah C. [1 ]
Rice, Kenneth [1 ]
Shen, Jess [1 ]
Swarnkar, Rohit [1 ]
Weir, Bruce S. [1 ]
Laurie, Cathy C. [1 ]
机构
[1] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[2] Genentech Inc, Dept Bioinformat & Computat Biol, San Francisco, CA 94080 USA
[3] Univ Washington, Dept Hlth Serv, Seattle, WA 98195 USA
[4] Univ Auckland, Dept Stat, Auckland 1, New Zealand
基金
美国国家卫生研究院;
关键词
MOSAICISM;
D O I
10.1093/bioinformatics/bts610
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
SUMMARY: GWASTools is an R/Bioconductor package for quality control and analysis of genome-wide association studies (GWAS). GWASTools brings the interactive capability and extensive statistical libraries of R to GWAS. Data are stored in NetCDF format to accommodate extremely large datasets that cannot fit within R's memory limits. The documentation includes instructions for converting data from multiple formats, including variants called from sequencing. GWASTools provides a convenient interface for linking genotypes and intensity data with sample and single nucleotide polymorphism annotation. Availability and implementation: GWASTools is implemented in R and is available from Bioconductor (http://www.bioconductor.org). An extensive vignette detailing a recommended work flow is included.
引用
收藏
页码:3329 / 3331
页数:3
相关论文
共 8 条
[1]   An R package for analysis of whole-genome association studies [J].
Clayton, David ;
Leung, Hin-Tak .
HUMAN HEREDITY, 2007, 64 (01) :45-51
[2]   Mechanisms of mosaicism, chimerism and uniparental disomy identified by single nucleotide polymorphism array analysis [J].
Conlin, Laura K. ;
Thiel, Brian D. ;
Bonnemann, Carsten G. ;
Medne, Livija ;
Ernst, Linda M. ;
Zackai, Elaine H. ;
Deardorff, Matthew A. ;
Krantz, Ian D. ;
Hakonarson, Hakon ;
Spinner, Nancy B. .
HUMAN MOLECULAR GENETICS, 2010, 19 (07) :1263-1275
[3]   The Gene, Environment Association Studies Consortium (GENEVA): Maximizing the Knowledge Obtained from GWAS by Collaboration Across Studies of Multiple Conditions [J].
Cornelis, Marilyn C. ;
Agrawal, Arpana ;
Cole, John W. ;
Hansel, Nadia N. ;
Barnes, Kathleen C. ;
Beaty, Terri H. ;
Bennett, Siiri N. ;
Bierut, Laura J. ;
Boerwinkle, Eric ;
Doheny, Kimberly F. ;
Feenstra, Bjarke ;
Feingold, Eleanor ;
Fornage, Myriam ;
Haiman, Christopher A. ;
Harris, Emily L. ;
Hayes, M. Geoffrey ;
Heit, John A. ;
Hu, Frank B. ;
Kang, Jae H. ;
Laurie, Cathy C. ;
Ling, Hua ;
Manolio, Teri A. ;
Marazita, Mary L. ;
Mathias, Rasika A. ;
Mirel, Daniel B. ;
Paschall, Justin ;
Pasquale, Louis R. ;
Pugh, Elizabeth W. ;
Rice, John P. ;
Udren, Jenna ;
van Dam, Rob M. ;
Wang, Xiaojing ;
Wiggs, Janey L. ;
Williams, Kayleen ;
Yu, Kai .
GENETIC EPIDEMIOLOGY, 2010, 34 (04) :364-372
[4]   Detectable clonal mosaicism from birth to old age and its relationship to cancer [J].
Laurie, Cathy C. ;
Laurie, Cecelia A. ;
Rice, Kenneth ;
Doheny, Kimberly F. ;
Zelnick, Leila R. ;
McHugh, Caitlin P. ;
Ling, Hua ;
Hetrick, Kurt N. ;
Pugh, Elizabeth W. ;
Amos, Chris ;
Wei, Qingyi ;
Wang, Li-E ;
Lee, Jeffrey E. ;
Barnes, Kathleen C. ;
Hansel, Nadia N. ;
Mathias, Rasika ;
Daley, Denise ;
Beaty, Terri H. ;
Scott, Alan F. ;
Ruczinski, Ingo ;
Scharpf, Rob B. ;
Bierut, Laura J. ;
Hartz, Sarah M. ;
Landi, Maria Teresa ;
Freedman, Neal D. ;
Goldin, Lynn R. ;
Ginsburg, David ;
Li, Jun ;
Desch, Karl C. ;
Strom, Sara S. ;
Blot, William J. ;
Signorello, Lisa B. ;
Ingles, Sue A. ;
Chanock, Stephen J. ;
Berndt, Sonja I. ;
Le Marchand, Loic ;
Henderson, Brian E. ;
Monroe, Kristine R. ;
Heit, John A. ;
de Andrade, Mariza ;
Armasu, Sebastian M. ;
Regnier, Cynthia ;
Lowe, William L. ;
Hayes, M. Geoffrey ;
Marazita, Mary L. ;
Feingold, Eleanor ;
Murray, Jeffrey C. ;
Melbye, Mads ;
Feenstra, Bjarke ;
Kang, Jae H. .
NATURE GENETICS, 2012, 44 (06) :642-U58
[5]   Quality Control and Quality Assurance in Genotypic Data for Genome-Wide Association Studies [J].
Laurie, Cathy C. ;
Doheny, Kimberly F. ;
Mirel, Daniel B. ;
Pugh, Elizabeth W. ;
Bierut, Laura J. ;
Bhangale, Tushar ;
Boehm, Frederick ;
Caporaso, Neil E. ;
Cornelis, Marilyn C. ;
Edenberg, Howard J. ;
Gabriel, Stacy B. ;
Harris, Emily L. ;
Hu, Frank B. ;
Jacobs, Kevin B. ;
Kraft, Peter ;
Landi, Maria Teresa ;
Lumley, Thomas ;
Manolio, Teri A. ;
McHugh, Caitlin ;
Painter, Ian ;
Paschall, Justin ;
Rice, John P. ;
Rice, Kenneth M. ;
Zheng, Xiuwen ;
Weir, Bruce S. .
GENETIC EPIDEMIOLOGY, 2010, 34 (06) :591-602
[6]   High-resolution genomic profiling of chromosomal aberrations using Infinium whole-genome genotyping [J].
Peiffer, Daniel A. ;
Le, Jennie M. ;
Steemers, Frank J. ;
Chang, Weihua ;
Jenniges, Tony ;
Garcia, Francisco ;
Haden, Kirt ;
Li, Jiangzhen ;
Shaw, Chad A. ;
Belmont, John ;
Cheung, Sau Wai ;
Shen, Richard M. ;
Barker, David L. ;
Gunderson, Kevin L. .
GENOME RESEARCH, 2006, 16 (09) :1136-1148
[7]   A faster circular binary segmentation algorithm for the analysis of array CGH data [J].
Venkatraman, E. S. ;
Olshen, Adam B. .
BIOINFORMATICS, 2007, 23 (06) :657-663
[8]   A high-performance computing toolset for relatedness and principal component analysis of SNP data [J].
Zheng, Xiuwen ;
Levine, David ;
Shen, Jess ;
Gogarten, Stephanie M. ;
Laurie, Cathy ;
Weir, Bruce S. .
BIOINFORMATICS, 2012, 28 (24) :3326-3328