Quality control, imputation and analysis of genome-wide genotyping data from the Illumina HumanCoreExome microarray

Cited by: 54
Authors
Coleman, Jonathan R. I. [1]
Euesden, Jack [2]
Patel, Hamel [2, 3]
Folarin, Amos A. [4]
Newhouse, Stephen [4]
Breen, Gerome [2, 5]
Affiliations
[1] MRC Social Genet & Dev Psychiat Ctr SGDP, London, England
[2] SGDP, London, England
[3] South London & Maudsley NHS Trust, Natl Inst Hlth Res, Biomed Res Ctr Mental Hlth, Bioinformat Core, London, England
[4] NIHR, BRC MH, Bioinformat Core, London, England
[5] NIHR, BRC MH, Genom & Biomarkers & BioResource Mental & Neurol, London, England
Keywords
GWAS; methods; low-coverage microarray; imputation; analysis; ASSOCIATION; MODEL; PLINK;
DOI
10.1093/bfgp/elv037
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
The decreasing cost of performing genome-wide association studies has made genomics widely accessible. However, there is a paucity of guidance for best practice in conducting such analyses. For the results of a study to be valid and replicable, multiple biases must be addressed in the course of data preparation and analysis. In addition, standardizing methods across small, independent studies would increase comparability and the potential for effective meta-analysis. This article provides a discussion of important aspects of quality control, imputation and analysis of genome-wide data from a low-coverage microarray, as well as a straightforward guide to performing a genome-wide association study. A detailed protocol is provided online, with example scripts available at https://github.com/JoniColeman/gwas_scripts.
Pages: 298-304
Number of pages: 7
Related papers
50 in total
  • [1] Accuracy of imputation to infer unobserved APOE epsilon alleles in genome-wide genotyping data
    Farid Radmanesh
    William J Devan
    Christopher D Anderson
    Jonathan Rosand
    Guido J Falcone
    European Journal of Human Genetics, 2014, 22 : 1239 - 1242
  • [2] Effect of Genome-Wide Genotyping and Reference Panels on Rare Variants Imputation
    Zheng, Hou-Feng
    Ladouceur, Martin
    Greenwood, Celia M. T.
    Richards, J. Brent
    JOURNAL OF GENETICS AND GENOMICS, 2012, 39 (10) : 545 - 550
  • [3] Quality Control Procedures for Genome-Wide Association Studies
    Truong, Van Q.
    Woerner, Jakob A.
    Cherlin, Tess A.
    Bradford, Yuki
    Lucas, Anastasia M.
    Okeh, Chelsea C.
    Shivakumar, Manu K.
    Hui, Daniel H.
    Kumar, Rachit
    Pividori, Milton
    Jones, S. Chris
    Bossa, Abigail C.
    Turner, Stephen D.
    Ritchie, Marylyn D.
    Verma, Shefali S.
    CURRENT PROTOCOLS, 2022, 2 (11):
  • [4] Odyssey: a semi-automated pipeline for phasing, imputation, and analysis of genome-wide genetic data
    Eller, Ryan J.
    Janga, Sarath C.
    Walsh, Susan
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [5] The effect of genome-wide association scan quality control on imputation outcome for common variants
    Southam, Lorraine
    Panoutsopoulou, Kalliope
    Rayner, N. William
    Chapman, Kay
    Durrant, Caroline
    Ferreira, Teresa
    Arden, Nigel
    Carr, Andrew
    Deloukas, Panos
    Doherty, Michael
    Loughlin, John
    McCaskie, Andrew
    Ollier, William E. R.
    Ralston, Stuart
    Spector, Timothy D.
    Valdes, Ana M.
    Wallis, Gillian A.
    Wilkinson, J. Mark
    Marchini, Jonathan
    Zeggini, Eleftheria
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2011, 19 (05) : 610 - 614
  • [6] The effect of genome-wide association scan quality control on imputation outcome for common variants
    Lorraine Southam
    Kalliope Panoutsopoulou
    N William Rayner
    Kay Chapman
    Caroline Durrant
    Teresa Ferreira
    Nigel Arden
    Andrew Carr
    Panos Deloukas
    Michael Doherty
    John Loughlin
    Andrew McCaskie
    William E R Ollier
    Stuart Ralston
    Timothy D Spector
    Ana M Valdes
    Gillian A Wallis
    J Mark Wilkinson
    Jonathan Marchini
    Eleftheria Zeggini
    European Journal of Human Genetics, 2011, 19 : 610 - 614
  • [7] Efficient genome-wide genotyping strategies and data integration in crop plants
    Torkamaneh, Davoud
    Boyle, Brian
    Belzile, Francois
    THEORETICAL AND APPLIED GENETICS, 2018, 131 (03) : 499 - 511
  • [8] Quality Control and Quality Assurance in Genotypic Data for Genome-Wide Association Studies
    Laurie, Cathy C.
    Doheny, Kimberly F.
    Mirel, Daniel B.
    Pugh, Elizabeth W.
    Bierut, Laura J.
    Bhangale, Tushar
    Boehm, Frederick
    Caporaso, Neil E.
    Cornelis, Marilyn C.
    Edenberg, Howard J.
    Gabriel, Stacy B.
    Harris, Emily L.
    Hu, Frank B.
    Jacobs, Kevin B.
    Kraft, Peter
    Landi, Maria Teresa
    Lumley, Thomas
    Manolio, Teri A.
    McHugh, Caitlin
    Painter, Ian
    Paschall, Justin
    Rice, John P.
    Rice, Kenneth M.
    Zheng, Xiuwen
    Weir, Bruce S.
    GENETIC EPIDEMIOLOGY, 2010, 34 (06) : 591 - 602
  • [9] A tutorial on conducting genome-wide association studies: Quality control and statistical analysis
    Marees, Andries T.
    de Kluiver, Hilde
    Stringer, Sven
    Vorspan, Florence
    Curis, Emmanuel
    Marie-Claire, Cynthia
    Derks, Eske M.
    INTERNATIONAL JOURNAL OF METHODS IN PSYCHIATRIC RESEARCH, 2018, 27 (02)