GWASinspector: comprehensive quality control of genome-wide association study results

被引:16
作者
Ani, Alireza [1 ,2 ]
van der Most, Peter J. [1 ]
Snieder, Harold [1 ]
Vaez, Ahmad [1 ,2 ]
Nolte, Ilja M. [1 ]
机构
[1] Univ Groningen, Dept Epidemiol, Univ Med Ctr Groningen, NL-9700 RB Groningen, Netherlands
[2] Isfahan Univ Med Sci, Dept Bioinformat, Esfahan 8174673461, Iran
关键词
R PACKAGE;
D O I
10.1093/bioinformatics/btaa1084
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Quality control (QC) of genome wide association study (GWAS) result files has become increasingly difficult due to advances in genomic technology. The main challenges include continuous increases in the number of polymorphic genetic variants contained in recent GWASs and reference panels, the rising number of cohorts participating in a GWAS consortium, and inclusion of new variant types. Here, we present GWASinspector, a flexible R package for comprehensive QC of GWAS results. This package is compatible with recent imputation reference panels, handles insertion/deletion and multi-allelic variants, provides extensive QC reports and efficiently processes big data files. Reference panels covering three human genome builds (NCBI36, GRCh37 and GRCh38) are available. GWASinspector has a user friendly design and allows easy set-up of the QC pipeline through a configuration file. In addition to checking and reporting on individual files, it can be used in preparation of a meta-analysis by testing for systemic differences between studies and generating cleaned, harmonized GWAS files. Comparison with existing GWAS QC tools shows that the main advantages of GWASinspector are its ability to more effectively deal with insertion/deletion and multi-allelic variants and its relatively low memory use.
引用
收藏
页码:129 / 130
页数:2
相关论文
共 6 条
[1]   Genetic analysis of over 1 million people identifies 535 new loci associated with blood pressure traits [J].
Evangelou, Evangelos ;
Warren, Helen R. ;
Mosen-Ansorena, David ;
Mifsu, Borbala ;
Pazoki, Raha ;
Gao, He ;
Ntritsos, Georgios ;
Dimou, Niki ;
Cabrer, Claudia P. ;
Karaman, Ibrahim ;
Ng, FuLiang ;
Evangelou, Marina ;
Witkowska, Katarzyna ;
Tzanis, Evan ;
Hellwege, Jacklyn N. ;
Giri, Ayush ;
Edwards, Digna R. Velez ;
Sun, Yan, V ;
Cho, Kelly ;
Gaziano, J. Michael ;
Wilson, Peter W. F. ;
Tsao, Philip S. ;
Kovesdy, Csaba P. ;
Esko, Tonu ;
Magi, Reedik ;
Milani, Lili ;
Almgren, Peter ;
Boutin, Thibaud ;
Debette, Stephanie ;
Ding, Jun ;
Giulianini, Franco ;
Holliday, Elizabeth G. ;
Jackson, Anne U. ;
Li-Gao, Ruifang ;
Lin, Wei-Yu ;
Luan, Jian'an ;
Mangino, Massimo ;
Oldmeadow, Christopher ;
Prins, Bram Peter ;
Qian, Yong ;
Sargurupremraj, Muralidharan ;
Shah, Nabi ;
Surendran, Praveen ;
Theriault, Sebastien ;
Verweij, Niek ;
Willems, Sara M. ;
Zhao, Jing-Hua ;
Amouyel, Philippe ;
Connell, John ;
de Mutsert, Renee .
NATURE GENETICS, 2018, 50 (10) :1412-+
[2]   GWAtoolbox: an R package for fast quality control and handling of genome-wide association studies meta-analysis data [J].
Fuchsberger, Christian ;
Taliun, Daniel ;
Pramstaller, Peter P. ;
Pattaro, Cristian .
BIOINFORMATICS, 2012, 28 (03) :444-445
[3]   GWASTools: an R/Bioconductor package for quality control and analysis of genome-wide association studies [J].
Gogarten, Stephanie M. ;
Bhangale, Tushar ;
Conomos, Matthew P. ;
Laurie, Cecelia A. ;
McHugh, Caitlin P. ;
Painter, Ian ;
Zheng, Xiuwen ;
Crosslin, David R. ;
Levine, David ;
Lumley, Thomas ;
Nelson, Sarah C. ;
Rice, Kenneth ;
Shen, Jess ;
Swarnkar, Rohit ;
Weir, Bruce S. ;
Laurie, Cathy C. .
BIOINFORMATICS, 2012, 28 (24) :3329-3331
[4]   Genetic loci associated with heart rate variability and their effects on cardiac disease risk [J].
Nolte, Ilja M. ;
Munoz, M. Loretto ;
Tragante, Vinicius ;
Amare, Azmeraw T. ;
Jansen, Rick ;
Vaez, Ahmad ;
von der Heyde, Benedikt ;
Avery, Christy L. ;
Bis, Joshua C. ;
Dierckx, Bram ;
van Dongen, Jenny ;
Gogarten, Stephanie M. ;
Goyette, Philippe ;
Hernesniemi, Jussi ;
Huikari, Ville ;
Hwang, Shih-Jen ;
Jaju, Deepali ;
Kerr, Kathleen F. ;
Kluttig, Alexander ;
Krijthe, Bouwe P. ;
Kumar, Jitender ;
van der Laan, Sander W. ;
Lyytikainen, Leo-Pekka ;
Maihofer, Adam X. ;
Minassian, Arpi ;
van der Most, Peter J. ;
Mueller-Nurasyid, Martina ;
Nivard, Michel ;
Salvi, Erika ;
Stewart, James D. ;
Thayer, Julian F. ;
Verweij, Niek ;
Wong, Andrew ;
Zabaneh, Delilah ;
Zafarmand, Mohammad H. ;
Abdellaoui, Abdel ;
Albarwani, Sulayma ;
Albert, Christine ;
Alonso, Alvaro ;
Ashar, Foram ;
Auvinen, Juha ;
Axelsson, Tomas ;
Baker, Dewleen G. ;
de Bakker, Paul I. W. ;
Barcella, Matteo ;
Bayoumi, Riad ;
Bieringa, Rob J. ;
Boomsma, Dorret ;
Boucher, Gabrielle ;
Britton, Annie R. .
NATURE COMMUNICATIONS, 2017, 8
[5]   QCGWAS: A flexible R package for automated quality control of genome-wide association results [J].
van der Most, Peter J. ;
Vaez, Ahmad ;
Prins, Bram P. ;
Munoz, M. Loretto ;
Snieder, Harold ;
Alizadeh, Behrooz Z. ;
Nolte, Ilja M. .
BIOINFORMATICS, 2014, 30 (08) :1185-1186
[6]   Quality control and conduct of genome-wide association meta-analyses [J].
Winkler, Thomas W. ;
Day, Felix R. ;
Croteau-Chonka, Damien C. ;
Wood, Andrew R. ;
Locke, Adam E. ;
Maegi, Reedik ;
Ferreira, Teresa ;
Fall, Tove ;
Graff, Mariaelisa ;
Justice, Anne E. ;
Luan, Jian'an ;
Gustafsson, Stefan ;
Randall, Joshua C. ;
Vedantam, Sailaja ;
Workalemahu, Tsegaselassie ;
Kilpelainen, Tuomas O. ;
Scherag, Andre ;
Esko, Tonu ;
Kutalik, Zoltan ;
Heid, Iris M. ;
Loos, Ruth J. F. .
NATURE PROTOCOLS, 2014, 9 (05) :1192-1212