FVGWAS: Fast voxelwise genome wide association analysis of large-scale imaging genetic data

被引:44
作者
Huang, Meiyan [1 ]
Nichols, Thomas [2 ]
Huang, Chao [3 ,4 ]
Yu, Yang [5 ]
Lu, Zhaohua [3 ,4 ]
Knickmeyer, Rebecca C. [6 ]
Feng, Qianjin [1 ]
Zhu, Hongtu [3 ,4 ]
机构
[1] Southern Med Univ, Sch Biomed Engn, Guangzhou 510515, Guangdong, Peoples R China
[2] Univ Warwick, Dept Stat, Coventry CV4 7AL, W Midlands, England
[3] Univ N Carolina, Dept Biostat, Chapel Hill, NC 27599 USA
[4] Univ N Carolina, Biomed Res Imaging Ctr, Chapel Hill, NC 27599 USA
[5] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
[6] Univ N Carolina, Dept Psychiat, Chapel Hill, NC 27599 USA
基金
美国国家卫生研究院; 加拿大健康研究院;
关键词
Computational complexity; Family-wise error rate; Heteroscedastic linear model; Voxelwise genome wide association; Wild bootstrap; QUANTITATIVE TRAIT LOCI; ALZHEIMERS-DISEASE; NEUROIMAGING PHENOTYPES; STATISTICAL-ANALYSIS; REGRESSION-MODELS; COMMON VARIANTS; BRAIN STRUCTURE; RANDOM-FIELD; SET; EQUATIONS;
D O I
10.1016/j.neuroimage.2015.05.043
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
More and more large-scale imaging genetic studies are being widely conducted to collect a rich set of imaging, genetic, and clinical data to detect putative genes for complexly inherited neuropsychiatric and neurodegenerative disorders. Several major big-data challenges arise from testing genome-wide (N-C > 12 million known variants) associations with signals at millions of locations (N-V similar to 106) in the brain from thousands of subjects (n similar to 103). The aim of this paper is to develop a Fast Voxelwise Genome Wide Association analysiS (FVGWAS) framework to efficiently carry out whole-genome analyses of whole-brain data. FVGWAS consists of three components including a heteroscedastic linear model, a global sure independence screening (GSIS) procedure, and a detection procedure based on wild bootstrap methods. Specifically, for standard linear association, the computational complexity is O (nN(V)N(C)) for voxelwise genome wide association analysis (VGWAS) method compared with O ((N-C + N-V)n(2)) for FVGWAS. Simulation studies show that FVGWAS is an efficient method of searching sparse signals in an extremely large search space, while controlling for the family-wise error rate. Finally, we have successfully applied FVGWAS to a large-scale imaging genetic data analysis of ADNI data with 708 subjects, 193,275 voxels in RAVENS maps, and 501,584 SNPs, and the total processing time was 203,645 s for a single CPU. Our FVGWAS may be a valuable statistical toolbox for large-scale imaging genetic analysis as the field is rapidly advancing with ultra-high-resolution imaging and whole-genome sequencing. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:613 / 627
页数:15
相关论文
共 58 条
[1]   Haploview: analysis and visualization of LD and haplotype maps [J].
Barrett, JC ;
Fry, B ;
Maller, J ;
Daly, MJ .
BIOINFORMATICS, 2005, 21 (02) :263-265
[2]   Relation between variants in the neurotrophin receptor gene, NTRK3, and white matter integrity in healthy young adults [J].
Braskie, Meredith N. ;
Kohannim, Omid ;
Jahanshad, Neda ;
Chiang, Ming-Chang ;
Barysheva, Marina ;
Toga, Arthur W. ;
Ringman, John M. ;
Montgomery, Grant W. ;
McMahon, Katie L. ;
de Zubicaray, Greig I. ;
Martin, Nicholas G. ;
Wright, Margaret J. ;
Thompson, Paul M. .
NEUROIMAGE, 2013, 82 :146-153
[3]   Common Alzheimer's Disease Risk Variant within the CLU Gene Affects White Matter Microstructure in Young Adults [J].
Braskie, Meredith N. ;
Jahanshad, Neda ;
Stein, Jason L. ;
Barysheva, Marina ;
McMahon, Katie L. ;
de Zubicaray, Greig I. ;
Martin, Nicholas G. ;
Wright, Margaret J. ;
Ringman, John M. ;
Toga, Arthur W. ;
Thompson, Paul M. .
JOURNAL OF NEUROSCIENCE, 2011, 31 (18) :6764-6770
[4]   A TWO-SAMPLE TEST FOR HIGH-DIMENSIONAL DATA WITH APPLICATIONS TO GENE-SET TESTING [J].
Chen, Song Xi ;
Qin, Ying-Li .
ANNALS OF STATISTICS, 2010, 38 (02) :808-835
[5]   Voxel-based morphometry using the RAVENS maps: Methods and validation using simulated longitudinal atrophy [J].
Davatzikos, C ;
Genc, A ;
Xu, DR ;
Resnick, SM .
NEUROIMAGE, 2001, 14 (06) :1361-1369
[6]   Multiple hypothesis testing in microarray experiments [J].
Dudoit, S ;
Shaffer, JP ;
Boldrick, JC .
STATISTICAL SCIENCE, 2003, 18 (01) :71-103
[7]   Imaging genetics in ADHD [J].
Durston, Sarah .
NEUROIMAGE, 2010, 53 (03) :832-838
[8]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883
[9]   SURE INDEPENDENCE SCREENING IN GENERALIZED LINEAR MODELS WITH NP-DIMENSIONALITY [J].
Fan, Jianqing ;
Song, Rui .
ANNALS OF STATISTICS, 2010, 38 (06) :3567-3604
[10]   The structure of haplotype blocks in the human genome [J].
Gabriel, SB ;
Schaffner, SF ;
Nguyen, H ;
Moore, JM ;
Roy, J ;
Blumenstiel, B ;
Higgins, J ;
DeFelice, M ;
Lochner, A ;
Faggart, M ;
Liu-Cordero, SN ;
Rotimi, C ;
Adeyemo, A ;
Cooper, R ;
Ward, R ;
Lander, ES ;
Daly, MJ ;
Altshuler, D .
SCIENCE, 2002, 296 (5576) :2225-2229