A Latent Variable Partial Least Squares Path Modeling Approach to Regional Association and Polygenic Effect with Applications to a Human Obesity Study

被引:11
作者
Xue, Fuzhong [1 ,2 ,3 ]
Li, Shengxu [4 ]
Luan, Jian'an [2 ,3 ]
Yuan, Zhongshang [1 ]
Luben, Robert N. [5 ]
Khaw, Kay-Tee [6 ]
Wareham, Nicholas J. [2 ,3 ]
Loos, Ruth J. F. [2 ,3 ]
Zhao, Jing Hua [2 ,3 ]
机构
[1] Shandong Univ, Sch Publ Hlth, Dept Epidemiol & Hlth Stat, Jinan 250100, Peoples R China
[2] MRC Epidemiol Unit, Cambridge, England
[3] Inst Metab Sci, Cambridge, England
[4] Tulane Univ, Sch Publ Hlth & Trop Med, Dept Epidemiol, New Orleans, LA USA
[5] Univ Cambridge, Dept Publ Hlth & Primary Care, Strangeways Res Lab, Cambridge, England
[6] Univ Cambridge, Sch Clin Med, Clin Gerontol Unit, Cambridge, England
基金
中国国家自然科学基金; 英国医学研究理事会;
关键词
GENOME-WIDE ASSOCIATION; QUANTITATIVE TRAIT LOCI; DISEASE; RISK; GENE; MARKERS; SUSCEPTIBILITY; POPULATION; PARTICIPANTS; REGRESSION;
D O I
10.1371/journal.pone.0031927
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genetic association studies are now routinely used to identify single nucleotide polymorphisms (SNPs) linked with human diseases or traits through single SNP-single trait tests. Here we introduced partial least squares path modeling (PLSPM) for association between single or multiple SNPs and a latent trait that can involve single or multiple correlated measurement(s). Furthermore, the framework naturally provides estimators of polygenic effect by appropriately weighting trait-attributing alleles. We conducted computer simulations to assess the performance via multiple SNPs and human obesity-related traits as measured by body mass index (BMI), waist and hip circumferences. Our results showed that the associate statistics had type I error rates close to nominal level and were powerful for a range of effect and sample sizes. When applied to 12 candidate regions in data (N = 2,417) from the European Prospective Investigation of Cancer (EPIC)-Norfolk study, a region in FTO was found to have stronger association (rs7204609 similar to rs9939881 at the first intron P = 4.29 x 10(-7)) than single SNP analysis (all with P>10(-4)) and a latent quantitative phenotype was obtained using a subset sample of EPIC-Norfolk(N = 12,559). We believe our method is appropriate for assessment of regional association and polygenic effect on a single or multiple traits.
引用
收藏
页数:10
相关论文
共 44 条
[1]  
[Anonymous], 1993, An introduction to the bootstrap
[2]  
[Anonymous], 2009, BMC P, DOI DOI 10.1186/1753-6561-3-S7-S135
[3]   Gene- or Region-Based Analysis of Genome-Wide Association Studies [J].
Beyene, Joseph ;
Tritchler, David ;
Asimit, Jennifer L. ;
Hamid, Jemila S. .
GENETIC EPIDEMIOLOGY, 2009, 33 :S105-S110
[4]   Identification of Association Between Disease and Multiple Markers Via Sparse Partial Least-Squares Regression [J].
Chun, Hyonho ;
Ballard, David H. ;
Cho, Judy ;
Zhao, Hongyu .
GENETIC EPIDEMIOLOGY, 2011, 35 (06) :479-486
[5]  
Coin LJM, 2010, NAT METHODS, V7, P541, DOI [10.1038/NMETH.1466, 10.1038/nmeth.1466]
[6]  
DAVISON AC, 2003, BOOTSTRAP METHODS TH
[7]  
Day N, 1999, BRIT J CANCER, V80, P95
[8]  
Esposito Vinzi., 2010, Handbook of Partial Least Squares - Concepts, Methods and Applications
[9]  
Glaz J, 2008, SCAN STAT METHODS ST
[10]   A behavioural genomic analysis of DNA markers associated with general cognitive ability in 7-year-olds [J].
Harlaar, N ;
Butcher, LM ;
Meaburn, E ;
Sham, P ;
Craig, IW ;
Plomin, R .
JOURNAL OF CHILD PSYCHOLOGY AND PSYCHIATRY, 2005, 46 (10) :1097-1107