Maximum likelihood estimation of haplotype effects and haplotype-environment interactions in association studies

被引:111
作者
Lin, DY
Zeng, D
Millikan, R
机构
[1] Univ N Carolina, Dept Biostat, Chapel Hill, NC 27599 USA
[2] Univ N Carolina, Dept Epidemiol, Chapel Hill, NC 27599 USA
关键词
case-control studies; cohort studies; complex diseases; EM algorithm; gene-environment interactions; haplotype analysis; Hardy-Weinberg equilibrium; linkage disequilibrium; profile likelihood; retrospective likelihood; SNPs; unphased genotype;
D O I
10.1002/gepi.20098
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The associations between haplotypes and disease phenotypes offer valuable clues about the genetic determinants of complex diseases. It is highly challenging to make statistical inferences about these associations because of the unknown gametic phase in genotype data. We describe a general likelihood-based approach to inferring haplotype-disease associations in studies of unrelated individuals. We consider all possible phenotypes (including disease indicator, quantitative trait, and potentially censored age at onset of disease) and all commonly used study designs (including cross-sectional, case-control, cohort, nested case-control, and case-cohort). The effects of haplotypes on phenotype are characterized by appropriate regression models, which allow various genetic mechanisms and gene-environment interactions. We present the likelihood functions for all study designs and disease phenotypes under Hardy-Weinberg disequilibrium. The corresponding maximum likelihood estimators are approximately unbiased, normally distributed, and statistically efficient. We provide simple and efficient numerical algorithms to calculate the maximum likelihood estimators and their variances, and implement these algorithms in a freely available computer program. Extensive simulation studies demonstrate that the proposed methods perform well in realistic situations. An application to the Carolina Breast Cancer Study reveals significant haplotype effects and haplotype-smoking interactions in the development of breast cancer.
引用
收藏
页码:299 / 312
页数:14
相关论文
共 39 条
[1]  
Akaike H., 1998, A Celebration ofStatistics, P387, DOI DOI 10.1007/978-1-4613-8560-8_1
[2]   Haplotypes vs single marker linkage disequilibrium tests:: what do we gain? (Reprinted European Journal of Human Genetics, Vol 4, pg 291-300, 2001) [J].
Akey, Joshua ;
Jin, Li ;
Xiong, Momiao .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2017, 25 :S51-S58
[3]  
[Anonymous], 2002, ANAL LONGITUDINAL DA
[4]   Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease [J].
Botstein, D ;
Risch, N .
NATURE GENETICS, 2003, 33 (Suppl 3) :228-237
[5]  
CLARK AG, 1990, MOL BIOL EVOL, V7, P111
[6]  
COX DR, 1972, J R STAT SOC B, V34, P187
[7]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[8]   Inference on haplotype effects in case-control studies using unphased genotype data [J].
Epstein, MP ;
Satten, GA .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (06) :1316-1329
[9]  
EXCOFFIER L, 1995, MOL BIOL EVOL, V12, P921
[10]   Genetic analysis of case/control data using estimated haplotype frequencies: Application to APOE locus variation and Alzheimer's disease [J].
Fallin, D ;
Cohen, A ;
Essioux, L ;
Chumakov, I ;
Blumenfeld, M ;
Cohen, D ;
Schork, NJ .
GENOME RESEARCH, 2001, 11 (01) :143-151