Haplotype-based association analysis in cohort and nested case-control studies

被引:9
作者
Chen, JB [1 ]
Chatterjee, N [1 ]
机构
[1] NCI, Biostat Branch, Div Canc Epidemiol & Genet, Rockville, MD 20852 USA
关键词
cohort study; Cox proportional hazards model; nested case-control study; unphased genotype data;
D O I
10.1111/j.1541-0420.2005.00406.x
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genetic epidemiologic studies often collect genotype data at multiple loci within a genomic region of interest from a sample of unrelated individuals. One popular method for analyzing such data is to assess whether haplotypes, i.e., the arrangements of alleles along individual chromosomes, are associated with the disease phenotype or not. For many study subjects, however, the exact haplotype configuration on the pair of homologous chromosomes cannot be derived with certainty from the available locus-specific genotype data (phase ambiguity). In this article, we consider estimating haplotype-specific association parameters in the Cox proportional hazards model, using genotype, environmental exposure, and the disease endpoint data collected from cohort or nested case-control studies. We study alternative Expectation-Maximization algorithms for estimating haplotype frequencies from cohort and nested case-control studies. Based on a hazard function of the disease derived from the observed genotype data, we then propose a semiparametric method for joint estimation of relative-risk parameters and the cumulative baseline hazard function. The method is greatly simplified under a rare disease assumption, for which an asymptotic variance estimator is also proposed. The performance of the proposed estimators is assessed via simulation studies. An application of the proposed method is presented, using data from the Alpha-Tocopherol, Beta-Carotene Cancer Prevention Study.
引用
收藏
页码:28 / 35
页数:8
相关论文
共 27 条
[1]   Methods for the analysis of sampled cohort data in the Cox proportional hazards model [J].
Borgan, O ;
Goldstein, L ;
Langholz, B .
ANNALS OF STATISTICS, 1995, 23 (05) :1749-1778
[2]   NONPARAMETRIC-ESTIMATION OF RELATIVE MORTALITY FROM NESTED CASE-CONTROL STUDIES [J].
BORGAN, O ;
LANGHOLZ, B .
BIOMETRICS, 1993, 49 (02) :593-602
[3]   Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease [J].
Botstein, D ;
Risch, N .
NATURE GENETICS, 2003, 33 (Suppl 3) :228-237
[4]   COVARIANCE ANALYSIS OF CENSORED SURVIVAL DATA [J].
BRESLOW, N .
BIOMETRICS, 1974, 30 (01) :89-99
[5]  
BREWLOW NE, 1972, J ROYAL STAT SOC B, V34, P216
[6]  
COX DR, 1972, J R STAT SOC B, V34, P187
[7]   Inference on haplotype effects in case-control studies using unphased genotype data [J].
Epstein, MP ;
Satten, GA .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (06) :1316-1329
[8]  
EXCOFFIER L, 1995, MOL BIOL EVOL, V12, P921
[9]   Accuracy of haplotype frequency estimation for biallelic loci, via the expectation-maximization algorithm for unphased diploid genotype data [J].
Fallin, D ;
Schork, NJ .
AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 67 (04) :947-959
[10]   UNIQUE CONSISTENT SOLUTION TO LIKELIHOOD EQUATIONS [J].
FOUTZ, RV .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1977, 72 (357) :147-148