A pseudolikelihood approach for assessing genetic association in case-control studies with unmeasured population structure

被引:2
作者
Chen, Yong [1 ]
Liang, Kung-Yee [2 ]
Tong, Pan [3 ]
Beaty, Terri H. [4 ]
Barnes, Kathleen C. [5 ]
Kao, W. H. Linda [4 ]
机构
[1] Univ Penn, Dept Biostat Epidemiol & Informat, Philadelphia, PA 19104 USA
[2] Natl Hlth Res Inst, Zhunan, Taiwan
[3] Univ Texas Houston, Dept Bioinformat & Computat Biol, Houston, TX USA
[4] Johns Hopkins Univ, Dept Epidemiol, Baltimore, MD USA
[5] Univ Colorado Denver, Anschutz Med Campus, Aurora, CO USA
基金
美国医疗保健研究与质量局;
关键词
Case-control study; latent class model; population substructure; pseudolikelihood; two stage estimation; STAGE RENAL-DISEASE; MAXIMUM-LIKELIHOOD-ESTIMATION; ASYMPTOTIC-BEHAVIOR; LINKAGE ANALYSIS; GENOMIC CONTROL; STRATIFICATION; MODEL; INFERENCE; LOCI; REGRESSION;
D O I
10.1177/0962280220921212
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
The case-control study design is one of the main tools for detecting associations between genetic markers and diseases. It is well known that population substructure can lead to spurious association between disease status and a genetic marker if the prevalence of disease and the marker allele frequency vary across subpopulations. In this paper, we propose a novel statistical method to estimate the association in case-control studies with unmeasured population substructure. The proposed method takes two steps. First, the information on genomic markers and disease status is used to infer the population substructure; second, the association between the disease and the test marker adjusting for the population substructure is modeled and estimated parametrically through polytomous logistic regression. The performance of the proposed method, relative to the existing methods, on bias, coverage probability and computational time, is assessed through simulations. The method is applied to an end-stage renal disease study in African Americans population.
引用
收藏
页码:3153 / 3165
页数:13
相关论文
共 45 条
[1]  
AITKIN M, 1985, J ROY STAT SOC B MET, V47, P67
[2]   LIKELIHOOD OF A MODEL AND INFORMATION CRITERIA [J].
AKAIKE, H .
JOURNAL OF ECONOMETRICS, 1981, 16 (01) :3-14
[3]  
Arminger Gerhard., 1995, HDB STAT MODELING SO
[4]   TESTS FOR LINEAR TRENDS IN PROPORTIONS AND FREQUENCIES [J].
ARMITAGE, P .
BIOMETRICS, 1955, 11 (03) :375-386
[5]   The power of genomic control [J].
Bacanu, SA ;
Devlin, B ;
Roeder, K .
AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 66 (06) :1933-1944
[6]   A METHOD FOR QUANTIFYING DIFFERENTIATION BETWEEN POPULATIONS AT MULTI-ALLELIC LOCI AND ITS IMPLICATIONS FOR INVESTIGATING IDENTITY AND PATERNITY [J].
BALDING, DJ ;
NICHOLS, RA .
GENETICA, 1995, 96 (1-2) :3-12
[7]   Latent variable regression for multiple discrete outcomes [J].
Bandeen-Roche, K ;
Miglioretti, DL ;
Zeger, SL ;
Rathouz, PJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (440) :1375-1386
[8]   Population stratification and spurious allelic association [J].
Cardon, LR ;
Palmer, LJ .
LANCET, 2003, 361 (9357) :598-604
[9]   On the asymptotic behaviour of the pseudolikelihood ratio test statistic with boundary problems [J].
Chen, Yong ;
Liang, Kung-Yee .
BIOMETRIKA, 2010, 97 (03) :603-620
[10]   EFFECTIVENESS OF ADJUSTMENT BY SUBCLASSIFICATION IN REMOVING BIAS IN OBSERVATIONAL STUDIES [J].
COCHRAN, WG .
BIOMETRICS, 1968, 24 (02) :295-&