Semiparametric estimation in the secondary analysis of case-control studies

Citations: 13
Authors
Ma, Yanyuan [1 ,2 ]
Carroll, Raymond J. [2 ]
Affiliations
[1] Univ S Carolina, Columbia, SC 29208 USA
[2] Texas A&M Univ, College Stn, TX USA
Funding
National Science Foundation (USA);
Keywords
Biased samples; Case-control study; Heteroscedastic regression; Secondary analysis; Semiparametric estimation; HETEROCYCLIC AMINES; INFERENCE; RISK; ASSOCIATION; REGRESSION; PHENOTYPE; CANCER; MODELS; ROBUST;
DOI
10.1111/rssb.12107
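Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Subject classification codes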
020208 ; 070103 ; 0714 ;
Abstract
We study the regression relationship between covariates in case-control data: an area known as the secondary analysis of case-control studies. The context is such that only the form of the regression mean is specified, so that we allow an arbitrary regression error distribution, which can depend on the covariates and thus can be heteroscedastic. Under mild regularity conditions we establish the theoretical identifiability of such models. Previous work in this context has either specified a fully parametric distribution for the regression errors, specified a homoscedastic distribution for the regression errors, specified the rate of disease in the population (we refer to this as the true population) or made a rare disease approximation. We construct a class of semiparametric estimation procedures that rely on none of these. The estimators differ from the usual semiparametric estimators in that they draw conclusions about the true population, while technically operating in a hypothetical superpopulation. We also construct estimators with a unique feature, in that they are robust against the misspecification of the regression error distribution in terms of variance structure, whereas all other non-parametric effects are estimated despite the biased samples. We establish the asymptotic properties of the estimators and illustrate their finite sample performance through simulation studies, as well as through an empirical example on the relationship between red meat consumption and heterocyclic amines. Our analysis verified the positive relationship between red meat consumption and two forms of heterocyclic amines, indicating that increased red meat consumption leads to increased levels of MeIQx and PhIP, both being risk factors for colorectal cancer. Computer software as well as data to illustrate the methodology are available from http://www.stat.tamu.edu/~carroll/matlabprograms/software.php.
Pages: 127 - 151
Page count: 25
Related papers
50 in total
  • [1] Dimension reduction and estimation in the secondary analysis of case-control studies
    Liang, Liang
    Carroll, Raymond
    Ma, Yanyuan
    ELECTRONIC JOURNAL OF STATISTICS, 2018, 12 (01) : 1782 - 1821
  • [2] Improved Semiparametric Analysis of Polygenic Gene-Environment Interactions in Case-Control Studies
    Wang, Tianying
    Asher, Alex
    STATISTICS IN BIOSCIENCES, 2021, 13 (03) : 386 - 401
  • [3] Semiparametric analysis of complex polygenic gene-environment interactions in case-control studies
    Stalder, Odile
    Asher, Alex
    Liang, Liang
    Carroll, Raymond J.
    Ma, Yanyuan
    Chatterjee, Nilanjan
    BIOMETRIKA, 2017, 104 (04) : 801 - 812
  • [4] Robust estimation for homoscedastic regression in the secondary analysis of case-control data
    Wei, Jiawei
    Carroll, Raymond J.
    Mueller, Ursula U.
    Van Keilegom, Ingrid
    Chatterjee, Nilanjan
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2013, 75 (01) : 185 - 206
  • [5] CONTROL FUNCTION ASSISTED IPW ESTIMATION WITH A SECONDARY OUTCOME IN CASE-CONTROL STUDIES
    Sofer, Tamar
    Cornelis, Marilyn C.
    Kraft, Peter
    Tchetgen, Eric J. Tchetgen
    STATISTICA SINICA, 2017, 27 (02) : 785 - 804
  • [6] Unified Analysis of Secondary Traits in Case-Control Association Studies
    Ghosh, Arpita
    Wright, Fred A.
    Zou, Fei
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (502) : 566 - 576
  • [7] Semiparametric analysis for case-control studies: a partial smoothing spline approach
    Kim, Young-Ju
    JOURNAL OF APPLIED STATISTICS, 2010, 37 (06) : 1015 - 1025
  • [8] A semiparametric efficient estimator in case-control studies for gene-environment independent models
    Liang, Liang
    Ma, Yanyuan
    Carroll, Raymond J.
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 173 : 38 - 50
  • [9] A new semiparametric procedure for matched case-control studies with missing covariates
    Sinha, Samiran
    Wang, Suojin
    JOURNAL OF NONPARAMETRIC STATISTICS, 2009, 21 (07) : 889 - 905
  • [10] Robust Estimation for Secondary Trait Association in Case-Control Genetic Studies
    Tapsoba, Jean de Dieu
    Kooperberg, Charles
    Reiner, Alexander
    Wang, Ching-Yun
    Dai, James Y.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2014, 179 (10) : 1264 - 1272