Dimension reduction and estimation in the secondary analysis of case-control studies

被引:0
作者
Liang, Liang [1 ]
Carroll, Raymond [2 ,3 ]
Ma, Yanyuan [4 ]
机构
[1] Harvard Univ, Dept Biostat, Boston, MA 02115 USA
[2] Texas A&M Univ, Dept Stat, 3143 TAMU, College Stn, TX 77843 USA
[3] Univ Technol Sydney, Sch Math & Phys Sci, POB 123, Broadway, NSW 2007, Australia
[4] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
基金
美国国家科学基金会;
关键词
Biased samples; case-control study; dimension reduction; heteroscedastic error; secondary analysis; semiparametric estimation; REGRESSION; MODEL; ASSOCIATION; PHENOTYPE; INFERENCE; DESIGNS; MOMENT; ROBUST;
D O I
10.1214/18-EJS1446
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Studying the relationship between covariates based on retrospective data is the main purpose of secondary analysis, an area of increasing interest. We examine the secondary analysis problem when multiple covariates are available, while only a regression mean model is specified. Despite the completely parametric modeling of the regression mean function, the case-control nature of the data requires special treatment and semiparametric efficient estimation generates various nonparametric estimation problems with multivariate covariates. We devise a dimension reduction approach that fits with the specified primary and secondary models in the original problem setting, and use reweighting to adjust for the case-control nature of the data, even when the disease rate in the source population is unknown. The resulting estimator is both locally efficient and robust against the misspecification of the regression error distribution, which can be heteroscedastic as well as non-Gaussian. We demonstrate the advantage of our method over several existing methods, both analytically and numerically.
引用
收藏
页码:1782 / 1821
页数:40
相关论文
共 50 条
[21]   A general regression framework for a secondary outcome in case-control studies [J].
Tchetgen, Eric J. Tchetgen .
BIOSTATISTICS, 2014, 15 (01) :117-128
[22]   ESTIMATION OF RELATIVE RISK IN CASE-CONTROL STUDIES THROUGH LOGISTIC-REGRESSION ANALYSIS [J].
AGGARWAL, AR ;
SINGH, P .
BIOMETRICAL JOURNAL, 1993, 35 (04) :479-485
[23]   Semiparametrically efficient estimation in quantile regression of secondary analysis [J].
Liang, Liang ;
Ma, Yanyuan ;
Wei, Ying ;
Carroll, Raymond J. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2018, 80 (04) :625-648
[24]   Estimation of genetic heritabilities of human traits in case-control studies [J].
Hu, Die ;
Chen, Shuyue ;
Zhang, Hong .
ANNALS OF HUMAN GENETICS, 2023, 87 (03) :91-103
[25]   Sparse estimation for case-control studies with multiple disease subtypes [J].
Ballout, Nadim ;
Garcia, Cedric ;
Viallon, Vivian .
BIOSTATISTICS, 2021, 22 (04) :738-755
[26]   A pseudolikelihood approach for assessing genetic association in case-control studies with unmeasured population structure [J].
Chen, Yong ;
Liang, Kung-Yee ;
Tong, Pan ;
Beaty, Terri H. ;
Barnes, Kathleen C. ;
Kao, W. H. Linda .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (11) :3153-3165
[27]   A semiparametric efficient estimator in case-control studies for gene-environment independent models [J].
Liang, Liang ;
Ma, Yanyuan ;
Carroll, Raymond J. .
JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 173 :38-50
[28]   Power and replication in case-control studies [J].
Lalouel, LM ;
Rohrwasser, A .
AMERICAN JOURNAL OF HYPERTENSION, 2002, 15 (02) :201-205
[29]   Secondary analysis of case-control association studies: Insights on weighting-based inference motivate a new specification test [J].
Li, Fan ;
Allen, Andrew S. .
STATISTICS IN MEDICINE, 2020, 39 (22) :2869-2882
[30]   Efficient estimation of indirect effects in case-control studies using a unified likelihood framework [J].
Satten, Glen A. ;
Curtis, Sarah W. ;
Solis-Lemus, Claudia ;
Leslie, Elizabeth J. ;
Epstein, Michael P. .
STATISTICS IN MEDICINE, 2022, 41 (15) :2879-2893