A two-step method for variable selection in the analysis of a case-cohort study

被引:10
|
作者
Newcombe, P. J. [1 ]
Connolly, S. [1 ]
Seaman, S. [1 ]
Richardson, S. [1 ]
Sharp, S. J. [2 ]
机构
[1] MRC Biostat Unit, Cambridge, England
[2] MRC Epidemiol Unit, Cambridge, England
基金
英国医学研究理事会;
关键词
Case-cohort study; survival analysis; variable selection; Bayesian variable selection; type; 2; diabetes; fatty acids; MODEL EXPLORATION; STOCHASTIC SEARCH; COX REGRESSION; ASSOCIATION; LASSO; DESIGN;
D O I
10.1093/ije/dyx224
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: Accurate detection and estimation of true exposure-outcome associations is important in aetiological analysis; when there are multiple potential exposure variables of interest, methods for detecting the subset of variables most likely to have true associations with the outcome of interest are required. Case-cohort studies often collect data on a large number of variables which have not been measured in the entire cohort (e.g. panels of biomarkers). There is a lack of guidance on methods for variable selection in case-cohort studies. Methods: We describe and explore the application of three variable selection methods to data from a case-cohort study. These are: (i) selecting variables based on their level of significance in univariable (i.e. one-at-a-time) Prentice- weighted Cox regression models; (ii) stepwise selection applied to Prentice-weighted Cox regression; and (iii) a two-step method which applies a Bayesian variable selection algorithm to obtain posterior probabilities of selection for each variable using multivariable logistic regression followed by effect estimation using Prentice-weighted Cox regression. Results: Across nine different simulation scenarios, the two-step method demonstrated higher sensitivity and lower false discovery rate than the one-at-a-time and stepwise methods. In an application of the methods to data from the EPIC-InterAct case-cohort study, the two-step method identified an additional two fatty acids as being associated with incident type 2 diabetes, compared with the one-at-a-time and stepwise methods. Conclusions: The two-step method enables more powerful and accurate detection of exposure-outcome associations in case-cohort studies. An R package is available to enable researchers to apply this method.
引用
收藏
页码:597 / 604
页数:8
相关论文
共 50 条
  • [1] Regression analysis for secondary response variable in a case-cohort study
    Pan, Yinghao
    Cai, Jianwen
    Kim, Sangmi
    Zhou, Haibo
    BIOMETRICS, 2018, 74 (03) : 1014 - 1022
  • [2] Variable selection for case-cohort studies with failure time outcome
    Ni, Ai
    Cai, Jianwen
    Zeng, Donglin
    BIOMETRIKA, 2016, 103 (03) : 547 - 562
  • [3] Two-step clustering analysis in researches: A case study
    Kayri, Murat
    EURASIAN JOURNAL OF EDUCATIONAL RESEARCH, 2007, 7 (28): : 89 - 99
  • [4] The variable two-step BDF method for parabolic equations
    Georgios Akrivis
    Minghua Chen
    Jianxing Han
    Fan Yu
    Zhimin Zhang
    BIT Numerical Mathematics, 2024, 64
  • [5] The variable two-step BDF method for parabolic equations
    Akrivis, Georgios
    Chen, Minghua
    Han, Jianxing
    Yu, Fan
    Zhang, Zhimin
    BIT NUMERICAL MATHEMATICS, 2024, 64 (01)
  • [6] Bi-level variable selection for case-cohort studies with group variables
    Kim, Soyoung
    Ahn, Kwang Woo
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2019, 28 (10-11) : 3404 - 3414
  • [7] Variable selection for case-cohort studies with informatively interval-censored outcomes
    Du, Mingyue
    Zhao, Xingqiu
    Sun, Jianguo
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2022, 172
  • [8] A two-step approach for variable selection in linear regression with measurement error
    Song, Jiyeon
    Shin, Seung Jun
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2019, 26 (01) : 47 - 55
  • [9] A regularized variable selection procedure in additive hazards model with stratified case-cohort design
    Ni, Ai
    Cai, Jianwen
    LIFETIME DATA ANALYSIS, 2018, 24 (03) : 443 - 463
  • [10] A regularized variable selection procedure in additive hazards model with stratified case-cohort design
    Ai Ni
    Jianwen Cai
    Lifetime Data Analysis, 2018, 24 : 443 - 463