A two-step method for variable selection in the analysis of a case-cohort study

被引:10
|
作者
Newcombe, P. J. [1 ]
Connolly, S. [1 ]
Seaman, S. [1 ]
Richardson, S. [1 ]
Sharp, S. J. [2 ]
机构
[1] MRC Biostat Unit, Cambridge, England
[2] MRC Epidemiol Unit, Cambridge, England
基金
英国医学研究理事会;
关键词
Case-cohort study; survival analysis; variable selection; Bayesian variable selection; type; 2; diabetes; fatty acids; MODEL EXPLORATION; STOCHASTIC SEARCH; COX REGRESSION; ASSOCIATION; LASSO; DESIGN;
D O I
10.1093/ije/dyx224
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: Accurate detection and estimation of true exposure-outcome associations is important in aetiological analysis; when there are multiple potential exposure variables of interest, methods for detecting the subset of variables most likely to have true associations with the outcome of interest are required. Case-cohort studies often collect data on a large number of variables which have not been measured in the entire cohort (e.g. panels of biomarkers). There is a lack of guidance on methods for variable selection in case-cohort studies. Methods: We describe and explore the application of three variable selection methods to data from a case-cohort study. These are: (i) selecting variables based on their level of significance in univariable (i.e. one-at-a-time) Prentice- weighted Cox regression models; (ii) stepwise selection applied to Prentice-weighted Cox regression; and (iii) a two-step method which applies a Bayesian variable selection algorithm to obtain posterior probabilities of selection for each variable using multivariable logistic regression followed by effect estimation using Prentice-weighted Cox regression. Results: Across nine different simulation scenarios, the two-step method demonstrated higher sensitivity and lower false discovery rate than the one-at-a-time and stepwise methods. In an application of the methods to data from the EPIC-InterAct case-cohort study, the two-step method identified an additional two fatty acids as being associated with incident type 2 diabetes, compared with the one-at-a-time and stepwise methods. Conclusions: The two-step method enables more powerful and accurate detection of exposure-outcome associations in case-cohort studies. An R package is available to enable researchers to apply this method.
引用
收藏
页码:597 / 604
页数:8
相关论文
共 50 条
  • [21] A Two-Step Bayesian Approach for Propensity Score Analysis: Simulations and Case Study
    David Kaplan
    Jianshen Chen
    Psychometrika, 2012, 77 : 581 - 609
  • [22] A Novel Two-step Sparse Learning Approach for Variable Selection and Optimal Predictive Modeling
    Liu, Yiren
    Qin, S. Joe
    IFAC PAPERSONLINE, 2022, 55 (07): : 57 - 64
  • [23] Two-step variable selection in partially linear additive models with time series data
    Feng, Mu
    Chen, Zhao
    Cheng, Ximing
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2018, 47 (03) : 661 - 671
  • [24] Two-Step Calibration Estimator with Double Use of Auxiliary Variable: Method and Application
    Alka, Singh
    Rai, Piyush Kant
    Qasim, Muhammad
    JIRSS-JOURNAL OF THE IRANIAN STATISTICAL SOCIETY, 2022, 21 (01): : 37 - 54
  • [25] Solution of the heat equation with variable properties by two-step Adomian decomposition method
    Marwat, D. N. Khan
    Asghar, S.
    MATHEMATICAL AND COMPUTER MODELLING, 2008, 48 (1-2) : 83 - 90
  • [26] Using the Whole Cohort in the Analysis of Case-Cohort Data
    Breslow, Norman E.
    Lumley, Thomas
    Ballantyne, Christie M.
    Chambless, Lloyd E.
    Kulich, Michal
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2009, 169 (11) : 1398 - 1405
  • [27] A Two-Step quantile selection model for the safety analysis at signalized intersections
    Xu, Xuecai
    Li, Y. C.
    Wong, S. C.
    Zhu, Feng
    JOURNAL OF TRANSPORTATION SAFETY & SECURITY, 2020, 12 (04) : 547 - 565
  • [28] A two-step optimization approach for technology selection
    Utturwar, A.
    Rallabhandi, S.
    DeLaurentis, D.
    Mavris, D.
    ENGINEERING OPTIMIZATION, 2006, 38 (08) : 889 - 908
  • [29] Two-step cross-validation selection method for partially linear models
    Avramidis, P
    STATISTICA SINICA, 2005, 15 (04) : 1033 - 1048
  • [30] A two-step machine learning approach for dynamic model selection: A case study on a micro milling process
    Cruz, Yarens J.
    Rivas, Marcelino
    Quiza, Ramon
    Haber, Rodolfo E.
    Castano, Fernando
    Villalonga, Alberto
    COMPUTERS IN INDUSTRY, 2022, 143