Conditional screening for ultrahigh-dimensional survival data in case-cohort studies

被引:0
作者
Jing Zhang
Haibo Zhou
Yanyan Liu
Jianwen Cai
机构
[1] Zhongnan University of Economics and Law,School of Statistics and Mathematics
[2] University of North Carolina at Chapel Hill,Department of Biostatistics
[3] Wuhan University,School of Mathematics and Statistics
来源
Lifetime Data Analysis | 2021年 / 27卷
关键词
Case-cohort design; Conditional screening; Sure screening property; Survival data; Ultrahigh-dimensional data; Weighted estimating equation;
D O I
暂无
中图分类号
学科分类号
摘要
The case-cohort design has been widely used to reduce the cost of covariate measurements in large cohort studies. In many such studies, the number of covariates is very large, and the goal of the research is to identify active covariates which have great influence on response. Since the introduction of sure independence screening, screening procedures have achieved great success in terms of effectively reducing the dimensionality and identifying active covariates. However, commonly used screening methods are based on marginal correlation or its variants, they may fail to identify hidden active variables which are jointly important but are weakly correlated with the response. Moreover, these screening methods are mainly proposed for data under the simple random sampling and can not be directly applied to case-cohort data. In this paper, we consider the ultrahigh-dimensional survival data under the case-cohort design, and propose a conditional screening method by incorporating some important prior known information of active variables. This method can effectively detect hidden active variables. Furthermore, it possesses the sure screening property under some mild regularity conditions and does not require any complicated numerical optimization. We evaluate the finite sample performance of the proposed method via extensive simulation studies and further illustrate the new approach through a real data set from patients with breast cancer.
引用
收藏
页码:632 / 661
页数:29
相关论文
共 162 条
  • [1] Andersen PK(1982)Cox’s regression model for counting processes: a large sample study Ann Statis 10 1100-1120
  • [2] Gill RD(1994)Robust variance estimation for the case-cohort design Biometrics 50 1064-1072
  • [3] Barlow WE(2016)Conditional sure independence screening J Am Stat Assoc 111 1266-1277
  • [4] Barut E(2000)Exposure stratified case-cohort designs Lifetime Data Anal 6 39-58
  • [5] Fan J(2007)Weighted likelihood for semiparametric models and two-phase stratified samples, with application to cox regression Scand J Stat 34 86-102
  • [6] Verhasselt A(2007)The Dantzig selector: Statistical estimation when $p$ is much larger than $n$ Ann Stat 35 2313-2351
  • [7] Borgan O(2013)Marginal empirical likelihood and sure independence feature screening Ann Stat 41 2123-2148
  • [8] Langholz B(2001)Generalized case-cohort sampling J R Stat Soc B 63 791-809
  • [9] Samuelsen SO(1999)Case-cohort and case-control analysis with Cox’s model Biometrika 86 755-764
  • [10] Goldstein L(1972)Regression models and life-tables J R Stat Soc B 34 187-220