Conditional screening for ultrahigh-dimensional survival data in case-cohort studies

Cited by: 0
Authors
Jing Zhang
Haibo Zhou
Yanyan Liu
Jianwen Cai
Affiliations
[1] Zhongnan University of Economics and Law, School of Statistics and Mathematics
[2] University of North Carolina at Chapel Hill, Department of Biostatistics
[3] Wuhan University, School of Mathematics and Statistics
Source
Lifetime Data Analysis | 2021 / Vol. 27
Keywords
Case-cohort design; Conditional screening; Sure screening property; Survival data; Ultrahigh-dimensional data; Weighted estimating equation
DOI
Not available
Abstract
The case-cohort design has been widely used to reduce the cost of covariate measurements in large cohort studies. In many such studies, the number of covariates is very large, and the goal of the research is to identify the active covariates that strongly influence the response. Since the introduction of sure independence screening, screening procedures have achieved great success in effectively reducing the dimensionality and identifying active covariates. However, because commonly used screening methods are based on marginal correlation or its variants, they may fail to identify hidden active variables, which are jointly important but only weakly correlated with the response. Moreover, these screening methods were mainly proposed for data obtained under simple random sampling and cannot be directly applied to case-cohort data. In this paper, we consider ultrahigh-dimensional survival data under the case-cohort design and propose a conditional screening method that incorporates important prior information about known active variables. This method can effectively detect hidden active variables. Furthermore, it possesses the sure screening property under some mild regularity conditions and does not require any complicated numerical optimization. We evaluate the finite-sample performance of the proposed method via extensive simulation studies and further illustrate the new approach with a real data set from patients with breast cancer.
Pages: 632-661
Number of pages: 29