Feature screening for ultrahigh-dimensional survival data when failure indicators are missing at random

被引:0
作者
Jianglin Fang
机构
[1] Hunan Institute of Engineering,College of Science
来源
Statistical Papers | 2021年 / 62卷
关键词
Ultrahigh-dimensional data; Censored data; Missing data; Feature screening; Active variable set;
D O I
暂无
中图分类号
学科分类号
摘要
In modern statistical applications, the dimension of covariates can be much larger than the sample size, and extensive research has been done on screening methods which can effectively reduce the dimensionality. However, the existing feature screening procedure can not be used to handle the ultrahigh-dimensional survival data problems when failure indicators are missing at random. This motivates us to develop a feature screening procedure to handle this case. In this paper, we propose a feature screening procedure by sieved nonparametric maximum likelihood technique for ultrahigh-dimensional survival data with failure indicators missing at random. The proposed method has several desirable advantages. First, it does not rely on any model assumption and works well for nonlinear survival regression models. Second, it can be used to handle the incomplete survival data with failure indicators missing at random. Third, the proposed method is invariant under the monotone transformation of the response and satisfies the sure screening property. Simulation studies are conducted to examine the performance of our approach, and a real data example is also presented for illustration.
引用
收藏
页码:1141 / 1166
页数:25
相关论文
共 64 条
  • [1] Bitouzé D(1999)A Dvoretzky-Kiefer-Wolfowitz type inequality for the Kaplan-Meier estimator Annals de I’Institut Henri Poincare B 35 735-763
  • [2] Laurent B(2007)The Dantzig selector: statistical estimation when Ann Stat 35 2313-2351
  • [3] Massart P(2001) is much larger than J Am Stat Assoc 96 1348-1360
  • [4] Candes E(2008)Variable selection via nonconcave penalized likelihood and its oracle properties J R Stat Soc Ser B 35 2313-2351
  • [5] Tao T(2010)Sure independence screening for ultrahigh dimensional feature space J Am Stat Assoc 38 3567-3604
  • [6] Fan J(2010)Sure independence screening in generalized linear models with NP-dimensionality Statistics 2 70-86
  • [7] Li R(1981)High-dimensional variable selection for Cox’s proportional hazards model Ann Stat 9 853-860
  • [8] Fan J(1983)Testing with replacement and the product limit estimator Ann Stat 11 49-58
  • [9] Lv J(2008)Large sample behaviour of the product-limit estimator on the whole line Stat Pap 49 791-792
  • [10] Fan J(2013)An improved estimator to analyse missing data Ann Stat 41 342-369