Feature screening based on distance correlation for ultrahigh-dimensional censored data with covariate measurement error

被引:10
作者
Chen, Li-Pang [1 ]
机构
[1] Univ Western Ontario, Dept Stat & Actuarial Sci, 1151 Richmond St, London, ON N6A 3K7, Canada
关键词
Buckley-James imputation; Marginal dependence; Mismeasurement; Model misspecification; Survival data; Ultrahigh-dimension; VARIABLE SELECTION; MODEL; SURVIVAL; LIKELIHOOD; REGRESSION;
D O I
10.1007/s00180-020-01039-2
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Feature screening is an important method to reduce the dimension and capture informative variables in ultrahigh-dimensional data analysis. Its key idea is to select informative variables using correlations between the response and the covariates. Many methods have been developed for feature screening. These methods, however, are challenged by complex features pertinent to the data collection as well as the nature of the data themselves. Typically, incomplete response caused by right-censoring and covariate measurement error are often accompanying with survival analysis. Even though many methods have been proposed for censored data, little work has been available when both incomplete response and measurement error occur simultaneously. In addition, the conventional feature screening methods may fail to detect the truly important covariates that are marginally independent of the response variable due to correlations among covariates. In this paper, we explore this important problem and propose the model-free feature screening method in the presence of the censored response and error-prone covariates. In addition, we also develop the iteration method to improve the accuracy of selecting all important covariates. Numerical studies are reported to assess the performance of the proposed method. Finally, we implement the proposed method to a real dataset.
引用
收藏
页码:857 / 884
页数:28
相关论文
共 33 条
[1]  
Akaike H, 1998, SELECTED PAPERS HIRO, P199
[2]  
BUCKLEY J, 1979, BIOMETRIKA, V66, P429
[3]  
Candes E, 2007, ANN STAT, V35, P2313, DOI 10.1214/009053606000001523
[4]  
Carroll J., 2006, MEASUREMENT ERROR NO, V2nd edn
[5]   Dynamic Treatment Regimes: Statistical Methods for Precision Medicine [J].
Chen, Li-Pang .
BIOMETRICS, 2020, 76 (03) :1045-1046
[6]   Semiparametric methods for left-truncated and right-censored survival data with covariate measurement error [J].
Chen, Li-Pang ;
Yi, Grace Y. .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2021, 73 (03) :481-517
[7]   Semiparametric estimation for the transformation model with length-biased data and covariate measurement error [J].
Chen, Li-Pang .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2020, 90 (03) :420-442
[8]   Semiparametric estimation for cure survival model with left-truncated and right-censored data and covariate measurement error [J].
Chen, Li-Pang .
STATISTICS & PROBABILITY LETTERS, 2019, 154
[9]   Pseudo likelihood estimation for the additive hazards model with data subject to left-truncation and right-censoring [J].
Chen, Li-Pang .
STATISTICS AND ITS INTERFACE, 2019, 12 (01) :135-148
[10]   Semiparametric estimation for the accelerated failure time model with length-biased sampling and covariate measurement error [J].
Chen, Li-Pang .
STAT, 2018, 7 (01)