HIGH-DIMENSIONAL VARIABLE SELECTION WITH RIGHT-CENSORED LENGTH-BIASED DATA

Cited by: 2
Authors
He, Di [1, 2]
Zhou, Yong [3]
Zou, Hui [4]
Affiliations
[1] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai 200433, Peoples R China
[2] Nanjing Univ, Sch Econ, Nanjing 210046, Peoples R China
[3] East China Normal Univ, Acad Stat & Interdisciplinary Sci, Shanghai 200062, Peoples R China
[4] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
Funding
National Natural Science Foundation of China
Keywords
Accelerated failure time model; high-dimensional variable selection; length-biased data; multi-stage penalization; nonconcave penalized likelihood; semiparametric transformation models; nonparametric estimation; empirical distributions; quantile regression; prevalent cohort; adaptive lasso; survival
DOI
10.5705/ss.202018.0089
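Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification code
020208; 070103; 0714
Abstract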
Length-biased data are common in various fields, including epidemiology and labor economics, and they have attracted considerable attention in survival literature. A crucial goal of a survival analysis is to identify a subset of risk factors and their risk contributions from among a vast number of clinical covariates. However, there is no research on variable selection for length-biased data, owing to the complex nature of such data and the lack of a convenient loss function. Therefore, we propose an estimation method based on penalized estimating equations to obtain a sparse and consistent estimator for length-biased data under an accelerated failure time model. The proposed estimator possesses the selection and estimation consistency property. In particular, we implement our method using a SCAD penalty and a local linear approximation algorithm. We suggest selecting the tuning parameter using the extended BIC in high-dimensional settings. Furthermore, we develop a novel multistage SCAD penalized estimating equation procedure to achieve improved estimation accuracy and sparsity in the variable selection. Simulation studies show that the proposed procedure has high accuracy and almost perfect sparsity. Oscar Awards data are analyzed as an application of the proposed method.
Pages: 193-215
Page count: 23