Variable selection for censored data with greedy algorithm based adaptive quantile regression models

被引:0
作者
Rahaman Khan, Md Hasinur [1 ]
Nishat, Md Nasim Saba [1 ]
机构
[1] Univ Dhaka, Inst Stat Res & Training, Appl Stat & Data Sci, Dhaka, Bangladesh
关键词
Accelerated failure time models; Greedy algorithms; High dimensional censored data; Variable selection; HIGH-DIMENSIONAL DATA; SURVIVAL ANALYSIS; LASSO;
D O I
10.1080/03610918.2025.2483889
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The application of quantile regression offers a versatile and appealing approach for analyzing censored data, particularly within the accelerated failure time (AFT) model, where the focus lies on the significance of conditional quantile functions in regression analysis. The extension is achieved through the integration of well-established greedy algorithms-sure independence screening (SIS), tilting, and PC-simple-resulting in the development of Quant-SIS, Quant-Tilting, and Quant-PC techniques, respectively. These techniques prove to be adaptable, efficient, and consistent variable selection algorithms for high-dimensional datasets due to the inherent properties of sure independence, tilting correlation, and partial faithfulness. We compare the performance of the proposed methods with two competitive approaches from the existing literature. Through a comprehensive series of simulation studies encompassing diverse scenarios including varying collinearity levels among covariates, levels of censoring, and quantiles-we demonstrate their efficacy. Additionally, we apply the proposed methods to real-world microarray data from Diffuse Large B-cell Lymphoma (DLBCL) patients. This application showcases the ability of our techniques to accurately identify genes associated with the survival time of DLBCL patients. The results indicate a substantial enhancement in performance, as the modified quantile regression techniques for censored data significantly outperform existing methods across a wide spectrum of cases.
引用
收藏
页数:18
相关论文
共 46 条
[11]   Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data [J].
Gui, J ;
Li, HZ .
BIOINFORMATICS, 2005, 21 (13) :3001-3008
[12]   QUANTILE-ADAPTIVE MODEL-FREE VARIABLE SCREENING FOR HIGH-DIMENSIONAL HETEROGENEOUS DATA [J].
He, Xuming ;
Wang, Lan ;
Hong, Hyokyoung Grace .
ANNALS OF STATISTICS, 2013, 41 (01) :342-369
[13]   Penalized estimating functions and variable selection in semiparametric regression models [J].
Johnson, Brent A. ;
Lin, D. Y. ;
Zeng, Donglin .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (482) :672-680
[14]  
Khan M., 2013, THESIS U WARWICK
[15]   Ranking based variable selection for censored data using AFT models [J].
Khan, Md Hasinur Rahaman ;
Akhter, Marzan .
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024, 53 (06) :2917-2939
[16]   Stability selection for lasso, ridge and elastic net implemented with AFT models [J].
Khan, Md Hasinur Rahaman ;
Bhadra, Anamika ;
Howlader, Tamanna .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2019, 18 (05)
[17]   Variable selection for accelerated lifetime models with synthesized estimation techniques [J].
Khan, Md Hasinur Rahaman ;
Shaw, J. Ewart H. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2019, 28 (03) :937-952
[18]   On the performance of adaptive preprocessing technique in analyzing high-dimensional censored data [J].
Khan, Md Hasinur Rahaman .
BIOMETRICAL JOURNAL, 2018, 60 (04) :687-702
[19]   Variable selection for survival data with a class of adaptive elastic net techniques [J].
Khan, Md Hasinur Rahaman ;
Shaw, J. Ewart H. .
STATISTICS AND COMPUTING, 2016, 26 (03) :725-741
[20]   REGRESSION QUANTILES [J].
KOENKER, R ;
BASSETT, G .
ECONOMETRICA, 1978, 46 (01) :33-50