Variable selection for censored data with greedy algorithm based adaptive quantile regression models

被引:1
作者
Rahaman Khan, Md Hasinur [1 ]
Nishat, Md Nasim Saba [1 ]
机构
[1] Univ Dhaka, Inst Stat Res & Training, Appl Stat & Data Sci, Dhaka, Bangladesh
关键词
Accelerated failure time models; Greedy algorithms; High dimensional censored data; Variable selection; HIGH-DIMENSIONAL DATA; SURVIVAL ANALYSIS; LASSO;
D O I
10.1080/03610918.2025.2483889
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The application of quantile regression offers a versatile and appealing approach for analyzing censored data, particularly within the accelerated failure time (AFT) model, where the focus lies on the significance of conditional quantile functions in regression analysis. The extension is achieved through the integration of well-established greedy algorithms-sure independence screening (SIS), tilting, and PC-simple-resulting in the development of Quant-SIS, Quant-Tilting, and Quant-PC techniques, respectively. These techniques prove to be adaptable, efficient, and consistent variable selection algorithms for high-dimensional datasets due to the inherent properties of sure independence, tilting correlation, and partial faithfulness. We compare the performance of the proposed methods with two competitive approaches from the existing literature. Through a comprehensive series of simulation studies encompassing diverse scenarios including varying collinearity levels among covariates, levels of censoring, and quantiles-we demonstrate their efficacy. Additionally, we apply the proposed methods to real-world microarray data from Diffuse Large B-cell Lymphoma (DLBCL) patients. This application showcases the ability of our techniques to accurately identify genes associated with the survival time of DLBCL patients. The results indicate a substantial enhancement in performance, as the modified quantile regression techniques for censored data significantly outperform existing methods across a wide spectrum of cases.
引用
收藏
页数:18
相关论文
共 46 条
[1]  
Agrawal R., 1998, SIGMOD Record, V27, P94, DOI 10.1145/276305.276314
[2]   Median regression with censored cost data [J].
Bang, H ;
Tsiatis, AA .
BIOMETRICS, 2002, 58 (03) :643-649
[3]   l1-PENALIZED QUANTILE REGRESSION IN HIGH-DIMENSIONAL SPARSE MODELS [J].
Belloni, Alexandre ;
Chernozhukov, Victor .
ANNALS OF STATISTICS, 2011, 39 (01) :82-130
[4]   Variable selection in high-dimensional linear models: partially faithful distributions and the PC-simple algorithm [J].
Buehlmann, P. ;
Kalisch, M. ;
Maathuis, M. H. .
BIOMETRIKA, 2010, 97 (02) :261-278
[5]   High dimensional variable selection via tilting [J].
Cho, Haeran ;
Fryzlewicz, Piotr .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2012, 74 :593-622
[6]  
Efron B., 1967, P 5 BERK S MATH STAT, V4, P831, DOI DOI 10.1525/9780520313903
[7]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883
[8]  
Fan JQ, 2002, ANN STAT, V30, P74
[9]   Bayesian variable selection method for censored survival data [J].
Faraggi, D .
BIOMETRICS, 1998, 54 (04) :1475-1485
[10]   Research on collaborative negotiation for e-commerce. [J].
Feng, YQ ;
Lei, Y ;
Li, Y ;
Cao, RZ .
2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, :2085-2088