False discovery rate control for high-dimensional Cox model with uneven data splitting

被引:0
|
作者
Ge, Yeheng [1 ]
Zhang, Sijia [1 ]
Zhang, Xiao [2 ]
机构
[1] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai, Peoples R China
[2] Chinese Univ Hong Kong, Sch Data Sci, Shenzhen, Peoples R China
关键词
De-sparsified estimator; false discovery control; symmetric-based statistic; Cox model; PROPORTIONAL HAZARDS MODEL; VARIABLE SELECTION; CONFIDENCE-INTERVALS; REGRESSION; REGIONS; REGULARIZATION; LASSO; TESTS;
D O I
10.1080/00949655.2023.2290135
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Statistical inference for high-dimensional survival data is important for obtaining valid scientific results in many research areas, including biomedical studies and financial risk management. In this paper, a novel framework for feature selection in Cox model is proposed, which achieves false discovery rate (FDR) control asymptotically. The key step is to construct a sequence of ranking statistics based on two independent estimators of the regression coefficients. The FDR control is reached by choosing a data-driven threshold along the ranking of symmetric-based statistics. The de-sparsified estimator and uneven data splitting strategy are employed to improve the robustness of variable selection results and the power in finite sample analysis. We establish the asymptotic FDR control property for the proposed approach at any designated level. Extensive simulation studies and an empirical application on a P2P loan dataset confirm the robustness of the proposed method in FDR control, and show that it often leads to higher power among competitors.
引用
收藏
页码:1462 / 1493
页数:32
相关论文
共 50 条