False discovery rate control for high-dimensional Cox model with uneven data splitting

被引:0
|
作者
Ge, Yeheng [1 ]
Zhang, Sijia [1 ]
Zhang, Xiao [2 ]
机构
[1] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai, Peoples R China
[2] Chinese Univ Hong Kong, Sch Data Sci, Shenzhen, Peoples R China
关键词
De-sparsified estimator; false discovery control; symmetric-based statistic; Cox model; PROPORTIONAL HAZARDS MODEL; VARIABLE SELECTION; CONFIDENCE-INTERVALS; REGRESSION; REGIONS; REGULARIZATION; LASSO; TESTS;
D O I
10.1080/00949655.2023.2290135
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Statistical inference for high-dimensional survival data is important for obtaining valid scientific results in many research areas, including biomedical studies and financial risk management. In this paper, a novel framework for feature selection in Cox model is proposed, which achieves false discovery rate (FDR) control asymptotically. The key step is to construct a sequence of ranking statistics based on two independent estimators of the regression coefficients. The FDR control is reached by choosing a data-driven threshold along the ranking of symmetric-based statistics. The de-sparsified estimator and uneven data splitting strategy are employed to improve the robustness of variable selection results and the power in finite sample analysis. We establish the asymptotic FDR control property for the proposed approach at any designated level. Extensive simulation studies and an empirical application on a P2P loan dataset confirm the robustness of the proposed method in FDR control, and show that it often leads to higher power among competitors.
引用
收藏
页码:1462 / 1493
页数:32
相关论文
共 50 条
  • [21] The terminating-random experiments selector: Fast high-dimensional variable selection with false discovery rate control
    Machkour, Jasin
    Muma, Michael
    Palomar, Daniel P.
    SIGNAL PROCESSING, 2025, 231
  • [22] On the sign consistency of the Lasso for the high-dimensional Cox model
    Lv, Shaogao
    You, Mengying
    Lin, Huazhen
    Lian, Heng
    Huang, Jian
    JOURNAL OF MULTIVARIATE ANALYSIS, 2018, 167 : 79 - 96
  • [23] NETWORK-REGULARIZED HIGH-DIMENSIONAL COX REGRESSION FOR ANALYSIS OF GENOMIC DATA
    Sun, Hokeun
    Lin, Wei
    Feng, Rui
    Li, Hongzhe
    STATISTICA SINICA, 2014, 24 (03) : 1433 - 1459
  • [24] Model-free feature screening for high-dimensional survival data
    Lin, Yuanyuan
    Liu, Xianhui
    Hao, Meiling
    SCIENCE CHINA-MATHEMATICS, 2018, 61 (09) : 1617 - 1636
  • [25] A sequential feature selection procedure for high-dimensional Cox proportional hazards model
    Yu, Ke
    Luo, Shan
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2022, 74 (06) : 1109 - 1142
  • [26] Marginal false discovery rate control for likelihood-based penalized regression models
    Miller, Ryan E.
    Breheny, Patrick
    BIOMETRICAL JOURNAL, 2019, 61 (04) : 889 - 901
  • [27] ONLINE RULES FOR CONTROL OF FALSE DISCOVERY RATE AND FALSE DISCOVERY EXCEEDANCE
    Javanmard, Adel
    Montanari, Andrea
    ANNALS OF STATISTICS, 2018, 46 (02) : 526 - 554
  • [28] Repeated Sieving for Prediction Model Building with High-Dimensional Data
    Liu, Lu
    Jung, Sin-Ho
    JOURNAL OF PERSONALIZED MEDICINE, 2024, 14 (07):
  • [29] Extreme learning machine Cox model for high-dimensional survival analysis
    Wang, Hong
    Li, Gang
    STATISTICS IN MEDICINE, 2019, 38 (12) : 2139 - 2156
  • [30] Sequential selection procedures and false discovery rate control
    G'Sell, Max Grazier
    Wager, Stefan
    Chouldechova, Alexandra
    Tibshirani, Robert
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2016, 78 (02) : 423 - 444