Model-Free Conditional Feature Screening with FDR Control

被引:10
|
作者
Tong, Zhaoxue [1 ]
Cai, Zhanrui [2 ]
Yang, Songshan [3 ]
Li, Runze [1 ]
机构
[1] Penn State Univ, University Pk, PA USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Renmin Univ China, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
False discovery rate control; Ranking consistency; Sure screening; Ultra-high dimensional data analysis; FEATURE-SELECTION; FILTER; RATES;
D O I
10.1080/01621459.2022.2063130
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this article, we propose a model-free conditional feature screening method with false discovery rate (FDR) control for ultra-high dimensional data. The proposed method is built upon a new measure of conditional independence. Thus, the new method does not require a specific functional form of the regression function and is robust to heavy-tailed responses and predictors. The variables to be conditional on are allowed to be multivariate. The proposed method enjoys sure screening and ranking consistency properties under mild regularity conditions. To control the FDR, we apply the Reflection via Data Splitting method and prove its theoretical guarantee using martingale theory and empirical process techniques. Simulated examples and real data analysis show that the proposed method performs very well compared with existing works. Supplementary materials for this article are available online.
引用
收藏
页码:2575 / 2587
页数:13
相关论文
共 49 条
  • [1] Model-Free Feature Screening and FDR Control With Knockoff Features
    Liu, Wanjun
    Ke, Yuan
    Liu, Jingyuan
    Li, Runze
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (537) : 428 - 443
  • [2] Model-free conditional screening via conditional distance correlation
    Jun Lu
    Lu Lin
    Statistical Papers, 2020, 61 : 225 - 244
  • [3] Model-free conditional screening via conditional distance correlation
    Lu, Jun
    Lin, Lu
    STATISTICAL PAPERS, 2020, 61 (01) : 225 - 244
  • [4] Model-free feature screening for ultrahigh-dimensional data conditional on some variables
    Yi Liu
    Qihua Wang
    Annals of the Institute of Statistical Mathematics, 2018, 70 : 283 - 301
  • [5] Model-free feature screening for ultrahigh-dimensional data conditional on some variables
    Liu, Yi
    Wang, Qihua
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2018, 70 (02) : 283 - 301
  • [6] Model-free conditional feature screening for ultra-high dimensional right censored data
    Chen, Xiaolin
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2018, 88 (12) : 2425 - 2446
  • [7] Model-free feature screening for ultrahigh dimensional classification
    Sheng, Ying
    Wang, Qihua
    JOURNAL OF MULTIVARIATE ANALYSIS, 2020, 178
  • [8] Model-Free Feature Screening for Ultrahigh Dimenssional Discriminant Analysis
    Cui, Hengjian
    Li, Runze
    Zhong, Wei
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2015, 110 (510) : 630 - 641
  • [9] Robust model-free feature screening via quantile correlation
    Ma, Xuejun
    Zhang, Jingxiao
    JOURNAL OF MULTIVARIATE ANALYSIS, 2016, 143 : 472 - 480
  • [10] Model-free feature screening for ultrahigh dimensional censored regression
    Tingyou Zhou
    Liping Zhu
    Statistics and Computing, 2017, 27 : 947 - 961