Model-Free Conditional Feature Screening with FDR Control

被引:10
作者
Tong, Zhaoxue [1 ]
Cai, Zhanrui [2 ]
Yang, Songshan [3 ]
Li, Runze [1 ]
机构
[1] Penn State Univ, University Pk, PA USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Renmin Univ China, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
False discovery rate control; Ranking consistency; Sure screening; Ultra-high dimensional data analysis; FEATURE-SELECTION; FILTER; RATES;
D O I
10.1080/01621459.2022.2063130
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this article, we propose a model-free conditional feature screening method with false discovery rate (FDR) control for ultra-high dimensional data. The proposed method is built upon a new measure of conditional independence. Thus, the new method does not require a specific functional form of the regression function and is robust to heavy-tailed responses and predictors. The variables to be conditional on are allowed to be multivariate. The proposed method enjoys sure screening and ranking consistency properties under mild regularity conditions. To control the FDR, we apply the Reflection via Data Splitting method and prove its theoretical guarantee using martingale theory and empirical process techniques. Simulated examples and real data analysis show that the proposed method performs very well compared with existing works. Supplementary materials for this article are available online.
引用
收藏
页码:2575 / 2587
页数:13
相关论文
共 49 条
  • [41] Grouped feature screening for ultra-high dimensional data for the classification model
    He, Hanji
    Deng, Guangming
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2022, 92 (05) : 974 - 997
  • [42] Feature selection for multiset-valued data based on fuzzy conditional information entropy using iterative model and matrix operation
    Huang, Dan
    Chen, Yiying
    Liu, Fang
    Li, Zhaowen
    APPLIED SOFT COMPUTING, 2023, 142
  • [43] Feature Screening for Ultrahigh-dimensional Censored Data with Varying Coefficient Single-index Model
    Liu, Yi
    ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2019, 35 (04): : 845 - 861
  • [44] Feature Screening for Ultrahigh-dimensional Censored Data with Varying Coefficient Single-index Model
    Yi Liu
    Acta Mathematicae Applicatae Sinica, English Series, 2019, 35 : 845 - 861
  • [45] Feature Screening for Ultrahigh-dimensional Censored Data with Varying Coefficient Single-index Model
    Yi LIU
    ActaMathematicaeApplicataeSinica, 2019, 35 (04) : 845 - 861
  • [46] Feature selection-based machine learning modeling for distributed model predictive control of nonlinear processes
    Zhao, Tianyi
    Zheng, Yingzhe
    Wu, Zhe
    COMPUTERS & CHEMICAL ENGINEERING, 2023, 169
  • [47] Optimized Defect Prediction Model Using Statistical Process Control and Correlation-Based Feature Selection Method
    Nanditha, J.
    Sruthi, K. N.
    Ashok, Sreeja
    Judy, M. V.
    INTELLIGENT SYSTEMS TECHNOLOGIES AND APPLICATIONS, VOL 1, 2016, 384 : 355 - 366
  • [48] A proposed model based on k-nearest neighbour classifier with feature selection techniques to control and forecast plant disease
    Imran, Inas Ismael
    Ali, Rawaa Hamza
    Jameel, Shymaa Mohammed
    Jaleel, Refed Adnan
    INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2024, 15 (3-4) : 306 - 313
  • [49] Cascade-Free Model Predictive Control for Single-Phase Grid-Connected Power Converters
    Acuna, Pablo
    Aguilera, Ricardo P.
    Ghias, Amer M. Y. M.
    Rivera, Marco
    Baier, Carlos R.
    Agelidis, Vassilios G.
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, 64 (01) : 285 - 294