Model-Free Conditional Feature Screening with FDR Control

被引:10
作者
Tong, Zhaoxue [1 ]
Cai, Zhanrui [2 ]
Yang, Songshan [3 ]
Li, Runze [1 ]
机构
[1] Penn State Univ, University Pk, PA USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Renmin Univ China, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
False discovery rate control; Ranking consistency; Sure screening; Ultra-high dimensional data analysis; FEATURE-SELECTION; FILTER; RATES;
D O I
10.1080/01621459.2022.2063130
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this article, we propose a model-free conditional feature screening method with false discovery rate (FDR) control for ultra-high dimensional data. The proposed method is built upon a new measure of conditional independence. Thus, the new method does not require a specific functional form of the regression function and is robust to heavy-tailed responses and predictors. The variables to be conditional on are allowed to be multivariate. The proposed method enjoys sure screening and ranking consistency properties under mild regularity conditions. To control the FDR, we apply the Reflection via Data Splitting method and prove its theoretical guarantee using martingale theory and empirical process techniques. Simulated examples and real data analysis show that the proposed method performs very well compared with existing works. Supplementary materials for this article are available online.
引用
收藏
页码:2575 / 2587
页数:13
相关论文
共 49 条
  • [31] A Model-Free Diagnosis Approach for Intake Leakage Detection and Characterization in Diesel Engines
    Hoblos, Ghaleb
    Benkaci, Mourad
    MACHINES, 2015, 3 (03) : 157 - 172
  • [32] Feature selection in a discrete feature space based on fuzzy conditional information entropy iterative model and matrix operation
    Li, Zhaowen
    Chen, Yiying
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2023, 52 (05) : 597 - 635
  • [33] Filter-based robust model-free adaptive funnel control for discrete-time nonlinear systems with jumped reference signal
    Cheng, Yun
    Ren, Xuemei
    Zheng, Dongdong
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (18) : 11019 - 11035
  • [34] Clipper: p-value-free FDR control on high-throughput data from two conditions
    Ge, Xinzhou
    Chen, Yiling Elaine
    Song, Dongyuan
    McDermott, MeiLu
    Woyshner, Kyla
    Manousopoulou, Antigoni
    Wang, Ning
    Li, Wei
    Wang, Leo D.
    Li, Jingyi Jessica
    GENOME BIOLOGY, 2021, 22 (01)
  • [35] Improved Thermal Comfort Model Leveraging Conditional Tabular GAN Focusing on Feature Selection
    Shajalal, Md.
    Bohlouli, Milad
    Das, Hari Prasanna
    Boden, Alexander
    Stevens, Gunnar
    IEEE ACCESS, 2024, 12 : 30039 - 30053
  • [36] Conditional Feature Selection: Evaluating Model Averaging When Selecting Features with Shapley Values
    Huber, Florian
    Steinhage, Volker
    GEOMATICS, 2024, 4 (03): : 286 - 310
  • [37] An Empirical Study on New Model-Free Multi-output Variable Selection Methods
    Ansari, Jonathan
    Luetkebohmert, Eva
    Rockel, Marcus
    COMBINING, MODELLING AND ANALYZING IMPRECISION, RANDOMNESS AND DEPENDENCE, SMPS 2024, 2024, 1458 : 9 - 17
  • [38] Block HSIC Lasso: model-free biomarker detection for ultra-high dimensional data
    Climente-Gonzalez, Hector
    Azencott, Chloe-Agathe
    Kaski, Samuel
    Yamada, Makoto
    BIOINFORMATICS, 2019, 35 (14) : I427 - I435
  • [39] Model-free isoconversional method applied to polymer crystallization governed by the Hoffman-Lauritzen kinetics
    Farjas, Jordi
    Pere Lopez-Olmedo, Joan
    Roura, Pere
    POLYMER, 2017, 120 : 111 - 118
  • [40] Model-free PZC filter-based multi-loop speed controller for servo drives
    Lim, Hee-Sung
    Kim, Seok-Kyoon
    Lee, Kyo-Beum
    JOURNAL OF POWER ELECTRONICS, 2024, 24 (08) : 1273 - 1285