Model-Free Conditional Feature Screening with FDR Control

被引:10
|
作者
Tong, Zhaoxue [1 ]
Cai, Zhanrui [2 ]
Yang, Songshan [3 ]
Li, Runze [1 ]
机构
[1] Penn State Univ, University Pk, PA USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Renmin Univ China, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
False discovery rate control; Ranking consistency; Sure screening; Ultra-high dimensional data analysis; FEATURE-SELECTION; FILTER; RATES;
D O I
10.1080/01621459.2022.2063130
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this article, we propose a model-free conditional feature screening method with false discovery rate (FDR) control for ultra-high dimensional data. The proposed method is built upon a new measure of conditional independence. Thus, the new method does not require a specific functional form of the regression function and is robust to heavy-tailed responses and predictors. The variables to be conditional on are allowed to be multivariate. The proposed method enjoys sure screening and ranking consistency properties under mild regularity conditions. To control the FDR, we apply the Reflection via Data Splitting method and prove its theoretical guarantee using martingale theory and empirical process techniques. Simulated examples and real data analysis show that the proposed method performs very well compared with existing works. Supplementary materials for this article are available online.
引用
收藏
页码:2575 / 2587
页数:13
相关论文
共 49 条
  • [21] An efficient model-free approach to interaction screening for high dimensional data
    Xiong, Wei
    Pan, Han
    Wang, Jianrong
    Tian, Maozai
    STATISTICS IN MEDICINE, 2023, 42 (10) : 1583 - 1605
  • [22] Feature screening via false discovery rate control for linear model with multivariate responses
    Yu, Congran
    Cui, Hengjian
    STATISTICAL PAPERS, 2025, 66 (02)
  • [23] A model-free multivariate non-recursive feature elimination for feature selection on high-dimensional complex multiple response data
    Xia, Siwei
    Yang, Yuehan
    INFORMATION SCIENCES, 2025, 713
  • [24] Robust Model-Free Gait Recognition by Statistical Dependency Feature Selection and Globality-Locality Preserving Projections
    Rida, Imad
    Boubchir, Larbi
    Al-Maadeed, Noor
    Al-Maadeed, Somaya
    Bouridane, Ahmed
    2016 39TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2016, : 652 - 655
  • [25] Ultrahigh dimensional feature screening for additive model with multivariate response
    Liu, Shishi
    Li, Xiangjie
    Zhang, Jingxiao
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2020, 90 (05) : 775 - 799
  • [26] A Novel Feature Selection Method for the Conditional Information Entropy Model
    Ruan, Jing
    Zhang, Changsheng
    EMERGING RESEARCH IN ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, 2011, 237 : 598 - +
  • [27] Model-Free Statistical Inference on High-Dimensional Data
    Guo, Xu
    Li, Runze
    Zhang, Zhe
    Zou, Changliang
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, : 186 - 197
  • [28] Threshold Selection in Feature Screening for Error Rate Control
    Guo, Xu
    Ren, Haojie
    Zou, Changliang
    Li, Runze
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (543) : 1773 - 1785
  • [29] A feature ranking model with redundancy control
    Zhou X.
    Diao X.
    Cao J.
    Sichuan Daxue Xuebao (Gongcheng Kexue Ban)/Journal of Sichuan University (Engineering Science Edition), 2016, 48 (05): : 153 - 158
  • [30] A Model-Free, Fully Automated Baseline-Removal Method for Raman Spectra
    Schulze, H. Georg
    Foist, Rod B.
    Okuda, Kadek
    Ivanov, Andre
    Turner, Robin F. B.
    APPLIED SPECTROSCOPY, 2011, 65 (01) : 75 - 84