Group Feature Screening Based on Information Gain Ratio for Ultrahigh-Dimensional Data

被引:3
作者
Wang, Zhongzheng [1 ]
Deng, Guangming [1 ,2 ]
Yu, Jianqi [1 ]
机构
[1] Guilin Univ Technol, Coll Sci, Guilin 541000, Peoples R China
[2] Guilin Univ Technol, Appl Stat Inst, Guilin 541000, Peoples R China
基金
中国国家自然科学基金;
关键词
REGRESSION; SELECTION; LASSO;
D O I
10.1155/2022/1600986
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Most model-free feature screening approaches focus on the -individual predictor; therefore, they are not able to incorporate structured predictors like grouped variables. In this article, we propose a group screening procedure via the information gain ratio for a classification model, which is a direct extension of the original sure independence screening procedure and also model-free. The proposed method yields a better screening performance and classification accuracy. It is demonstrated that the proposed group screening method possesses the sure screening property and ranking consistency properties under certain regularity conditions. Through simulation studies and real-world data analysis, we demonstrate the proposed method with the finite sample performance.
引用
收藏
页数:15
相关论文
共 21 条
  • [1] The Group Exponential Lasso for Bi-Level Variable Selection
    Breheny, Patrick
    [J]. BIOMETRICS, 2015, 71 (03) : 731 - 740
  • [2] Breheny P, 2009, STAT INTERFACE, V2, P369
  • [3] Model-Free Feature Screening for Ultrahigh Dimenssional Discriminant Analysis
    Cui, Hengjian
    Li, Runze
    Zhong, Wei
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2015, 110 (510) : 630 - 641
  • [4] Sure independence screening for ultrahigh dimensional feature space
    Fan, Jianqing
    Lv, Jinchi
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 : 849 - 883
  • [5] Fan JQ, 2009, J MACH LEARN RES, V10, P2013
  • [6] Grouped feature screening for ultra-high dimensional data for the classification model
    He, Hanji
    Deng, Guangming
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2022, 92 (05) : 974 - 997
  • [7] Feature Screening for Ultrahigh Dimensional Categorical Data With Applications
    Huang, Danyang
    Li, Runze
    Wang, Hansheng
    [J]. JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2014, 32 (02) : 237 - 244
  • [8] A group bridge approach for variable selection
    Huang, Jian
    Ma, Shuange
    Xie, Huiliang
    Zhang, Cun-Hui
    [J]. BIOMETRIKA, 2009, 96 (02) : 339 - 355
  • [9] The Kolmogorov filter for variable screening in high-dimensional binary classification
    Mai, Qing
    Zou, Hui
    [J]. BIOMETRIKA, 2013, 100 (01) : 229 - 234
  • [10] Ni L., 2019, THESIS E CHINA NORMA