Ensemble Feature Selection With Block-Regularized m x 2 Cross-Validation

被引:2
|
作者
Yang, Xingli [1 ]
Wang, Yu [2 ]
Wang, Ruibo [2 ]
Li, Jihong [2 ]
机构
[1] Shanxi Univ, Sch Math Sci, Taiyuan 030006, Peoples R China
[2] Shanxi Univ, Sch Modern Educ Technol, Taiyuan 030006, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Correlation; Indexes; Data models; Technological innovation; Reliability theory; Upper bound; Beta distribution; block-regularized m x 2 cross-validation; ensemble feature selection (EFS); false positive; true positive; VARIABLE SELECTION; REGRESSION; PRECISION; RECALL;
D O I
10.1109/TNNLS.2021.3128173
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensemble feature selection (EFS) has attracted significant interest in the literature due to its great potential in reducing the discovery rate of noise features and stabilizing the feature selection results. In view of the superior performance of block-regularized m x 2 cross-validation on generalization performance and algorithm comparison, a novel EFS technology based on block-regularized m x 2 cross-validation is proposed in this study. Contrary to the traditional ensemble learning with a binomial distribution, the distribution of feature selection frequency in the proposed technique is approximated by a beta distribution more accurately. Furthermore, theoretical analysis of the proposed technique shows that it yields a higher selection probability for important features, lower selected risk for noise features, more true positives, and fewer false positives. Finally, the above conclusions are verified by the simulated and real data experiments.
引用
收藏
页码:6628 / 6641
页数:14
相关论文
共 50 条
  • [31] Wavelet basis selection for regression by cross-validation
    Greenblatt, SA
    COMPUTATIONAL APPROACHES TO ECONOMIC PROBLEMS, 1997, 6 : 39 - 55
  • [32] Cross-validation criteria for SETAR model selection
    De Gooijer, JG
    JOURNAL OF TIME SERIES ANALYSIS, 2001, 22 (03) : 267 - 281
  • [33] GENERALIZED CROSS-VALIDATION FOR COVARIANCE MODEL SELECTION
    MARCOTTE, D
    MATHEMATICAL GEOLOGY, 1995, 27 (05): : 659 - 672
  • [34] Hybrid Feature Selection Method Based on Neural Networks and Cross-Validation for Liver Cancer With Microarray
    Kim, Sangman
    Park, Jusung
    IEEE ACCESS, 2018, 6 : 78214 - 78224
  • [35] Consistent cross-validatory model-selection for dependent data:: hv-block cross-validation
    Racine, J
    JOURNAL OF ECONOMETRICS, 2000, 99 (01) : 39 - 61
  • [36] A competitive ensemble pruning approach based on cross-validation technique
    Dai, Qun
    KNOWLEDGE-BASED SYSTEMS, 2013, 37 : 394 - 414
  • [37] Enhancing Phishing Detection Through Ensemble Learning and Cross-Validation
    Jawad, Samer Kadhim
    Alnajjar, Satea Hikmat
    2024 INTERNATIONAL CONFERENCE ON SMART APPLICATIONS, COMMUNICATIONS AND NETWORKING, SMARTNETS-2024, 2024,
  • [38] An approach for dynamic weighting ensemble classifiers based on cross-validation
    Zhu, Yuquan
    Ou, Jishun
    Chen, Geng
    Yu, Haiping
    Journal of Computational Information Systems, 2010, 6 (01): : 297 - 305
  • [39] Best subset selection via cross-validation criterion
    Yuichi Takano
    Ryuhei Miyashiro
    TOP, 2020, 28 : 475 - 488
  • [40] Bootstrap Cross-Validation Improves Model Selection in Pharmacometrics
    Cavenaugh, James Stephens
    STATISTICS IN BIOPHARMACEUTICAL RESEARCH, 2022, 14 (02): : 168 - 203