An Empirical Evaluation of Constrained Feature Selection

被引:0
|
作者
Bach J. [1 ]
Zoller K. [2 ]
Trittenbach H. [1 ]
Schulz K. [2 ,3 ]
Böhm K. [1 ]
机构
[1] Department of Informatics, Karlsruhe Institute of Technology (KIT), Am Fasanengarten 5, Baden-Württemberg, Karlsruhe
[2] Department of Mechanical Engineering, Karlsruhe Institute of Technology (KIT), Kaiserstraße 12, Baden-Württemberg, Karlsruhe
[3] Faculty of Mechanical Engineering and Mechatronics, Karlsruhe University of Applied Sciences, Moltkestraße 30, Baden-Württemberg, Karlsruhe
关键词
Constraints; Domain knowledge; Feature selection; Theory-guided data science;
D O I
10.1007/s42979-022-01338-z
中图分类号
学科分类号
摘要
While feature selection helps to get smaller and more understandable prediction models, most existing feature-selection techniques do not consider domain knowledge. One way to use domain knowledge is via constraints on sets of selected features. However, the impact of constraints, e.g., on the predictive quality of selected features, is currently unclear. This article is an empirical study that evaluates the impact of propositional and arithmetic constraints on filter feature selection. First, we systematically generate constraints from various types, using datasets from different domains. As expected, constraints tend to decrease the predictive quality of feature sets, but this effect is non-linear. So we observe feature sets both adhering to constraints and with high predictive quality. Second, we study a concrete setting in materials science. This part of our study sheds light on how one can analyze scientific hypotheses with the help of constraints. © 2022, The Author(s).
引用
收藏
相关论文
共 50 条
  • [21] Empirical Study of Individual Feature Evaluators and Cutting Criteria for Feature Selection in Classification
    Arauzo-Azofra, Antonio
    Aznarte M, Jose L.
    Benitez, Jose M.
    2009 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2009, : 541 - +
  • [22] Feature-Based Algorithm Selection for Constrained Continuous Optimisation
    Poursoltan, Shayan
    Neumann, Frank
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 1461 - 1468
  • [23] Multilabel Feature Selection With Constrained Latent Structure Shared Term
    Gao, Wanfu
    Li, Yonghao
    Hu, Liang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (03) : 1253 - 1262
  • [24] A novel feature selection approach based on constrained eigenvalues optimization
    Benkessirat, Amina
    Benblidia, Nadjia
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 4836 - 4846
  • [25] Evaluation of Feature Selection Measures for Steganalysis
    Rajput, G. K.
    Agrawal, R. K.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2009, 5909 : 432 - 439
  • [26] A Dynamic Evaluation Metric for Feature Selection
    Rajabinasab, Muhammad
    Lautrup, Anton D.
    Hyrup, Tobias
    Zimek, Arthur
    SIMILARITY SEARCH AND APPLICATIONS, SISAP 2024, 2025, 15268 : 65 - 72
  • [27] The Effect of Feature Selection on Phish Website Detection An Empirical Study on Robust Feature Subset Selection for Effective Classification
    Zuhair, Hiba
    Selmat, Ali
    Salleh, Mazleena
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2015, 6 (10) : 221 - 232
  • [28] Feature selection by integrating two groups of feature evaluation criteria
    Gao, Wanfu
    Hu, Liang
    Zhang, Ping
    Wang, Feng
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 110 : 11 - 19
  • [29] The optimal combination of feature selection and data discretization: An empirical study
    Tsai, Chih-Fong
    Chen, Yu-Chi
    INFORMATION SCIENCES, 2019, 505 : 282 - 293
  • [30] The Impact of Feature Selection on Defect Prediction Performance: An Empirical Comparison
    Xu, Zhou
    Liu, Jin
    Yang, Zijiang
    An, Gege
    Jia, Xiangyang
    2016 IEEE 27TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), 2016, : 309 - 320