An Empirical Evaluation of Constrained Feature Selection

被引:0
|
作者
Bach J. [1 ]
Zoller K. [2 ]
Trittenbach H. [1 ]
Schulz K. [2 ,3 ]
Böhm K. [1 ]
机构
[1] Department of Informatics, Karlsruhe Institute of Technology (KIT), Am Fasanengarten 5, Baden-Württemberg, Karlsruhe
[2] Department of Mechanical Engineering, Karlsruhe Institute of Technology (KIT), Kaiserstraße 12, Baden-Württemberg, Karlsruhe
[3] Faculty of Mechanical Engineering and Mechatronics, Karlsruhe University of Applied Sciences, Moltkestraße 30, Baden-Württemberg, Karlsruhe
关键词
Constraints; Domain knowledge; Feature selection; Theory-guided data science;
D O I
10.1007/s42979-022-01338-z
中图分类号
学科分类号
摘要
While feature selection helps to get smaller and more understandable prediction models, most existing feature-selection techniques do not consider domain knowledge. One way to use domain knowledge is via constraints on sets of selected features. However, the impact of constraints, e.g., on the predictive quality of selected features, is currently unclear. This article is an empirical study that evaluates the impact of propositional and arithmetic constraints on filter feature selection. First, we systematically generate constraints from various types, using datasets from different domains. As expected, constraints tend to decrease the predictive quality of feature sets, but this effect is non-linear. So we observe feature sets both adhering to constraints and with high predictive quality. Second, we study a concrete setting in materials science. This part of our study sheds light on how one can analyze scientific hypotheses with the help of constraints. © 2022, The Author(s).
引用
收藏
相关论文
共 50 条
  • [1] An Empirical Evaluation of Techniques for Feature Selection with Cost
    Adams, Stephen
    Meekins, Ryan
    Beling, Peter A.
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017, : 834 - 841
  • [2] Empirical evaluation of feature selection methods in classification
    Cehovin, Luka
    Bosnic, Zoran
    INTELLIGENT DATA ANALYSIS, 2010, 14 (03) : 265 - 281
  • [3] An Empirical Evaluation of Feature Selection Stability and Classification Accuracy
    Buyukkececi, Mustafa
    Okur, Mehmet Cudi
    GAZI UNIVERSITY JOURNAL OF SCIENCE, 2024, 37 (02): : 606 - 620
  • [4] Empirical Evaluation of the Performance of Feature Selection Approaches on Random Forest
    Kumar, Smitha S.
    Shaikh, Talal
    2017 INTERNATIONAL CONFERENCE ON COMPUTER AND APPLICATIONS (ICCA), 2017, : 227 - 231
  • [5] Empirical Evaluation of the Ensemble Framework for Feature Selection in DDoS Attack
    Das, Saikat
    Venugopal, Deepak
    Shiva, Sajjan
    Sheldon, Frederick T.
    2020 7TH IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND CLOUD COMPUTING (CSCLOUD 2020)/2020 6TH IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING AND SCALABLE CLOUD (EDGECOM 2020), 2020, : 56 - 61
  • [6] Empirical study of feature selection methods based on individual feature evaluation for classification problems
    Arauzo-Azofra, Antonio
    Aznarte, Jose Luis
    Benitez, Jose M.
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (07) : 8170 - 8177
  • [7] Constrained Laplacian Score for Semi-supervised Feature Selection
    Benabdeslem, Khalid
    Hindawi, Mohammed
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I, 2011, 6911 : 204 - 218
  • [8] Data-driven Feature Selection Methods for Text Classification: an Empirical Evaluation
    Fragoso, Rogerio C. P.
    Pinheiro, Roberto H. W.
    Cavalcanti, George D. C.
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2019, 25 (04) : 334 - 360
  • [9] Constrained class-wise feature selection (CCFS)
    Syed Fawad Hussain
    Fatima Shahzadi
    Badre Munir
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 3211 - 3224
  • [10] Constrained class-wise feature selection (CCFS)
    Hussain, Syed Fawad
    Shahzadi, Fatima
    Munir, Badre
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (10) : 3211 - 3224