An Empirical Evaluation of Constrained Feature Selection

被引:0
|
作者
Bach J. [1 ]
Zoller K. [2 ]
Trittenbach H. [1 ]
Schulz K. [2 ,3 ]
Böhm K. [1 ]
机构
[1] Department of Informatics, Karlsruhe Institute of Technology (KIT), Am Fasanengarten 5, Baden-Württemberg, Karlsruhe
[2] Department of Mechanical Engineering, Karlsruhe Institute of Technology (KIT), Kaiserstraße 12, Baden-Württemberg, Karlsruhe
[3] Faculty of Mechanical Engineering and Mechatronics, Karlsruhe University of Applied Sciences, Moltkestraße 30, Baden-Württemberg, Karlsruhe
关键词
Constraints; Domain knowledge; Feature selection; Theory-guided data science;
D O I
10.1007/s42979-022-01338-z
中图分类号
学科分类号
摘要
While feature selection helps to get smaller and more understandable prediction models, most existing feature-selection techniques do not consider domain knowledge. One way to use domain knowledge is via constraints on sets of selected features. However, the impact of constraints, e.g., on the predictive quality of selected features, is currently unclear. This article is an empirical study that evaluates the impact of propositional and arithmetic constraints on filter feature selection. First, we systematically generate constraints from various types, using datasets from different domains. As expected, constraints tend to decrease the predictive quality of feature sets, but this effect is non-linear. So we observe feature sets both adhering to constraints and with high predictive quality. Second, we study a concrete setting in materials science. This part of our study sheds light on how one can analyze scientific hypotheses with the help of constraints. © 2022, The Author(s).
引用
收藏
相关论文
共 50 条
  • [41] Evaluation of Feature Selection on Human Activity Recognition
    Mazaar, Hussein
    Emary, Eid
    Onsi, Hoda
    2015 IEEE SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INFORMATION SYSTEMS (ICICIS), 2015, : 591 - 599
  • [42] Evaluation of feature selection on network traffic classification
    Wang, Yun
    Wang, Pan
    Wang, ZiXuan
    Wu, KaiLin
    2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 813 - 818
  • [43] Constraint Score Evaluation for Spectral Feature Selection
    Kalakech, Mariam
    Biela, Philippe
    Hamad, Denis
    Macaire, Ludovic
    NEURAL PROCESSING LETTERS, 2013, 38 (02) : 155 - 175
  • [44] An evaluation of feature selection methods for environmental data
    Effrosynidis, Dimitrios
    Arampatzis, Avi
    ECOLOGICAL INFORMATICS, 2021, 61
  • [45] Feature evaluation and selection with cooperative game theory
    Sun, Xin
    Liu, Yanheng
    Li, Jin
    Zhu, Jianqi
    Chen, Huiling
    Liu, Xuejie
    PATTERN RECOGNITION, 2012, 45 (08) : 2992 - 3002
  • [46] Constraint Score Evaluation for Spectral Feature Selection
    Mariam Kalakech
    Philippe Biela
    Denis Hamad
    Ludovic Macaire
    Neural Processing Letters, 2013, 38 : 155 - 175
  • [47] Cost-Constrained feature selection in binary classification: adaptations for greedy forward selection and genetic algorithms
    Rudolf Jagdhuber
    Michel Lang
    Arnulf Stenzl
    Jochen Neuhaus
    Jörg Rahnenführer
    BMC Bioinformatics, 21
  • [48] Cost-Constrained feature selection in binary classification: adaptations for greedy forward selection and genetic algorithms
    Jagdhuber, Rudolf
    Lang, Michel
    Stenzl, Arnulf
    Neuhaus, Jochen
    Rahnenfuehrer, Joerg
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [49] An Empirical Study on the Performance of Rule-Based Classification by Feature Selection
    Balakrishnan, Sarojini
    Babu, M. R.
    Krishna, P. V.
    2014 WORLD CONGRESS ON COMPUTING AND COMMUNICATION TECHNOLOGIES (WCCCT 2014), 2014, : 147 - +
  • [50] Fall recognition system using feature selection and SVM: an empirical study
    Maldonado-Mendez, Carolina
    Hernandez-Mendez, Sergio
    2019 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND COMPUTERS (CONIELECOMP), 2019, : 187 - 192