Feature relevance determination for ordinal regression in the context of feature redundancies and privileged information

Cited by: 3
Authors
Pfannschmidt, Lukas [1]
Jakob, Jonathan [1]
Hinder, Fabian [1]
Biehl, Michael [2]
Tino, Peter [3]
Hammer, Barbara [1]
Affiliations
[1] Bielefeld Univ, Machine Learning Grp, Bielefeld, Germany
[2] Univ Groningen, Intelligent Syst Grp, Groningen, Netherlands
[3] Univ Birmingham, Comp Sci, Birmingham, W Midlands, England
Keywords
Global feature relevance; Feature selection; Interpretability; Ordinal regression; Privileged information; FEATURE-SELECTION; CONSISTENCY;
DOI
10.1016/j.neucom.2019.12.133
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Advances in machine learning technologies have led to increasingly powerful models, particularly in the context of big data. Yet, many application scenarios demand robustly interpretable models rather than optimum model accuracy; as an example, this is the case if potential biomarkers or causal factors should be discovered based on a set of given measurements. In this contribution, we focus on feature selection paradigms, which enable us to uncover relevant factors of a given regularity based on a sparse model. We focus on the important specific setting of linear ordinal regression, i.e. data have to be ranked into one of a finite number of ordered categories by a linear projection. Unlike previous work, we consider the case that features are potentially redundant, such that no unique minimum set of relevant features exists. We aim for an identification of all strongly and all weakly relevant features as well as their type of relevance (strong or weak); we achieve this goal by determining feature relevance bounds, which correspond to the minimum and maximum feature relevance, respectively, if searched over all equivalent models. In addition, we discuss how this setting enables us to substitute some of the features, e.g. due to their semantics, and how to extend the framework of feature relevance intervals to the setting of privileged information, i.e. potentially relevant information is available for training purposes only, but cannot be used for the prediction itself. © 2020 Elsevier B.V. All rights reserved.
Pages: 266 - 279
Page count: 14
Related papers
36 in total
  • [21] Conditional mutual information-based feature selection algorithm for maximal relevance minimal redundancy
    Gu, Xiangyuan
    Guo, Jichang
    Xiao, Lijun
    Li, Chongyi
    APPLIED INTELLIGENCE, 2022, 52 (02) : 1436 - 1447
  • [22] Determination of feature relevance for the grouping of motor unit action potentials through a generative mixture model
    Vellido, Alfredo
    Andrade, Adriano O.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2007, 2 (02) : 111 - 121
  • [23] An efficient method for feature selection in linear regression based on an extended Akaike’s information criterion
    D. P. Vetrov
    D. A. Kropotov
    N. O. Ptashko
    Computational Mathematics and Mathematical Physics, 2009, 49 : 1972 - 1985
  • [24] An efficient method for feature selection in linear regression based on an extended Akaike's information criterion
    Vetrov, D. P.
    Kropotov, D. A.
    Ptashko, N. O.
    COMPUTATIONAL MATHEMATICS AND MATHEMATICAL PHYSICS, 2009, 49 (11) : 1972 - 1985
  • [25] Fuzzy Mutual Information Based min-Redundancy and Max-Relevance Heterogeneous Feature Selection
    Yu, Daren
    An, Shuang
    Hu, Qinghua
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2011, 4 (04) : 619 - 633
  • [26] Fuzzy Mutual Information Based min-Redundancy and Max-Relevance Heterogeneous Feature Selection
    Yu D.
    An S.
    Hu Q.
    International Journal of Computational Intelligence Systems, 2011, 4 (4) : 619 - 633
  • [27] Variable Weighted Maximal Relevance Minimal Redundancy Criterion for Feature Selection Using Normalized Mutual Information
    Bandyopadhyay, Sanghamitra
    Bhadra, Tapas
    Maulik, Ujjwal
    JOURNAL OF MULTIPLE-VALUED LOGIC AND SOFT COMPUTING, 2015, 25 (2-3) : 189 - 213
  • [28] Mutual Information with Parameter Determination Approach for Feature Selection in Multivariate Time Series Prediction
    Liu, Tianhong
    Wei, Haikun
    Zhang, Chi
    Zhang, Kanjian
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2016, 2016, 629 : 227 - 237
  • [29] Feature Selection Fusion (FSF) for Aggregating Relevance Ranking Information with Application to ZigBee Radio Frequency Device Identification
    Bihl, Trevor J.
    Temple, Michael A.
    Bauer, Kenneth W., Jr.
    PROCEEDINGS OF THE 2016 IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON) AND OHIO INNOVATION SUMMIT (OIS), 2016, : 80 - 87
  • [30] ASSESSMENT OF FEATURE SELECTION AND CLASSIFICATION APPROACHES TO ENHANCE INFORMATION FROM OVERNIGHT OXIMETRY IN THE CONTEXT OF APNEA DIAGNOSIS
    Alvarez, Daniel
    Hornero, Roberto
    Victor Marcos, J.
    Wessel, Niels
    Penzel, Thomas
    Glos, Martin
    Del Campo, Felix
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2013, 23 (05)