Feature relevance determination for ordinal regression in the context of feature redundancies and privileged information

被引：3

作者：

Pfannschmidt, Lukas ^{[1
]}

Jakob, Jonathan ^{[1
]}

Hinder, Fabian ^{[1
]}

Biehl, Michael ^{[2
]}

Tino, Peter ^{[3
]}

Hammer, Barbara ^{[1
]}

机构：

[1] Bielefeld Univ, Machine Learning Grp, Bielefeld, Germany

[2] Univ Groningen, Intelligent Syst Grp, Groningen, Netherlands

[3] Univ Birmingham, Comp Sci, Birmingham, W Midlands, England

来源：

NEUROCOMPUTING | 2020年 / 416卷

关键词：

Global feature relevance; Feature selection; Interpretability; Ordinal regression; Privileged information; FEATURE-SELECTION; CONSISTENCY;

D O I：

10.1016/j.neucom.2019.12.133

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Advances in machine learning technologies have led to increasingly powerful models in particular in the context of big data. Yet, many application scenarios demand for robustly interpretable models rather than optimum model accuracy; as an example, this is the case if potential biomarkers or causal factors should be discovered based on a set of given measurements. In this contribution, we focus on feature selection paradigms, which enable us to uncover relevant factors of a given regularity based on a sparse model. We focus on the important specific setting of linear ordinal regression, i.e. data have to be ranked into one of a finite number of ordered categories by a linear projection. Unlike previous work, we consider the case that features are potentially redundant, such that no unique minimum set of relevant features exists. We aim for an identification of all strongly and all weakly relevant features as well as their type of relevance (strong or weak); we achieve this goal by determining feature relevance bounds, which correspond to the minimum and maximum feature relevance, respectively, if searched over all equivalent models. In addition, we discuss how this setting enables us to substitute some of the features, e.g. due to their semantics, and how to extend the framework of feature relevance intervals to the setting of privileged information, i.e. potentially relevant information is available for training purposes only, but cannot be used for the prediction itself. (C) 2020 Elsevier B.V. All rights reserved.

引用

页码：266 / 279

页数：14

共 36 条

[31] Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy
Peng, HC
Long, FH
Ding, C
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (08) : 1226 - 1238
[32] Maximum relevance minimum redundancy-based feature selection using rough mutual information in adaptive neighborhood rough sets
Kanglin Qu
Jiucheng Xu
Ziqin Han
Shihui Xu
Applied Intelligence, 2023, 53 : 17727 - 17746
[33] Maximum relevance minimum redundancy-based feature selection using rough mutual information in adaptive neighborhood rough sets
Qu, Kanglin
Xu, Jiucheng
Han, Ziqin
Xu, Shihui
APPLIED INTELLIGENCE, 2023, 53 (14) : 17727 - 17746
[34] Low Redundancy Feature Selection of Short Term Solar Irradiance Prediction Using Conditional Mutual Information and Gauss Process Regression
Huang, Nantian
Li, Ruiqing
Lin, Lin
Yu, Zhiyong
Cai, Guowei
SUSTAINABILITY, 2018, 10 (08)
[35] Joint feature selection and classification using a Bayesian neural network with "automatic relevance determination" priors: Potential use in CAD of medical imaging
Chen, Weijie
Zur, Richard M.
Giger, Maryellen L.
MEDICAL IMAGING 2007: COMPUTER-AIDED DIAGNOSIS, PTS 1 AND 2, 2007, 6514
[36] Maximal Information Coefficient and Support Vector Regression Based Nonlinear Feature Selection and QSAR Modeling on Toxicity of Alcohol Compounds to Tadpoles of Rana temporaria
Wang, Lifeng
Xing, Pengwei
Wang, Cong
Zhou, Xiaomao
Dai, Zhijun
Bai, Lianyang
JOURNAL OF THE BRAZILIAN CHEMICAL SOCIETY, 2019, 30 (02) : 279 - 285

← 1 2 3 4 →