Machine learning models, epistemic set-valued data and generalized loss functions: An encompassing approach

被引:17
|
作者
Couso, Ines [1 ]
Sanchez, Luciano [2 ]
机构
[1] Univ Oviedo, Dept Stat & Operat Res, Oviedo, Spain
[2] Univ Oviedo, Dept Comp Sci, Oviedo, Spain
关键词
Regression; Classification; Loss function; Generalized stochastic ordering; Set-valued data; Low-quality data; FUZZY RULES; IMPRECISE;
D O I
10.1016/j.ins.2016.04.016
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study those problems where the goal is to find "optimal" models with respect to some specific criterion, in regression and supervised classification problems. Alternatives to the usual expected loss minimization criterion are proposed, and a general framework where this criterion can be seen as a particular instance of a general family of criteria is provided. In the new setting, each model is formally identified with a random variable that associates a loss value to each individual in the population. Based on this identification, different stochastic orderings between random variables lead to different criteria to compare pairs of models. Our general setting encompasses the classical criterion based on the minimization of the expected loss, but also other criteria where a numerical loss function is not available, and therefore the computation of its expectation does not make sense. The presentation of the new framework is divided into two stages. First, we consider the new framework under standard situations about the sample information, where both the collection of attributes and the response variables are observed with precision. Then, we assume that just incomplete information about them (expressed in terms of set-valued data sets) is provided. We cast some comparison criteria from the recent literature on learning methods from low-quality data as particular instances of our general approach. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:129 / 150
页数:22
相关论文
共 50 条
  • [21] Approximation of Some Classes of Set-Valued Periodic Functions by Generalized Trigonometric Polynomials
    V. F. Babenko
    V. V. Babenko
    M. V. Polishchuk
    Ukrainian Mathematical Journal, 2016, 68 : 502 - 514
  • [22] Lipschitz properties of nonsmooth functions and set-valued mappings via generalized differentiation
    Nguyen Mau Nam
    Lafferriere, Gerardo
    NONLINEAR ANALYSIS-THEORY METHODS & APPLICATIONS, 2013, 89 : 110 - 120
  • [23] A Set-Valued Analysis Approach to Second Order Differentiation of Nonsmooth Functions
    Miguel Sama
    Set-Valued and Variational Analysis, 2009, 17 : 41 - 61
  • [24] A Set-Valued Analysis Approach to Second Order Differentiation of Nonsmooth Functions
    Sama, Miguel
    SET-VALUED AND VARIATIONAL ANALYSIS, 2009, 17 (01) : 40 - 60
  • [25] Generalized robust loss functions for machine learning
    Fu, Saiji
    Wang, Xiaoxiao
    Tang, Jingjing
    Lan, Shulin
    Tian, Yingjie
    NEURAL NETWORKS, 2024, 171 : 200 - 214
  • [26] A topological approach for vector quasi-variational inequalities with set-valued functions
    Ratna Dev Sonia
    Computational Management Science, 2023, 20
  • [27] A topological approach for vector quasi-variational inequalities with set-valued functions
    Sonia
    Sarma, Ratna Dev
    COMPUTATIONAL MANAGEMENT SCIENCE, 2023, 20 (01)
  • [28] Gap functions for a system of generalized vector quasi-equilibrium problems with set-valued mappings
    Huang, Nan-Jing
    Li, Jun
    Wu, Soon-yi
    JOURNAL OF GLOBAL OPTIMIZATION, 2008, 41 (03) : 401 - 415
  • [29] Gap functions for a system of generalized vector quasi-equilibrium problems with set-valued mappings
    Nan-jing Huang
    Jun Li
    Soon-yi Wu
    Journal of Global Optimization, 2008, 41 : 401 - 415
  • [30] Separability of set-valued data sets and existence of support hyperplanes in the support function machine
    Chen, Jiqiang
    Xue, Xiaoping
    Ma, Litao
    Ha, Minghu
    INFORMATION SCIENCES, 2018, 430 : 432 - 443