Risk upper bounds for general ensemble methods with an application to multiclass classification

Cited by: 5
Authors
Laviolette, Francois [1]
Morvant, Emilie [2]
Ralaivola, Liva [3]
Roy, Jean-Francis [1, 4]
Affiliations
[1] Univ Laval, Dept Informat & Genie Logiciel, Quebec City, PQ G1K 7P4, Canada
[2] Univ Lyon, UJM St Etienne, CNRS, IOGS,Lab Hubert Curien UMR 5516, F-42023 St Etienne, France
[3] Aix Marseille Univ, CNRS, Cent Marseille, LIF,QARMA, Marseille, France
[4] Coveo Solut Inc, Quebec City, PQ, Canada
Keywords
Majority vote; Ensemble methods; PAC-Bayesian Theory; Multiclass classification; Multilabel Prediction; PAC-BAYESIAN ANALYSIS;
DOI
10.1016/j.neucom.2016.09.016
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper generalizes a pivotal result from the PAC-Bayesian literature, the C-bound, primarily designed for binary classification, to the general case of ensemble methods of voters with arbitrary outputs. We provide a generic version of the C-bound, an upper bound over the risk of models expressed as a weighted majority vote that is based on the first and second statistical moments of the vote's margin. On the one hand, this bound may advantageously be applied to more complex outputs than mere binary outputs, such as multiclass labels and multilabel outputs; on the other hand, it allows us to consider margin relaxations. We provide a specialization of the bound to multiclass classification together with empirical evidence that the presented theoretical result is tightly bound to the risk of the majority vote classifier. We also give insights as to how the proposed bound may be of use to characterize the risk of multilabel predictors.
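As a rough illustration of the kind of quantity the abstract describes, the sketch below computes the classical binary C-bound from earlier PAC-Bayesian work, 1 - (E[M])^2 / E[M^2], where M is the margin of a weighted majority vote. It is not the generalized bound of this paper. The voters, weights, labels, and function names are hypothetical, chosen only to make the example self-contained.

```python
# Minimal sketch of the binary C-bound (assumption: voters output -1 or +1,
# weights are non-negative and sum to 1). All data below is made up.

def margins(votes, weights, labels):
    """Return y * sum_i q_i * h_i(x) for each example.

    votes[i][j] is the output of voter i on example j.
    """
    return [
        y * sum(q * h[j] for q, h in zip(weights, votes))
        for j, y in enumerate(labels)
    ]


def c_bound(ms):
    """Bound on the majority vote risk from the first two margin moments."""
    mu1 = sum(ms) / len(ms)                 # first moment of the margin
    mu2 = sum(m * m for m in ms) / len(ms)  # second moment of the margin
    if mu1 <= 0:
        raise ValueError("the C-bound requires a positive mean margin")
    return 1.0 - (mu1 ** 2) / mu2


def majority_vote_risk(ms):
    """Fraction of examples on which the weighted vote errs (margin <= 0)."""
    return sum(1 for m in ms if m <= 0) / len(ms)


# Hypothetical toy sample: 3 voters, 6 examples.
labels = [1, 1, -1, -1, 1, -1]
votes = [
    [1, 1, -1, -1, -1, -1],
    [1, -1, -1, 1, -1, -1],
    [-1, 1, -1, -1, 1, 1],
]
weights = [0.5, 0.3, 0.2]

ms = margins(votes, weights, labels)
bound = c_bound(ms)
risk = majority_vote_risk(ms)
print(f"empirical risk = {risk:.3f}, C-bound = {bound:.3f}")
```

The inequality between the two printed values follows from the Cantelli-Chebyshev inequality applied to the margin distribution, which is why only the first two moments are needed.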
Pages: 15-25
Number of pages: 11
Related papers
50 items in total
  • [41] A new feature selection approach based on ensemble methods in semi-supervised classification
    Nesma Settouti
    Mohamed Amine Chikh
    Vincent Barra
    Pattern Analysis and Applications, 2017, 20 : 673 - 686
  • [42] A new feature selection approach based on ensemble methods in semi-supervised classification
    Settouti, Nesma
    Chikh, Mohamed Amine
    Barra, Vincent
    PATTERN ANALYSIS AND APPLICATIONS, 2017, 20 (03) : 673 - 686
  • [43] More Accurate Diagnosis in Electric Power Apparatus Conditions Using Ensemble Classification Methods
    Hirose, Hideo
    Zaman, Faisal
    IEEE TRANSACTIONS ON DIELECTRICS AND ELECTRICAL INSULATION, 2011, 18 (05) : 1584 - 1590
  • [44] Application of penalized linear regression and ensemble methods for drought forecasting in Northeast China
    Zeng Li
    Taotao Chen
    Qi Wu
    Guimin Xia
    Daocai Chi
    Meteorology and Atmospheric Physics, 2020, 132 : 113 - 130
  • [45] Application of penalized linear regression and ensemble methods for drought forecasting in Northeast China
    Li, Zeng
    Chen, Taotao
    Wu, Qi
    Xia, Guimin
    Chi, Daocai
    METEOROLOGY AND ATMOSPHERIC PHYSICS, 2020, 132 (01) : 113 - 130
  • [46] Analysis Accuracy of XGBoost Model for Multiclass Classification - A Case Study of Applicant Level Risk Prediction for Life Insurance
    Mustika, Widya Fajar
    Murfi, Hendri
    Widyaningsih, Yekti
    2019 5TH INTERNATIONAL CONFERENCE ON SCIENCE IN INFORMATION TECHNOLOGY (ICSITECH): EMBRACING INDUSTRY 4.0 - TOWARDS INNOVATION IN CYBER PHYSICAL SYSTEM, 2019, : 71 - 77
  • [47] Human Activity Recognition for Multi-label Classification in Smart Homes Using Ensemble Methods
    Kasubi, John W.
    Huchaiah, Manjaiah D.
    ARTIFICIAL INTELLIGENCE AND SUSTAINABLE COMPUTING FOR SMART CITY, AIS2C2 2021, 2021, 1434 : 282 - 294
  • [48] An extension of the type-1 and singleton fuzzy logic system trained by scaled conjugate gradient methods for multiclass classification problems
    Finotti Amaral, Renan P.
    Menezes, Ivan F. M.
    Ribeiro, Moises V.
    NEUROCOMPUTING, 2020, 411 (411) : 149 - 163
  • [49] Assessing the Risk of Groundwater Pollution in Northern Algeria through the Evaluation of Influencing Parameters and Ensemble Methods
    Salah Eddine Tachi
    Hamza Bouguerra
    Meroua Djellal
    Ouassim Benaroussi
    Abdelhakim Belaroui
    Bartosz Łozowski
    Maria Augustyniak
    Saâdia Benmamar
    Salim Benziada
    Andrzej Woźnica
    Doklady Earth Sciences, 2023, 513 : 1233 - 1243
  • [50] Assessing the Risk of Groundwater Pollution in Northern Algeria through the Evaluation of Influencing Parameters and Ensemble Methods
    Tachi, Salah Eddine
    Bouguerra, Hamza
    Djellal, Meroua
    Benaroussi, Ouassim
    Belaroui, Abdelhakim
    Lozowski, Bartosz
    Augustyniak, Maria
    Benmamar, Saadia
    Benziada, Salim
    Woznica, Andrzej
    DOKLADY EARTH SCIENCES, 2023, 513 (01) : 1233 - 1243