Risk upper bounds for general ensemble methods with an application to multiclass classification

被引:5
|
作者
Laviolette, Francois [1 ]
Morvant, Emilie [2 ]
Ralaivola, Liva [3 ]
Roy, Jean-Francis [1 ,4 ]
机构
[1] Univ Laval, Dept Informat & Genie Logiciel, Quebec City, PQ G1K 7P4, Canada
[2] Univ Lyon, UJM St Etienne, CNRS, IOGS,Lab Hubert Curien UMR 5516, F-42023 St Etienne, France
[3] Aix Marseille Univ, CNRS, Cent Marseille, LIF,QARMA, Marseille, France
[4] Coveo Solut Inc, Quebec City, PQ, Canada
关键词
Majority vote; Ensemble methods; PAC-Bayesian Theory; Multiclass classification; Multilabel Prediction; PAC-BAYESIAN ANALYSIS;
D O I
10.1016/j.neucom.2016.09.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper generalizes a pivotal result from the PAC-Bayesian literature-the-C-bound-primarily designed for binary classification to the general case of ensemble methods of voters with arbitrary outputs. We provide a generic version of the C-bound, an upper bound over the risk of models expressed as a weighted majority vote that is based on the first and second statistical moments of the vote's margin. On the one hand, this bound may advantageously be applied on more complex outputs than mere binary outputs, such as multiclass labels and multilabel, and on the other hand, it allows us to consider margin relaxations. We provide a specialization of the bound to multiclass classification together with empirical evidence that the presented theoretical result is tightly bound to the risk of the majority vote classifier. We also give insights as to how the proposed bound may be of use to characterize the risk of multilabel predictors.
引用
收藏
页码:15 / 25
页数:11
相关论文
共 50 条
  • [1] Blind Multiclass Ensemble Classification
    Traganitis, Panagiotis A.
    Pages-Zamora, Alba
    Giannakis, Georgios B.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (18) : 4737 - 4752
  • [2] A hybrid ensemble for classification in multiclass datasets: An application to oilseed disease dataset
    Chaudhary, Archana
    Kolhe, Savita
    Kamal, Raj
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2016, 124 : 65 - 72
  • [3] On the consistency of multiclass classification methods
    Tewari, Ambuj
    Bartlett, Peter L.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 1007 - 1025
  • [4] Multiclass classification methods in ecology
    Bourel, M.
    Segura, A. M.
    ECOLOGICAL INDICATORS, 2018, 85 : 1012 - 1021
  • [5] New Bounds on the Accuracy of Majority Voting for Multiclass Classification
    Aeeneh, Sina
    Zlatanov, Nikola
    Yu, Jiangshan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [6] Application of Chi-square discretization algorithms to ensemble classification methods
    Peker N.
    Kubat C.
    Expert Systems with Applications, 2021, 185
  • [7] Applying the Multiclass Classification Methods for the Classification of Online Social Network Friends
    Sever, Nikolina
    Humski, Luka
    Ilic, Juraj
    Skocir, Zoran
    Pintar, Damir
    Vranic, Mihaela
    2017 25TH INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS (SOFTCOM), 2017, : 67 - 72
  • [8] Choosing General Gaussian Kernel Parameters for Multiclass Pattern Classification
    Wang, Tinghua
    Zhong, Liyun
    Chen, Junting
    JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2015, 12 (06) : 1045 - 1049
  • [9] Modeling corporate customers' credit risk considering the ensemble approaches in multiclass classification: evidence from Iranian corporate credits
    Vahid, Parastoo Rafiee
    Ahmadi, Abbas
    JOURNAL OF CREDIT RISK, 2016, 12 (03): : 71 - 95
  • [10] Estimating vulnerability metrics with word embedding and multiclass classification methods
    Hakan Kekül
    Burhan Ergen
    Halil Arslan
    International Journal of Information Security, 2024, 23 : 247 - 270