Deep F-Measure Maximization in Multi-label Classification: A Comparative Study

被引:2
作者
Decubber, Stijn [1 ,2 ]
Mortier, Thomas [2 ]
Dembczynski, Krzysztof [3 ]
Waegeman, Willem [2 ]
机构
[1] ML6, Esplanade Oscar Van De Voorde 1, B-9000 Ghent, Belgium
[2] Univ Ghent, Dept Data Anal & Math Modelling, Coupure Links 653, B-9000 Ghent, Belgium
[3] Poznan Univ Tech, Inst Comp Sci, Piotrowo 2, PL-60965 Poznan, Poland
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT I | 2019年 / 11051卷
关键词
F-beta-measure; Bayes optimal classification; Multi-label image classification; Convolutional neural networks;
D O I
10.1007/978-3-030-10925-7_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years several novel algorithms have been developed for maximizing the instance-wise F-beta-measure in multi-label classification problems. However, so far, such algorithms have only been tested in tandem with shallow base learners. In the deep learning landscape, usually simple thresholding approaches are implemented, even though it is expected that such approaches are suboptimal. In this article we introduce extensions of utility maximization and decision-theoretic methods that can optimize the F-beta-measure with (convolutional) neural networks. We discuss pros and cons of the different methods and we present experimental results on several image classification datasets. The results illustrate that decision-theoretic inference algorithms are worth the investment. While being more difficult to implement compared to thresholding strategies, they lead to a better predictive performance. Overall, a decision-theoretic inference algorithm based on proportional odds models outperforms the other methods. Code related to this paper is available at: https://github.com/sdcubber/f-measure.
引用
收藏
页码:290 / 305
页数:16
相关论文
共 29 条
  • [1] Abadi M., 2015, TENSORFLOW LARGE SCA, DOI DOI 10.48550/ARXIV.1603.04467
  • [2] AGRESTI A., 2013, Categorical data analysis, VThird
  • [3] Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)
  • [4] Caetano, 2011, ADV NEURAL INFORM PR, V25
  • [5] Caetano, 2010, ADV NEURAL INFORM PR, V24
  • [6] Chai K, 2005, P INT ACM C RES DEV
  • [7] Chapelle, 2007, ADV NEURAL INFORM PR, V19
  • [8] Combining instance-based learning and logistic regression for multilabel classification
    Cheng, Weiwei
    Huellermeier, Eyke
    [J]. MACHINE LEARNING, 2009, 76 (2-3) : 211 - 225
  • [9] Chollet F., 2015, Keras
  • [10] Dembczynski, 2011, ADV NEURAL INFORM PR, V25