ConfusionVis: Comparative evaluation and selection of multi-class classifiers based on confusion matrices

被引:35
|
作者
Theissler, Andreas [1 ]
Thomas, Mark [2 ]
Burch, Michael [3 ]
Gerschner, Felix [1 ]
机构
[1] Aalen Univ Appl Sci, Aalen, Germany
[2] Dalhousie Univ, Fac Comp Sci, Halifax, NS, Canada
[3] Univ Appl Sci Grisons FHGR, Graubunden, Switzerland
关键词
Machine learning; Interpretable machine learning; Classification; Model selection; Species conservation; NEURAL-NETWORKS; FROBENIUS NORM; CLASSIFICATION;
D O I
10.1016/j.knosys.2022.108651
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In machine learning, the presumably best model is selected from a variety of model candidates generated by testing different model types, hyperparameters, or feature subsets. The advent of deep learning has made model selection even more challenging due to the huge parameter search space. Relying on a single metric to select the best model does not consider class imbalances or the different costs of misclassifications. We argue that incorporating human knowledge to interactively analyse the per-class errors and class confusions over all model candidates enables a more efficient training process and yields better models for given applications. This paper proposes the model-agnostic approach ConfusionVis which allows to comparatively evaluate and select multi-class classifiers based on their confusion matrices. This contributes to making the models' results understandable, while treating the models as black boxes. Therefore, we propose a novel method to measure and visualise distances between confusion matrices and an interactive query interface to incorporate all composition levels of class errors. The approach is evaluated in a user study and the applicability is shown by a case study where marine biologists investigate the conservation efforts of baleen whales by classifying whale species in acoustic recordings. ConfusionVis is available online: https://www.ml-and-vis.org/confusionvis. (c) 2022 The Author(s). Published by Elsevier B.V.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Hybrid decision tree and naive Bayes classifiers for multi-class classification tasks
    Farid, Dewan Md.
    Zhang, Li
    Rahman, Chowdhury Mofizur
    Hossain, M. A.
    Strachan, Rebecca
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (04) : 1937 - 1946
  • [22] Sample Complexity of Classifiers Taking Values in Q, Application to Multi-Class SVMs
    Guermeur, Yann
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2010, 39 (03) : 543 - 557
  • [23] Tune and mix: learning to rank using ensembles of calibrated multi-class classifiers
    Busa-Fekete, Robert
    Kegl, Balazs
    Elteto, Tamas
    Szarvas, Gyoergy
    MACHINE LEARNING, 2013, 93 (2-3) : 261 - 292
  • [24] Multi-class Financial Distress Prediction Based on Feature Selection and Deep Forest Algorithm
    Chen, Xiaofang
    Mao, Zengli
    Wu, Chong
    COMPUTATIONAL ECONOMICS, 2024,
  • [25] The Influence of Multi-class Feature Selection on the Prediction of Diagnostic Phenotypes
    Lausser, Ludwig
    Szekely, Robin
    Schirra, Lyn-Rouven
    Kestler, Hans A.
    NEURAL PROCESSING LETTERS, 2018, 48 (02) : 863 - 880
  • [26] Feature selection with kernelized multi-class support vector machine
    Guo, Yinan
    Zhang, Zirui
    Tang, Fengzhen
    PATTERN RECOGNITION, 2021, 117
  • [27] Feature Selection Consideration for Multi-Class Cardiac Arrhythmia Classification
    Thanawattano, Chusak
    Yingthawornsuk, Thaweesak
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 1175 - 1178
  • [28] A semi-hard voting combiner scheme to ensemble multi-class probabilistic classifiers
    Delgado, Rosario
    APPLIED INTELLIGENCE, 2022, 52 (04) : 3653 - 3677
  • [29] Hybridized intelligent multi-class classifiers for rockburst risk assessment in deep underground mines
    Shirani Faradonbeh, Roohollah
    Vaisey, Will
    Sharifzadeh, Mostafa
    Zhou, Jian
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (04) : 1681 - 1698
  • [30] Avoiding Time Series Prediction Disbelief with Ensemble Classifiers in Multi-class Problem Spaces
    Huk, Maciej
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, PT II, 2022, 13758 : 155 - 166