ConfusionVis: Comparative evaluation and selection of multi-class classifiers based on confusion matrices

被引:35
|
作者
Theissler, Andreas [1 ]
Thomas, Mark [2 ]
Burch, Michael [3 ]
Gerschner, Felix [1 ]
机构
[1] Aalen Univ Appl Sci, Aalen, Germany
[2] Dalhousie Univ, Fac Comp Sci, Halifax, NS, Canada
[3] Univ Appl Sci Grisons FHGR, Graubunden, Switzerland
关键词
Machine learning; Interpretable machine learning; Classification; Model selection; Species conservation; NEURAL-NETWORKS; FROBENIUS NORM; CLASSIFICATION;
D O I
10.1016/j.knosys.2022.108651
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In machine learning, the presumably best model is selected from a variety of model candidates generated by testing different model types, hyperparameters, or feature subsets. The advent of deep learning has made model selection even more challenging due to the huge parameter search space. Relying on a single metric to select the best model does not consider class imbalances or the different costs of misclassifications. We argue that incorporating human knowledge to interactively analyse the per-class errors and class confusions over all model candidates enables a more efficient training process and yields better models for given applications. This paper proposes the model-agnostic approach ConfusionVis which allows to comparatively evaluate and select multi-class classifiers based on their confusion matrices. This contributes to making the models' results understandable, while treating the models as black boxes. Therefore, we propose a novel method to measure and visualise distances between confusion matrices and an interactive query interface to incorporate all composition levels of class errors. The approach is evaluated in a user study and the applicability is shown by a case study where marine biologists investigate the conservation efforts of baleen whales by classifying whale species in acoustic recordings. ConfusionVis is available online: https://www.ml-and-vis.org/confusionvis. (c) 2022 The Author(s). Published by Elsevier B.V.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Multi-Task EEG Signal Classification Using Correlation-Based IMF Selection and Multi-Class CSP
    Alizadeh, N.
    Afrakhteh, S.
    Mosavi, M. R.
    IEEE ACCESS, 2023, 11 : 52712 - 52725
  • [42] DWT and CNN based multi-class motor imagery electroencephalographic signal recognition
    Ma, Xunguang
    Wang, Dashuai
    Liu, Danhua
    Yang, Jimin
    JOURNAL OF NEURAL ENGINEERING, 2020, 17 (01)
  • [43] A Pareto-based Ensemble with Feature and Instance Selection for Learning from Multi-Class Imbalanced Datasets
    Fernandez, Alberto
    Jose Carmona, Cristobal
    Jose del Jesus, Maria
    Herrera, Francisco
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2017, 27 (06)
  • [44] Topological embedding and directional feature importance in ensemble classifiers for multi-class classification
    Liedl, Eloisa Rocha
    Yassin, Shabeer Mohamed
    Kasapi, Melpomeni
    Posma, Joram M.
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 4108 - 4123
  • [45] Multi-class and feature selection extensions of Roughly Balanced Bagging for imbalanced data
    Lango, Mateusz
    Stefanowski, Jerzy
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2018, 50 (01) : 97 - 127
  • [46] Medical image retrieval with probabilistic multi-class support vector machine classifiers and adaptive similarity fusion
    Rahman, Md. Mahmudur
    Desai, Bipin C.
    Bhattacharya, Prabir
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2008, 32 (02) : 95 - 108
  • [47] Multi-class change detection of remote sensing images based on class rebalancing
    Tang, Huakang
    Wang, Honglei
    Zhang, Xiaoping
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2022, 15 (01) : 1377 - 1394
  • [48] Information theoretic approach for performance evaluation of multi-class assignment systems
    Holt, Ryan S.
    Mastromarino, Peter A.
    Kao, Edward K.
    Hurley, Michael B.
    SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION XIX, 2010, 7697
  • [49] Dynamic ensemble selection for multi -class classification with one-class classifiers
    Krawczyk, Bartosz
    Galar, Mikel
    Wozniak, Michal
    Bustince, Humberto
    Herrera, Francisco
    PATTERN RECOGNITION, 2018, 83 : 34 - 51
  • [50] Multi-Class SVM Based Gradient Feature for Banknote Recognition
    Dittimi, Tamarafinide V.
    Hmood, Ali K.
    Suen, Ching Y.
    2017 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2017, : 1030 - 1035