ConfusionVis: Comparative evaluation and selection of multi-class classifiers based on confusion matrices

被引:35
|
作者
Theissler, Andreas [1 ]
Thomas, Mark [2 ]
Burch, Michael [3 ]
Gerschner, Felix [1 ]
机构
[1] Aalen Univ Appl Sci, Aalen, Germany
[2] Dalhousie Univ, Fac Comp Sci, Halifax, NS, Canada
[3] Univ Appl Sci Grisons FHGR, Graubunden, Switzerland
关键词
Machine learning; Interpretable machine learning; Classification; Model selection; Species conservation; NEURAL-NETWORKS; FROBENIUS NORM; CLASSIFICATION;
D O I
10.1016/j.knosys.2022.108651
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In machine learning, the presumably best model is selected from a variety of model candidates generated by testing different model types, hyperparameters, or feature subsets. The advent of deep learning has made model selection even more challenging due to the huge parameter search space. Relying on a single metric to select the best model does not consider class imbalances or the different costs of misclassifications. We argue that incorporating human knowledge to interactively analyse the per-class errors and class confusions over all model candidates enables a more efficient training process and yields better models for given applications. This paper proposes the model-agnostic approach ConfusionVis which allows to comparatively evaluate and select multi-class classifiers based on their confusion matrices. This contributes to making the models' results understandable, while treating the models as black boxes. Therefore, we propose a novel method to measure and visualise distances between confusion matrices and an interactive query interface to incorporate all composition levels of class errors. The approach is evaluated in a user study and the applicability is shown by a case study where marine biologists investigate the conservation efforts of baleen whales by classifying whale species in acoustic recordings. ConfusionVis is available online: https://www.ml-and-vis.org/confusionvis. (c) 2022 The Author(s). Published by Elsevier B.V.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Efficient Decomposition Selection for Multi-class Classification
    Chen, Yawen
    Wen, Zeyi
    He, Bingsheng
    Chen, Jian
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 3751 - 3764
  • [32] Extended adaptive Lasso for multi-class and multi-label feature selection
    Chen, Si-Bao
    Zhang, Yu-Mei
    Ding, Chris H. Q.
    Zhang, Jian
    Luo, Bin
    KNOWLEDGE-BASED SYSTEMS, 2019, 173 : 28 - 36
  • [33] A Multi-Class BCI Based on Somatosensory Imagery
    Yao, Lin
    Mrachacz-Kersting, Natalie
    Sheng, Xinjun
    Zhu, Xiangyang
    Farina, Dario
    Jiang, Ning
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2018, 26 (08) : 1508 - 1515
  • [34] Multi-Class Feature Selection Using Pairwise-class and All-class Techniques
    Chen, Bo
    Li, Guo-Zheng
    You, Mingyu
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2010, : 644 - 647
  • [35] Greedy hierarchical binary classifiers for multi-class classification of biological data
    Salma Begum
    Ramazan S. Aygun
    Network Modeling Analysis in Health Informatics and Bioinformatics, 2014, 3 (1)
  • [36] Fuzzy Integral Combination of One-Class Classifiers Designed for Multi-class Classification
    Hadjadji, Bilal
    Chibani, Youcef
    Nemmour, Hassiba
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2014, PT I, 2014, 8814 : 320 - 328
  • [37] Feature selection and its combination with data over-sampling for multi-class imbalanced datasets
    Tsai, Chih-Fong
    Chen, Kuan-Chen
    Lin, Wei -Chao
    APPLIED SOFT COMPUTING, 2024, 153
  • [38] Feature selection for multi-class problems by using pairwise-class and all-class techniques
    You, Mingyu
    Li, Guo-Zheng
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2011, 40 (04) : 381 - 394
  • [39] CLASSIFICATION OF LIDAR DATA BASED ON MULTI-CLASS SVM
    Samadzadegan, F.
    Bigdeli, B.
    Ramzi, P.
    2010 CANADIAN GEOMATICS CONFERENCE AND SYMPOSIUM OF COMMISSION I, ISPRS CONVERGENCE IN GEOMATICS - SHAPING CANADA'S COMPETITIVE LANDSCAPE, 2010, 38
  • [40] MCAR: Multi-class Classification based on Association Rule
    Thabtah, Fadi
    Cowling, Peter
    Peng, Yonghong
    3RD ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, 2005, 2005,