ConfusionVis: Comparative evaluation and selection of multi-class classifiers based on confusion matrices

被引：35

作者：

Theissler, Andreas ^{[1
]}

Thomas, Mark ^{[2
]}

Burch, Michael ^{[3
]}

Gerschner, Felix ^{[1
]}

机构：

[1] Aalen Univ Appl Sci, Aalen, Germany

[2] Dalhousie Univ, Fac Comp Sci, Halifax, NS, Canada

[3] Univ Appl Sci Grisons FHGR, Graubunden, Switzerland

来源：

KNOWLEDGE-BASED SYSTEMS | 2022年 / 247卷

关键词：

Machine learning; Interpretable machine learning; Classification; Model selection; Species conservation; NEURAL-NETWORKS; FROBENIUS NORM; CLASSIFICATION;

D O I：

10.1016/j.knosys.2022.108651

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In machine learning, the presumably best model is selected from a variety of model candidates generated by testing different model types, hyperparameters, or feature subsets. The advent of deep learning has made model selection even more challenging due to the huge parameter search space. Relying on a single metric to select the best model does not consider class imbalances or the different costs of misclassifications. We argue that incorporating human knowledge to interactively analyse the per-class errors and class confusions over all model candidates enables a more efficient training process and yields better models for given applications. This paper proposes the model-agnostic approach ConfusionVis which allows to comparatively evaluate and select multi-class classifiers based on their confusion matrices. This contributes to making the models' results understandable, while treating the models as black boxes. Therefore, we propose a novel method to measure and visualise distances between confusion matrices and an interactive query interface to incorporate all composition levels of class errors. The approach is evaluated in a user study and the applicability is shown by a case study where marine biologists investigate the conservation efforts of baleen whales by classifying whale species in acoustic recordings. ConfusionVis is available online: https://www.ml-and-vis.org/confusionvis. (c) 2022 The Author(s). Published by Elsevier B.V.

引用

页数：16

共 50 条

[21] Hybrid decision tree and naive Bayes classifiers for multi-class classification tasks
Farid, Dewan Md.
Zhang, Li
Rahman, Chowdhury Mofizur
Hossain, M. A.
Strachan, Rebecca
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (04) : 1937 - 1946
[22] Sample Complexity of Classifiers Taking Values in Q, Application to Multi-Class SVMs
Guermeur, Yann
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2010, 39 (03) : 543 - 557
[23] Tune and mix: learning to rank using ensembles of calibrated multi-class classifiers
Busa-Fekete, Robert
Kegl, Balazs
Elteto, Tamas
Szarvas, Gyoergy
MACHINE LEARNING, 2013, 93 (2-3) : 261 - 292
[24] Multi-class Financial Distress Prediction Based on Feature Selection and Deep Forest Algorithm
Chen, Xiaofang
Mao, Zengli
Wu, Chong
COMPUTATIONAL ECONOMICS, 2024,
[25] The Influence of Multi-class Feature Selection on the Prediction of Diagnostic Phenotypes
Lausser, Ludwig
Szekely, Robin
Schirra, Lyn-Rouven
Kestler, Hans A.
NEURAL PROCESSING LETTERS, 2018, 48 (02) : 863 - 880
[26] Feature selection with kernelized multi-class support vector machine
Guo, Yinan
Zhang, Zirui
Tang, Fengzhen
PATTERN RECOGNITION, 2021, 117
[27] Feature Selection Consideration for Multi-Class Cardiac Arrhythmia Classification
Thanawattano, Chusak
Yingthawornsuk, Thaweesak
INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 1175 - 1178
[28] A semi-hard voting combiner scheme to ensemble multi-class probabilistic classifiers
Delgado, Rosario
APPLIED INTELLIGENCE, 2022, 52 (04) : 3653 - 3677
[29] Hybridized intelligent multi-class classifiers for rockburst risk assessment in deep underground mines
Shirani Faradonbeh, Roohollah
Vaisey, Will
Sharifzadeh, Mostafa
Zhou, Jian
NEURAL COMPUTING & APPLICATIONS, 2024, 36 (04) : 1681 - 1698
[30] Avoiding Time Series Prediction Disbelief with Ensemble Classifiers in Multi-class Problem Spaces
Huk, Maciej
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, PT II, 2022, 13758 : 155 - 166

← 1 2 3 4 5 →