ConfusionFlow: A Model-Agnostic Visualization for Temporal Analysis of Classifier Confusion

被引:25
作者
Hinterreiter, Andreas [1 ,2 ]
Ruch, Peter [3 ]
Stitz, Holger [1 ,5 ]
Ennemoser, Martin [6 ]
Bernard, Jurgen [8 ]
Strobelt, Hendrik [7 ]
Streit, Marc [4 ]
机构
[1] Johannes Kepler Univ Linz, Inst Comp Graph, A-4040 Linz, Austria
[2] Imperial Coll London, Biomed Image Anal Grp, London SW7 2AZ, England
[3] Johannes Kepler Univ Linz, Inst Machine Learning, A-4040 Linz, Austria
[4] Johannes Kepler Univ Linz, Visual Data Sci, A-4040 Linz, Austria
[5] Datavisyn GmbH, A-4040 Linz, Austria
[6] Salesbeat GmbH, A-4060 Leonding, Austria
[7] IBM Res, Cambridge, MA 02142 USA
[8] Univ British Columbia, Vancouver, BC V6T 1Z4, Canada
基金
奥地利科学基金会;
关键词
Task analysis; Analytical models; Data models; Training; Tools; Adaptation models; Data visualization; Classification; performance analysis; time series visualization; machine learning; information visualization; quality assessment; VISUAL ANALYTICS;
D O I
10.1109/TVCG.2020.3012063
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Classifiers are among the most widely used supervised machine learning algorithms. Many classification models exist, and choosing the right one for a given task is difficult. During model selection and debugging, data scientists need to assess classifiers' performances, evaluate their learning behavior over time, and compare different models. Typically, this analysis is based on single-number performance measures such as accuracy. A more detailed evaluation of classifiers is possible by inspecting class errors. The confusion matrix is an established way for visualizing these class errors, but it was not designed with temporal or comparative analysis in mind. More generally, established performance analysis systems do not allow a combined temporal and comparative analysis of class-level information. To address this issue, we propose ConfusionFlow, an interactive, comparative visualization tool that combines the benefits of class confusion matrices with the visualization of performance characteristics over time. ConfusionFlow is model-agnostic and can be used to compare performances for different model types, model architectures, and/or training and test datasets. We demonstrate the usefulness of ConfusionFlow in a case study on instance selection strategies in active learning. We further assess the scalability of ConfusionFlow and present a use case in the context of neural network pruning.
引用
收藏
页码:1222 / 1236
页数:15
相关论文
共 63 条
[1]  
Abadi M., TENSORFLOW LARGE SCA
[2]  
Aigner W, 2011, HUM-COMPUT INT-SPRIN, P1, DOI 10.1007/978-0-85729-079-3
[3]   Do Convolutional Neural Networks Learn Class Hierarchy? [J].
Alsallakh, Bilal ;
Jourabloo, Amin ;
Ye, Mao ;
Liu, Xiaoming ;
Ren, Liu .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2018, 24 (01) :152-162
[4]   Visual Methods for Analyzing Probabilistic Classification Data [J].
Alsallakh, Bilal ;
Hanbury, Allan ;
Hauser, Helwig ;
Miksch, Silvia ;
Rauber, Andreas .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2014, 20 (12) :1703-1712
[5]  
Andrienko G., 2006, EXPLORATORY ANAL SPA
[6]  
[Anonymous], 2017, arXiv preprint arXiv:1710.05285
[7]  
[Anonymous], 2018, ARXIV180600451
[8]  
Bernard J., 2015, THESIS TU DARMSTADT
[9]   Visual-Interactive Preprocessing of Multivariate Time Series Data [J].
Bernard, Juergen ;
Hutter, Marco ;
Reinemuth, Heiko ;
Pfeifer, Hendrik ;
Bors, Christian ;
Kohlhammer, Joern .
COMPUTER GRAPHICS FORUM, 2019, 38 (03) :401-412
[10]   VIAL: a unified process for visual interactive labeling [J].
Bernard, Juergen ;
Zeppelzauer, Matthias ;
Sedlmair, Michael ;
Aigner, Wolfgang .
VISUAL COMPUTER, 2018, 34 (09) :1189-1207