Learning-From-Disagreement: A Model Comparison and Visual Analytics Framework

被引:3
作者
Wang, Junpeng [1 ]
Wang, Liang [1 ]
Zheng, Yan [1 ]
Yeh, Chin-Chia Michael [1 ]
Jain, Shubham [1 ]
Zhang, Wei [1 ]
机构
[1] Visa Res, Palo Alto, CA 94301 USA
关键词
Learning from disagreement; model comparison; feature visualization; visual analytics; explainable AI; CLASSIFICATION;
D O I
10.1109/TVCG.2022.3172107
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the fast-growing number of classification models being produced every day, numerous model interpretation and comparison solutions have also been introduced. For example, LIME [1] and SHAP [2] can interpret what input features contribute more to a classifier's output predictions. Different numerical metrics (e.g., accuracy) can be used to easily compare two classifiers. However, few works can interpret the contribution of a data feature to a classifier in comparison with its contribution to another classifier. This comparative interpretation can help to disclose the fundamental difference between two classifiers, select classifiers in different feature conditions, and better ensemble two classifiers. To accomplish it, we propose a learning-from-disagreement (LFD) framework to visually compare two classification models. Specifically, LFD identifies data instances with disagreed predictions from two compared classifiers and trains a discriminator to learn from the disagreed instances. As the two classifiers' training features may not be available, we train the discriminator through a set of meta-features proposed based on certain hypotheses of the classifiers to probe their behaviors. Interpreting the trained discriminator with the SHAP values of different meta-features, we provide actionable insights into the compared classifiers. Also, we introduce multiple metrics to profile the importance of meta-features from different perspectives. With these metrics, one can easily identify meta-features with the most complementary behaviors in two classifiers, and use them to better ensemble the classifiers. We focus on binary classification models in the financial services and advertising industry to demonstrate the efficacy of our proposed framework and visualizations.
引用
收藏
页码:3809 / 3825
页数:17
相关论文
共 56 条
[41]   LSTMVis: A Tool for Visual Analysis of Hidden State Dynamics in Recurrent Neural Networks [J].
Strobelt, Hendrik ;
Gehrmann, Sebastian ;
Pfister, Hanspeter ;
Rush, Alexander M. .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2018, 24 (01) :667-676
[42]  
Springenberg JT, 2015, Arxiv, DOI arXiv:1412.6806
[43]   Investigating the Evolution of Tree Boosting Models with Visual Analytics [J].
Wang, Junpeng ;
Zhang, Wei ;
Wang, Liang ;
Yang, Hao .
2021 IEEE 14TH PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS 2021), 2021, :186-195
[44]   DeepVID: Deep Visual Interpretation and Diagnosis for Image Classifiers via Knowledge Distillation [J].
Wang, Junpeng ;
Gou, Liang ;
Zhang, Wei ;
Yang, Hao ;
Shen, Han-Wei .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (06) :2168-2180
[45]   DQNViz: A Visual Analytics Approach to Understand Deep Q-Networks [J].
Wang, Junpeng ;
Gou, Liang ;
Shen, Han-Wei ;
Yang, Hao .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (01) :288-298
[46]   GANViz: A Visual Analytics Approach to Understand the Adversarial Game [J].
Wang, Junpeng ;
Gou, Liang ;
Yang, Hao ;
Shen, Han-Wei .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2018, 24 (06) :1905-1917
[47]  
Wang Weixin, 2006, Proc. of the SIGCHI Conf. on Human Factors in Computing Systems, P517
[48]   The What-If Tool: Interactive Probing of Machine Learning Models [J].
Wexler, James ;
Pushkarna, Mahima ;
Bolukbasi, Tolga ;
Wattenberg, Martin ;
Viegas, Fernanda ;
Wilson, Jimbo .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (01) :56-65
[49]   Merchant Category Identification Using Credit Card Transactions [J].
Yeh, Chin-Chia Michael ;
Zhuang, Zhongfang ;
Zheng, Yan ;
Wang, Liang ;
Wang, Junpeng ;
Zhang, Wei .
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, :1736-1744
[50]  
Yu W., 2016, P 33 RD INT C MACHIN