Visual Analytics for Machine Learning: A Data Perspective Survey

被引:1
作者
Wang, Junpeng [1 ]
Liu, Shixia [2 ]
Zhang, Wei [1 ]
机构
[1] Visa Res, Foster City, CA 94404 USA
[2] Tsinghua Univ, Beijing 100084, Peoples R China
关键词
Task analysis; Data models; Surveys; Analytical models; Taxonomy; Market research; Visual analytics; Explainable AI; machine learning; taxonomy; VIS4ML; visual analytics; visualization; CONVOLUTIONAL NEURAL-NETWORKS; OF-THE-ART; INTERACTIVE ANALYSIS; VISUALIZATION; MODEL; EXPLANATIONS; DIAGNOSIS; CONSTRUCTION; UNDERSTAND; EXTRACTION;
D O I
10.1109/TVCG.2024.3357065
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The past decade has witnessed a plethora of works that leverage the power of visualization (VIS) to interpret machine learning (ML) models. The corresponding research topic, VIS4ML, keeps growing at a fast pace. To better organize the enormous works and shed light on the developing trend of VIS4ML, we provide a systematic review of these works through this survey. Since data quality greatly impacts the performance of ML models, our survey focuses specifically on summarizing VIS4ML works from the data perspective. First, we categorize the common data handled by ML models into five types, explain the unique features of each type, and highlight the corresponding ML models that are good at learning from them. Second, from the large number of VIS4ML works, we tease out six tasks that operate on these types of data (i.e., data-centric tasks) at different stages of the ML pipeline to understand, diagnose, and refine ML models. Lastly, by studying the distribution of 143 surveyed papers across the five data types, six data-centric tasks, and their intersections, we analyze the prospective research directions and envision future research trends.
引用
收藏
页码:7637 / 7656
页数:20
相关论文
共 197 条
  • [151] Vaswani A, 2017, ADV NEUR IN, V30
  • [152] Comparison of descriptor spaces for chemical compound retrieval and classification
    Wale, Nikil
    Watson, Ian A.
    Karypis, George
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 14 (03) : 347 - 375
  • [153] Wang J., 2019, Interpreting and diagnosing deep learning models: A visual analytics approach
  • [154] Visual Analytics for RNN-Based Deep Reinforcement Learning
    Wang, Junpeng
    Zhang, Wei
    Yang, Hao
    Yeh, Chin-Chia Michael
    Wang, Liang
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (12) : 4141 - 4155
  • [155] Learning-From-Disagreement: A Model Comparison and Visual Analytics Framework
    Wang, Junpeng
    Wang, Liang
    Zheng, Yan
    Yeh, Chin-Chia Michael
    Jain, Shubham
    Zhang, Wei
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023, 29 (09) : 3809 - 3825
  • [156] Investigating the Evolution of Tree Boosting Models with Visual Analytics
    Wang, Junpeng
    Zhang, Wei
    Wang, Liang
    Yang, Hao
    [J]. 2021 IEEE 14TH PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS 2021), 2021, : 186 - 195
  • [157] SCANViz: Interpreting the Symbol-Concept Association Captured by Deep Neural Networks through Visual Analytics
    Wang, Junpeng
    Zhang, Wei
    Yang, Hao
    [J]. 2020 IEEE PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS), 2020, : 51 - 60
  • [158] Visualization and Visual Analysis of Ensemble Data: A Survey
    Wang, Junpeng
    Hazarika, Subhashis
    Li, Cheng
    Shen, Han-Wei
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (09) : 2853 - 2872
  • [159] DeepVID: Deep Visual Interpretation and Diagnosis for Image Classifiers via Knowledge Distillation
    Wang, Junpeng
    Gou, Liang
    Zhang, Wei
    Yang, Hao
    Shen, Han-Wei
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (06) : 2168 - 2180
  • [160] DQNViz: A Visual Analytics Approach to Understand Deep Q-Networks
    Wang, Junpeng
    Gou, Liang
    Shen, Han-Wei
    Yang, Hao
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (01) : 288 - 298