Visual Analytics for Machine Learning: A Data Perspective Survey

被引:1
作者
Wang, Junpeng [1 ]
Liu, Shixia [2 ]
Zhang, Wei [1 ]
机构
[1] Visa Res, Foster City, CA 94404 USA
[2] Tsinghua Univ, Beijing 100084, Peoples R China
关键词
Task analysis; Data models; Surveys; Analytical models; Taxonomy; Market research; Visual analytics; Explainable AI; machine learning; taxonomy; VIS4ML; visual analytics; visualization; CONVOLUTIONAL NEURAL-NETWORKS; OF-THE-ART; INTERACTIVE ANALYSIS; VISUALIZATION; MODEL; EXPLANATIONS; DIAGNOSIS; CONSTRUCTION; UNDERSTAND; EXTRACTION;
D O I
10.1109/TVCG.2024.3357065
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The past decade has witnessed a plethora of works that leverage the power of visualization (VIS) to interpret machine learning (ML) models. The corresponding research topic, VIS4ML, keeps growing at a fast pace. To better organize the enormous works and shed light on the developing trend of VIS4ML, we provide a systematic review of these works through this survey. Since data quality greatly impacts the performance of ML models, our survey focuses specifically on summarizing VIS4ML works from the data perspective. First, we categorize the common data handled by ML models into five types, explain the unique features of each type, and highlight the corresponding ML models that are good at learning from them. Second, from the large number of VIS4ML works, we tease out six tasks that operate on these types of data (i.e., data-centric tasks) at different stages of the ML pipeline to understand, diagnose, and refine ML models. Lastly, by studying the distribution of 143 surveyed papers across the five data types, six data-centric tasks, and their intersections, we analyze the prospective research directions and envision future research trends.
引用
收藏
页码:7637 / 7656
页数:20
相关论文
共 197 条
  • [171] A Comprehensive Survey on Graph Neural Networks
    Wu, Zonghan
    Pan, Shirui
    Chen, Fengwen
    Long, Guodong
    Zhang, Chengqi
    Yu, Philip S.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (01) : 4 - 24
  • [172] Xenopoulos Peter, 2023, IEEE Trans Vis Comput Graph, V29, P853, DOI 10.1109/TVCG.2022.3209489
  • [173] Interactive Visual Cluster Analysis by Contrastive Dimensionality Reduction
    Xia J.
    Huang L.
    Lin W.
    Zhao X.
    Wu J.
    Chen Y.
    Zhao Y.
    Chen W.
    [J]. IEEE Transactions on Visualization and Computer Graphics, 2023, 29 (01) : 734 - 744
  • [174] Revisiting Dimensionality Reduction Techniques for Visual Cluster Analysis: An Empirical Study
    Xia, Jiazhi
    Zhang, Yuchen
    Song, Jie
    Chen, Yang
    Wang, Yunhai
    Liu, Shixia
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (01) : 529 - 539
  • [175] Xiang SX, 2019, IEEE CONF VIS ANAL, P57, DOI [10.1109/vast47406.2019.8986943, 10.1109/VAST47406.2019.8986943]
  • [176] FairRankVis: A Visual Analytics Framework for Exploring Algorithmic Fairness in Graph Mining Models
    Xie, Tiankai
    Ma, Yuxin
    Kang, Jian
    Tong, Hanghang
    Maciejewski, Ross
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (01) : 368 - 377
  • [177] VAC-CNN: A Visual Analytics System for Comparative Studies of Deep Convolutional Neural Networks
    Xuan, Xiwei
    Zhang, Xiaoyu
    Kwon, Oh-Hyun
    Ma, Kwan-Liu
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (06) : 2326 - 2337
  • [178] Yang WK, 2023, Arxiv, DOI arXiv:2312.05067
  • [179] Yang WK, 2023, Arxiv, DOI arXiv:2310.05771
  • [180] Diagnosing Ensemble Few-Shot Classifiers
    Yang, Weikai
    Ye, Xi
    Zhang, Xingxing
    Xiao, Lanxi
    Xia, Jiazhi
    Wang, Zhongyuan
    Zhu, Jun
    Pfister, Hanspeter
    Liu, Shixia
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (09) : 3292 - 3306