Visual Analytics for Machine Learning: A Data Perspective Survey

Cited by: 1
Authors
Wang, Junpeng [1]
Liu, Shixia [2]
Zhang, Wei [1]
Affiliations
[1] Visa Res, Foster City, CA 94404 USA
[2] Tsinghua Univ, Beijing 100084, Peoples R China
Keywords
Task analysis; Data models; Surveys; Analytical models; Taxonomy; Market research; Visual analytics; Explainable AI; machine learning; taxonomy; VIS4ML; visual analytics; visualization; CONVOLUTIONAL NEURAL-NETWORKS; OF-THE-ART; INTERACTIVE ANALYSIS; VISUALIZATION; MODEL; EXPLANATIONS; DIAGNOSIS; CONSTRUCTION; UNDERSTAND; EXTRACTION;
DOI
10.1109/TVCG.2024.3357065
Chinese Library Classification (CLC) number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
The past decade has witnessed a plethora of works that leverage the power of visualization (VIS) to interpret machine learning (ML) models. The corresponding research topic, VIS4ML, keeps growing at a fast pace. To better organize the enormous works and shed light on the developing trend of VIS4ML, we provide a systematic review of these works through this survey. Since data quality greatly impacts the performance of ML models, our survey focuses specifically on summarizing VIS4ML works from the data perspective. First, we categorize the common data handled by ML models into five types, explain the unique features of each type, and highlight the corresponding ML models that are good at learning from them. Second, from the large number of VIS4ML works, we tease out six tasks that operate on these types of data (i.e., data-centric tasks) at different stages of the ML pipeline to understand, diagnose, and refine ML models. Lastly, by studying the distribution of 143 surveyed papers across the five data types, six data-centric tasks, and their intersections, we analyze the prospective research directions and envision future research trends.
Pages: 7637-7656
Page count: 20