Visual Analytics for Machine Learning: A Data Perspective Survey

被引:1
作者
Wang, Junpeng [1 ]
Liu, Shixia [2 ]
Zhang, Wei [1 ]
机构
[1] Visa Res, Foster City, CA 94404 USA
[2] Tsinghua Univ, Beijing 100084, Peoples R China
关键词
Task analysis; Data models; Surveys; Analytical models; Taxonomy; Market research; Visual analytics; Explainable AI; machine learning; taxonomy; VIS4ML; visual analytics; visualization; CONVOLUTIONAL NEURAL-NETWORKS; OF-THE-ART; INTERACTIVE ANALYSIS; VISUALIZATION; MODEL; EXPLANATIONS; DIAGNOSIS; CONSTRUCTION; UNDERSTAND; EXTRACTION;
D O I
10.1109/TVCG.2024.3357065
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The past decade has witnessed a plethora of works that leverage the power of visualization (VIS) to interpret machine learning (ML) models. The corresponding research topic, VIS4ML, keeps growing at a fast pace. To better organize the enormous works and shed light on the developing trend of VIS4ML, we provide a systematic review of these works through this survey. Since data quality greatly impacts the performance of ML models, our survey focuses specifically on summarizing VIS4ML works from the data perspective. First, we categorize the common data handled by ML models into five types, explain the unique features of each type, and highlight the corresponding ML models that are good at learning from them. Second, from the large number of VIS4ML works, we tease out six tasks that operate on these types of data (i.e., data-centric tasks) at different stages of the ML pipeline to understand, diagnose, and refine ML models. Lastly, by studying the distribution of 143 surveyed papers across the five data types, six data-centric tasks, and their intersections, we analyze the prospective research directions and envision future research trends.
引用
收藏
页码:7637 / 7656
页数:20
相关论文
共 197 条
  • [1] FairSight: Visual Analytics for Fairness in Decision Making
    Ahn, Yongsu
    Lin, Yu-Ru
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (01) : 1086 - 1095
  • [2] Do Convolutional Neural Networks Learn Class Hierarchy?
    Alsallakh, Bilal
    Jourabloo, Amin
    Ye, Mao
    Liu, Xiaoming
    Ren, Liu
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2018, 24 (01) : 152 - 162
  • [3] Visual Methods for Analyzing Probabilistic Classification Data
    Alsallakh, Bilal
    Hanbury, Allan
    Hauser, Helwig
    Miksch, Silvia
    Rauber, Andreas
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2014, 20 (12) : 1703 - 1712
  • [4] Low-level components of analytic activity in information visualization
    Amar, R
    Eagan, J
    Stasko, J
    [J]. INFOVIS 05: IEEE SYMPOSIUM ON INFORMATION VISUALIZATION, PROCEEDINGS, 2005, : 111 - 117
  • [5] ModelTracker: Redesigning Performance Analysis Tools for Machine Learning
    Amershi, Saleema
    Chickering, Max
    Drucker, Steven M.
    Lee, Bongshin
    Simard, Patrice
    Suh, Jina
    [J]. CHI 2015: PROCEEDINGS OF THE 33RD ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2015, : 337 - 346
  • [6] Classifier-Guided Visual Correction of Noisy Labels for Image Classification Tasks
    Baeuerle, A.
    Neumann, H.
    Ropinski, T.
    [J]. COMPUTER GRAPHICS FORUM, 2020, 39 (03) : 195 - 205
  • [7] Net2Vis-A Visual Grammar for Automatically Generating Publication-Tailored CNN Architecture Visualizations
    Baeuerle, Alex
    van Onzenoodt, Christian
    Ropinski, Timo
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (06) : 2980 - 2991
  • [8] Visual Analysis and Dissemination of Scientific Literature Collections with SurVis
    Beck, Fabian
    Koch, Sebastian
    Weiskopf, Daniel
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2016, 22 (01) : 180 - 189
  • [9] Visually Analyzing Contextualized Embeddings
    Berger, Matthew
    [J]. 2020 IEEE VISUALIZATION CONFERENCE - SHORT PAPERS (VIS 2020), 2020, : 276 - 280
  • [10] Towards User-Centered Active Learning Algorithms
    Bernard, Juergen
    Zeppelzauer, Matthias
    Lehmann, Markus
    Mueller, Martin
    Sedlmair, Michael
    [J]. COMPUTER GRAPHICS FORUM, 2018, 37 (03) : 121 - 132