Visual Analytics for Machine Learning: A Data Perspective Survey

被引:1
|
作者
Wang, Junpeng [1 ]
Liu, Shixia [2 ]
Zhang, Wei [1 ]
机构
[1] Visa Res, Foster City, CA 94404 USA
[2] Tsinghua Univ, Beijing 100084, Peoples R China
关键词
Task analysis; Data models; Surveys; Analytical models; Taxonomy; Market research; Visual analytics; Explainable AI; machine learning; taxonomy; VIS4ML; visual analytics; visualization; CONVOLUTIONAL NEURAL-NETWORKS; OF-THE-ART; INTERACTIVE ANALYSIS; VISUALIZATION; MODEL; EXPLANATIONS; DIAGNOSIS; CONSTRUCTION; UNDERSTAND; EXTRACTION;
D O I
10.1109/TVCG.2024.3357065
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The past decade has witnessed a plethora of works that leverage the power of visualization (VIS) to interpret machine learning (ML) models. The corresponding research topic, VIS4ML, keeps growing at a fast pace. To better organize the enormous works and shed light on the developing trend of VIS4ML, we provide a systematic review of these works through this survey. Since data quality greatly impacts the performance of ML models, our survey focuses specifically on summarizing VIS4ML works from the data perspective. First, we categorize the common data handled by ML models into five types, explain the unique features of each type, and highlight the corresponding ML models that are good at learning from them. Second, from the large number of VIS4ML works, we tease out six tasks that operate on these types of data (i.e., data-centric tasks) at different stages of the ML pipeline to understand, diagnose, and refine ML models. Lastly, by studying the distribution of 143 surveyed papers across the five data types, six data-centric tasks, and their intersections, we analyze the prospective research directions and envision future research trends.
引用
收藏
页码:7637 / 7656
页数:20
相关论文
共 50 条
  • [41] Design of New Dispersants Using Machine Learning and Visual Analytics
    Martinez, Maria Jimena
    Naveiro, Roi
    Soto, Axel J.
    Talavante, Pablo
    Kim Lee, Shin-Ho
    Gomez Arrayas, Ramon
    Franco, Mario
    Mauleon, Pablo
    Lozano Ordonez, Hector
    Revilla Lopez, Guillermo
    Bernabei, Marco
    Campillo, Nuria E.
    Ponzoni, Ignacio
    POLYMERS, 2023, 15 (05)
  • [42] FAIRVIS: Visual Analytics for Discovering Intersectional Bias in Machine Learning
    Cabrera, Angel Alexander
    Epperson, Will
    Hohman, Fred
    Kahng, Minsuk
    Morgenstern, Jamie
    Chau, Duen Horng
    2019 IEEE CONFERENCE ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY (VAST), 2019, : 46 - 56
  • [43] Visual Analytics in Deep Learning: An Interrogative Survey for the Next Frontiers
    Hohman, Fred
    Kahng, Minsuk
    Pienta, Robert
    Chau, Duen Horng
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (08) : 2674 - 2693
  • [44] Visual Analytics of Heterogeneous Data Using Hypergraph Learning
    Xie, Cong
    Zhong, Wen
    Xu, Wei
    Mueller, Klaus
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2019, 10 (01)
  • [45] Spatiotemporal Urban-Data Analysis A Visual Analytics Perspective
    Doraiswamy, Harish
    Freire, Juliana
    Lage, Marcos
    Miranda, Fabio
    Silva, Claudio
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2018, 38 (05) : 26 - 35
  • [46] Visual saliency computation: A machine learning perspective
    Li, Jia
    Gao, Wen
    1600, Springer Verlag (8408): : 1 - 249
  • [47] Data Analytics for Cybersecurity Based on Machine Learning Algorithms
    Wang, Lidong
    Mosher, Reed L.
    Duett, Patti
    Falls, Terril C.
    SOUTHEASTCON 2023, 2023, : 810 - 814
  • [48] Big data analytics and machine learning: 2015 and beyond
    Passos, Ives Cavalcante
    Mwangi, Benson
    Kapczinski, Flavio
    LANCET PSYCHIATRY, 2016, 3 (01): : 13 - 15
  • [49] Analytics of Epidemiological Data using Machine Learning Models
    Barapatre, Harshita
    Jangir, Jatin
    Bajpai, Sudhanshu
    Chawla, Bhavesh
    Keswani, Gunjan
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2023, 14 (01): : 255 - 262
  • [50] Big Data Analytics using Machine Learning Techniques
    Mittal, Shweta
    Sangwan, Om Prakash
    2019 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2019), 2019, : 203 - 207