Investigating User Estimation of Missing Data in Visual Analysis

被引:0
|
作者
Sun, Maoyuan [1 ]
Wang, Yuanxin [2 ]
Bolton, Courtney [1 ]
Ma, Yue [1 ]
Li, Tianyi [3 ]
Zhao, Jian [2 ]
机构
[1] Northern Illinois Univ, De Kalb, IL 60115 USA
[2] Univ Waterloo, Waterloo, ON, Canada
[3] Purdue Univ, W Lafayette, IN USA
来源
PROCEEDINGS OF THE 50TH GRAPHICS INTERFACE CONFERENCE, GI 2024 | 2024年
基金
加拿大自然科学与工程研究理事会;
关键词
Missing data; time series; visual analysis; UNCERTAINTY; IMPUTATION; VISUALIZATION; KNOWLEDGE; MODEL;
D O I
10.1145/3670947.3670977
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Missing data is a pervasive issue in real-world analytics, stemming from a multitude of factors (e.g., device malfunctions and network disruptions), making it a ubiquitous challenge in many domains. Misperception of missing data impacts decision-making and causes severe consequences. To mitigate risks from missing data and facilitate proper handling, computing methods (e.g., imputation) have been studied, which often culminate in the visual representation of data for analysts to further check. Yet, the influence of these computed representations on user judgment regarding missing data remains unclear. To study potential influencing factors and their impact on user judgment, we conducted a crowdsourcing study. We controlled 4 factors: the distribution, imputation, and visualization of missing data, and the prior knowledge of data. We compared users' estimations of missing data with computed imputations under different combinations of these factors. Our results offer useful guidance for visualizing missing data and their imputations, which informs future studies on developing trustworthy computing methods for visual analysis of missing data.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Haplotype frequency estimation error analysis in the presence of missing genotype data
    Kelly, ED
    Sievers, F
    McManus, R
    BMC BIOINFORMATICS, 2004, 5 (1)
  • [32] Handling Missing Data in COVID-19 Incidence Estimation:Secondary Data Analysis
    Pham, Hai-Thanh
    Do, Toan
    Baek, Jonggyu
    Nguyen, Cong-Khanh
    Pham, Quang-Thai
    Nguyen, Hoa L.
    Goldberg, Robert
    Pham, Quang Loc
    Giang, Le Minh
    JMIR PUBLIC HEALTH AND SURVEILLANCE, 2024, 10
  • [33] Kernel PCA Regression for Missing Data Estimation in DNA Microarray Analysis
    Shan, Ying
    Deng, Guang
    ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009, : 1477 - 1480
  • [34] Estimation in semiparametric models with missing data
    Chen, Song Xi
    Van Keilegom, Ingrid
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2013, 65 (04) : 785 - 805
  • [35] Efficient estimation with missing data and endogeneity
    Rai, Bhavna
    ECONOMETRIC REVIEWS, 2023, 42 (02) : 220 - 239
  • [36] SPECTRAL ESTIMATION IN THE PRESENCE OF MISSING DATA
    Bahamonde, Natalia
    Doukhan, Paul
    THEORY OF PROBABILITY AND MATHEMATICAL STATISTICS, 2016, 95 : 55 - 74
  • [37] Estimation in semiparametric models with missing data
    Song Xi Chen
    Ingrid Van Keilegom
    Annals of the Institute of Statistical Mathematics, 2013, 65 : 785 - 805
  • [38] Nonparametric mean estimation with missing data
    González-Manteiga, W
    Pérez-González, A
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2004, 33 (02) : 277 - 303
  • [39] Robust location estimation with missing data
    Sued, Mariela
    Yohai, Victor J.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2013, 41 (01): : 111 - 132
  • [40] Missing data estimation by separable deblurring
    Qi, HR
    Snyder, WE
    Bilbro, GL
    IEEE INTERNATIONAL JOINT SYMPOSIA ON INTELLIGENCE AND SYSTEMS - PROCEEDINGS, 1998, : 348 - 353