Towards insight-driven sampling for big data visualisation

被引:1
作者
Masiane, Moeti M. [1 ]
Driscoll, Anne [1 ]
Feng, Wuchun [1 ]
Wenskovitch, John [1 ]
North, Chris [1 ]
机构
[1] Virginia Tech, Blacksburg, VA 24061 USA
基金
美国国家科学基金会;
关键词
Visualisation; insight; big data; sampling; error; UNCERTAINTY;
D O I
10.1080/0144929X.2019.1616223
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Creating an interactive, accurate, and low-latency big data visualisation is challenging due to the volume, variety, and velocity of the data. Visualisation options range from visualising the entire big dataset, which could take a long time and be taxing to the system, to visualising a small subset of the dataset, which could be fast and less taxing to the system but could also lead to a less-beneficial visualisation as a result of information loss. The main research questions investigated by this work are what effect sampling has on visualisation insight and how to provide guidance to users in navigating this trade-off. To investigate these issues, we study an initial case of simple estimation tasks on histogram visualisations of sampled big data, in hopes that these results may generalise. Leveraging sampling, we generate subsets of large datasets and create visualisations for a crowd-sourced study involving a simple cognitive visualisation task. Using the results of this study, we quantify insight, sampling, visualisation, and perception error in comparison to the full dataset. We use these results to model the relationship between sample size and insight error, and we propose the use of our model to guide big data visualisation sampling.
引用
收藏
页码:788 / 807
页数:20
相关论文
共 39 条
[1]   On the Greenness of In-Situ and Post-Processing Visualization Pipelines [J].
Adhinarayanan, Vignesh ;
Feng, Wu-chun ;
Woodring, Jonathan ;
Rogers, David ;
Ahrens, James .
2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, :880-887
[2]  
[Anonymous], P 2008 WORKSH TIM ER
[3]  
[Anonymous], 2019, VIKALPA
[4]  
Berres A. S., 2017, TECHNICAL REPORT
[5]   Pricing schemes for energy-efficient HPC systems: Design and exploration [J].
Borghesi, Andrea ;
Bartolini, Andrea ;
Milano, Michela ;
Benini, Luca .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2019, 33 (04) :716-734
[6]  
Card S. K., 1999, READINGS INFORM VISU, P579
[7]   Defining Insight for Visual Analytics [J].
Chang, Remco ;
Ziemkiewicz, Caroline ;
Green, Tera Marie ;
Ribarsky, William .
IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2009, 29 (02) :14-17
[8]   Uncertainty-Aware Multidimensional Ensemble Data Visualization and Exploration [J].
Chen, Haidong ;
Zhang, Song ;
Chen, Wei ;
Mei, Honghui ;
Zhang, Jiawei ;
Mercer, Andrew ;
Liang, Ronghua ;
Qu, Huamin .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2015, 21 (09) :1072-1086
[9]   Characterizing Visualization Insights from Quantified Selfers' Personal Data Presentations [J].
Choe, Eun Kyoung ;
Lee, Bongshin ;
Schraefel, M. C. .
IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2015, 35 (04) :28-37
[10]  
Dahshan Mai, 2018, SC 2018 DALL TEX NOV