Investigating the Effect of the Multiple Comparisons Problem in Visual Analysis

Cited by: 47
Authors
Zgraggen, Emanuel [1]
Zhao, Zheguang [1]
Zeleznik, Robert [1]
Kraska, Tim [1, 2]
Affiliations
[1] Brown Univ, Providence, RI 02912 USA
[2] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
Source
PROCEEDINGS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2018) | 2018
Keywords
Multiple Comparisons Problem; Visual Analysis; Visualization; Statistics; Experiment; BASE-RATE FALLACY; HOT HAND; ANALYTICS; INFERENCE; INSIGHT; METHODOLOGY; PEN
DOI
10.1145/3173574.3174053
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The goal of a visualization system is to facilitate data-driven insight discovery. But what if the insights are spurious? Features or patterns in visualizations can be perceived as relevant insights, even though they may arise from noise. We often compare visualizations to a mental image of what we are interested in: a particular trend, distribution or an unusual pattern. As more visualizations are examined and more comparisons are made, the probability of discovering spurious insights increases. This problem is well known in statistics as the multiple comparisons problem (MCP) but is overlooked in visual analysis. We present a way to evaluate MCP in visualization tools by measuring the accuracy of user-reported insights on synthetic datasets with known ground truth labels. In our experiment, over 60% of user insights were false. We show how a confirmatory analysis approach that accounts for all visual comparisons, insights and non-insights, can achieve results similar to those of one that requires a validation dataset.
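The abstract's claim that spurious insights become more likely as comparisons accumulate can be illustrated with a small calculation. The sketch below is not taken from the paper. It assumes every comparison is an independent significance test at level `alpha` and that every null hypothesis is true, which is a textbook simplification. The function names `familywise_error_rate` and `bonferroni_threshold` are hypothetical, and the Bonferroni correction is shown only as one standard remedy, not as the procedure the authors used.

```python
def familywise_error_rate(num_comparisons: int, alpha: float = 0.05) -> float:
    """Chance of at least one false positive among independent tests.

    Assumes all null hypotheses are true and the tests are independent
    (an illustrative simplification, not the paper's experimental setup).
    """
    return 1.0 - (1.0 - alpha) ** num_comparisons


def bonferroni_threshold(num_comparisons: int, alpha: float = 0.05) -> float:
    """Per-comparison significance level under the Bonferroni correction."""
    return alpha / num_comparisons


if __name__ == "__main__":
    for m in (1, 5, 20, 100):
        uncorrected = familywise_error_rate(m)
        corrected = familywise_error_rate(m, bonferroni_threshold(m))
        print(f"{m:>3} comparisons: uncorrected={uncorrected:.3f}, "
              f"Bonferroni-corrected={corrected:.3f}")
```

Under these assumptions the uncorrected rate climbs quickly with the number of comparisons, while the corrected rate stays at or below `alpha`. This is consistent with the abstract's point that all comparisons, including those that produced no insight, need to be counted.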
Pages: 12