High-Volume Hypothesis Testing: Systematic Exploration of Event Sequence Comparisons

被引:32
作者
Malik, Sana [1 ]
Shneiderman, Ben [1 ]
Du, Fan [1 ]
Plaisant, Catherine [1 ]
Bjarnadottir, Margret [2 ]
机构
[1] Univ Maryland, Human Comp Interact Lab, College Pk, MD 20742 USA
[2] Univ Maryland, Robert H Smith Sch Business, College Pk, MD 20742 USA
关键词
Cohort comparison; event sequences; visual analytics;
D O I
10.1145/2890478
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cohort comparison studies have traditionally been hypothesis driven and conducted in carefully controlled environments (such as clinical trials). Given two groups of event sequence data, researchers test a single hypothesis (e.g., does the group taking Medication A exhibit more deaths than the group taking Medication B?). Recently, however, researchers have been moving toward more exploratory methods of retrospective analysis with existing data. In this article, we begin by showing that the task of cohort comparison is specific enough to support automatic computation against a bounded set of potential questions and objectives, a method that we refer to as High-Volume Hypothesis Testing (HVHT). From this starting point, we demonstrate that the diversity of these objectives, both across and within different domains, as well as the inherent complexities of real-world datasets, still requires human involvement to determine meaningful insights. We explore how visualization and interaction better support the task of exploratory data analysis and the understanding of HVHT results (how significant they are, why they are meaningful, and whether the entire dataset has been exhaustively explored). Through interviews and case studies with domain experts, we iteratively design and implement visualization and interaction techniques in a visual analytics tool, CoCo. As a result of our evaluation, we propose six design guidelines for enabling users to explore large result sets of HVHT systematically and flexibly in order to glean meaningful insights more quickly. Finally, we illustrate the utility of this method with three case studies in the medical domain.
引用
收藏
页数:23
相关论文
共 39 条
[11]   MULTIPLE COMPARISONS AMONG MEANS [J].
DUNN, OJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1961, 56 (293) :52-&
[12]   Survival of patients with bronchiectasis after the first ICU stay for respiratory failure [J].
Dupont, M ;
Gacouin, A ;
Lena, H ;
Lavoué, S ;
Brinchault, G ;
Delaval, P ;
Thomas, R .
CHEST, 2004, 125 (05) :1815-1820
[13]  
Federico P., 2015, EUROVIS WORKSHOP VIS, DOI [10.2312/eurova.20151108, DOI 10.2312/EUROVA.20151108]
[14]   Visual comparison for information visualization [J].
Gleicher, Michael ;
Albers, Danielle ;
Walker, Rick ;
Jusufi, Ilir ;
Hansen, Charles D. ;
Roberts, Jonathan C. .
INFORMATION VISUALIZATION, 2011, 10 (04) :289-309
[15]  
Goel Manish Kumar, 2010, Int J Ayurveda Res, V1, P274, DOI 10.4103/0974-7788.76794
[16]  
Guerra-Gomez J. A., 2011, P TRANSP RES BOARD 9
[17]   Frequent pattern mining: current status and future directions [J].
Han, Jiawei ;
Cheng, Hong ;
Xin, Dong ;
Yan, Xifeng .
DATA MINING AND KNOWLEDGE DISCOVERY, 2007, 15 (01) :55-86
[18]   Randomized trial of short-versus long-course radiotherapy for palliation of painful bone metastases [J].
Hartsell, WF ;
Scott, CB ;
Bruner, DW ;
Scarantino, CW ;
Ivker, RA ;
Roach, M ;
Suh, JH ;
Demas, WF ;
Movsas, B ;
Petersen, IA ;
Konski, AA ;
Cleeland, CS ;
Janjan, NA ;
DeSilvio, M .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2005, 97 (11) :798-804
[19]  
Lage MJ, 2008, AM J MANAG CARE, V14, P317
[20]   Mind the time: Unleashing temporal aspects in pattern discovery [J].
Lammarsch, T. ;
Aigner, W. ;
Bertone, A. ;
Miksch, S. ;
Rind, A. .
COMPUTERS & GRAPHICS-UK, 2014, 38 :38-50