Observation-Level and Parametric Interaction for High-Dimensional Data Analysis

被引:27
作者
Self, Jessica Zeitz [1 ]
Dowling, Michelle [2 ]
Wenskovitch, John [2 ]
Crandell, Ian [3 ]
Wang, Ming [2 ]
House, Leanna [3 ]
Leman, Scotland [3 ]
North, Chris [2 ]
机构
[1] Univ Mary Washington, Dept Comp Sci, 1301 Coll Ave, Fredericksburg, VA 22401 USA
[2] Virginia Tech, Dept Comp Sci, 225 Stanger St, Blacksburg, VA 24061 USA
[3] Virginia Tech, Dept Stat, 250 Drillfield Dr, Blacksburg, VA 24061 USA
基金
美国国家科学基金会;
关键词
Usability; user interface; visual analytics; dimension reduction; interaction; evaluation; data analysis; SEMANTIC INTERACTION; VISUALIZATION; REDUCTION;
D O I
10.1145/3158230
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Exploring high-dimensional data is challenging. Dimension reduction algorithms, such as weighted multidimensional scaling, support data exploration by projecting datasets to two dimensions for visualization. These projections can be explored through parametric interaction, tweaking underlying parameterizations, and observation-level interaction, directly interacting with the points within the projection. In this article, we present the results of a controlled usability study determining the differences, advantages, and drawbacks among parametric interaction, observation-level interaction, and their combination. The study assesses both interaction technique effects on domain-specific high-dimensional data analyses performed by non-experts of statistical algorithms. This study is performed using Andromeda, a tool that enables both parametric and observation-level interaction to provide in-depth data exploration. The results indicate that the two forms of interaction serve different, but complementary, purposes in gaining insight through steerable dimension reduction algorithms.
引用
收藏
页数:36
相关论文
共 50 条
[41]   Pollux: Interactive Cluster-First Projections of High-Dimensional Data [J].
Wenskovitch, John ;
North, Chris .
2019 IEEE VISUALIZATION IN DATA SCIENCE (VDS), 2019, :38-47
[42]   Meeting the Challenges of High-Dimensional Single-Cell Data Analysis in Immunology [J].
Patil, Subarea ;
Heuser, Christoph ;
de Almeida, Gustavo P. ;
Theis, Fabian J. ;
Zielinski, Christina E. .
FRONTIERS IN IMMUNOLOGY, 2019, 10
[43]   Local-Learning-Based Feature Selection for High-Dimensional Data Analysis [J].
Sun, Yijun ;
Todorovic, Sinisa ;
Goodison, Steve .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) :1610-1626
[44]   Deep Clustering and Visualization for End-to-End High-Dimensional Data Analysis [J].
Wu, Lirong ;
Yuan, Lifan ;
Zhao, Guojiang ;
Lin, Haitao ;
Li, Stan Z. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) :8543-8554
[45]   Diffusion maps for high-dimensional single-cell analysis of differentiation data [J].
Haghverdi, Laleh ;
Buettner, Florian ;
Theis, Fabian J. .
BIOINFORMATICS, 2015, 31 (18) :2989-2998
[46]   Buckley-James Boosting for Survival Analysis with High-Dimensional Biomarker Data [J].
Wang, Zhu ;
Wang, C. Y. .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2010, 9 (01)
[47]   Exploring high-dimensional biological data with sparse contrastive principal component analysis [J].
Boileau, Philippe ;
Hejazi, Nima S. ;
Dudoit, Sandrine .
BIOINFORMATICS, 2020, 36 (11) :3422-3430
[48]   Partial least squares: a versatile tool for the analysis of high-dimensional genomic data [J].
Boulesteix, Anne-Laure ;
Strimmer, Korbinian .
BRIEFINGS IN BIOINFORMATICS, 2007, 8 (01) :32-44
[49]   SpaceMAP: Visualizing High-dimensional Data by Space Expansion [J].
Zu, Xinrui ;
Tao, Qian .
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[50]   High-dimensional Density Estimation for Data Mining Tasks [J].
Kuleshov, Alexander ;
Bernstein, Alexander ;
Yanovich, Yury .
2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017, :523-530