Integrated Dual Analysis of Quantitative and Qualitative High-Dimensional Data

被引:4
|
作者
Muller, Juliane [1 ]
Garrison, Laura [2 ,3 ]
Ulbrich, Philipp [1 ,4 ]
Schreiber, Stefanie [1 ,2 ,3 ,4 ]
Bruckner, Stefan [2 ]
Hauser, Helwig [2 ,3 ]
Oeltze-Jafra, Steffen [1 ,4 ]
机构
[1] Otto von Guericke Univ, Dept Neurol, D-39106 Magdeburg, Germany
[2] Univ Bergen, Haukeland Univ Hosp, Dept Informat, N-5021 Bergen, Norway
[3] Univ Bergen, Dept Radiol, Haukeland Univ Hosp, Mohn Med Imaging & Visualizat Ctr, N-5021 Bergen, Norway
[4] Otto von Guericke Univ, Ctr Behav Brain Sci, D-39106 Magdeburg, Germany
关键词
Correlation; Visualization; Data visualization; Tools; Standards; Neurology; Cultural differences; Dual analysis approach; high-dimensional data; mixed data; mixed statistical analysis; INTERACTIVE VISUAL ANALYSIS; VISUALIZATION; DIVERSITY; COHORT; SYSTEM; MODEL;
D O I
10.1109/TVCG.2021.3056424
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The Dual Analysis framework is a powerful enabling technology for the exploration of high dimensional quantitative data by treating data dimensions as first-class objects that can be explored in tandem with data values. In this article, we extend the Dual Analysis framework through the joint treatment of quantitative (numerical) and qualitative (categorical) dimensions. Computing common measures for all dimensions allows us to visualize both quantitative and qualitative dimensions in the same view. This enables a natural joint treatment of mixed data during interactive visual exploration and analysis. Several measures of variation for nominal qualitative data can also be applied to ordinal qualitative and quantitative data. For example, instead of measuring variability from a mean or median, other measures assess inter-data variation or average variation from a mode. In this work, we demonstrate how these measures can be integrated into the Dual Analysis framework to explore and generate hypotheses about high-dimensional mixed data. A medical case study using clinical routine data of patients suffering from Cerebral Small Vessel Disease (CSVD), conducted with a senior neurologist and a medical student, shows that a joint Dual Analysis approach for quantitative and qualitative data can rapidly lead to new insights based on which new hypotheses may be generated.
引用
收藏
页码:2953 / 2966
页数:14
相关论文
共 50 条
  • [41] An Examination of Grouping and Spatial Organization Tasks for High-Dimensional Data Exploration
    Wenskovitch, John
    North, Chris
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (02) : 1742 - 1752
  • [42] Dynamic visualization of statistical learning in the context of high-dimensional textual data
    Greenacre, Michael
    Hastie, Trevor
    JOURNAL OF WEB SEMANTICS, 2010, 8 (2-3): : 163 - 168
  • [43] Exploring high-dimensional biological data with sparse contrastive principal component analysis
    Boileau, Philippe
    Hejazi, Nima S.
    Dudoit, Sandrine
    BIOINFORMATICS, 2020, 36 (11) : 3422 - 3430
  • [44] TOWARD AGILE: AN INTEGRATED ANALYSIS OF QUANTITATIVE AND QUALITATIVE FIELD DATA ON SOFTWARE DEVELOPMENT AGILITY
    Lee, Gwanhoo
    Xia, Weidong
    MIS QUARTERLY, 2010, 34 (01) : 87 - 114
  • [45] Meeting the Challenges of High-Dimensional Single-Cell Data Analysis in Immunology
    Patil, Subarea
    Heuser, Christoph
    de Almeida, Gustavo P.
    Theis, Fabian J.
    Zielinski, Christina E.
    FRONTIERS IN IMMUNOLOGY, 2019, 10
  • [46] Diffusion maps for high-dimensional single-cell analysis of differentiation data
    Haghverdi, Laleh
    Buettner, Florian
    Theis, Fabian J.
    BIOINFORMATICS, 2015, 31 (18) : 2989 - 2998
  • [47] Focused multidimensional scaling: interactive visualization for exploration of high-dimensional data
    Lea M. Urpa
    Simon Anders
    BMC Bioinformatics, 20
  • [48] Network-based Clustering and Embedding for High-Dimensional Data Visualization
    Zhang, Hengyuan
    Chen, Xiaowu
    2013 INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN AND COMPUTER GRAPHICS (CAD/GRAPHICS), 2013, : 290 - 297
  • [49] Focused multidimensional scaling: interactive visualization for exploration of high-dimensional data
    Urpa, Lea M.
    Anders, Simon
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [50] High-dimensional covariance forecasting based on principal component analysis of high-frequency data
    Jian, Zhihong
    Deng, Pingjun
    Zhu, Zhican
    ECONOMIC MODELLING, 2018, 75 : 422 - 431