Visualizing Multidimensional Data with Order Statistics

被引:4
作者
Raj, M. [1 ]
Whitaker, R. T. [1 ]
机构
[1] Univ Utah, Salt Lake City, UT 84112 USA
基金
美国国家科学基金会;
关键词
CHARACTERIZING UNCERTAINTY; STRICTLY MONOTONE; BOXPLOTS; REGRESSION; ENSEMBLES; DEPTH;
D O I
10.1111/cgf.13419
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Multidimensional data sets are common in many domains, and dimensionality reduction methods that determine a lower dimensional embedding are widely used for visualizing such data sets. This paper presents a novel method to project data onto a lower dimensional space by taking into account the order statistics of the individual data points, which are quantified by their depth or centrality in the overall set. Thus, in addition to conveying relative distances in the data, the proposed method also preserves the order statistics, which are often lost or misrepresented by existing visualization methods. The proposed method entails a modification of the optimization objective of conventional multidimensional scaling (MDS) by introducing a term that penalizes discrepancies between centrality structures in the original space and the embedding. We also introduce two strategies for visualizing lower dimensional embeddings of multidimensional data that takes advantage of the coherent representation of centrality provided by the proposed projection method. We demonstrate the effectiveness of our visualization with comparisons on different kinds of multidimensional data, including categorical and multimodal, from a variety of domains such as botany and health care.
引用
收藏
页码:277 / 287
页数:11
相关论文
共 38 条
  • [1] [Anonymous], 2010, Modern multidimensional scaling: theory and applications
  • [2] Baingana B., 2014, ARXIV14014408
  • [3] Kernel Functions for Categorical Variables with Application to Problems in the Life Sciences
    Belanche, Lluis A.
    Villegas, Marco A.
    [J]. ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE OF THE CATALAN ASSOCIATION FOR ARTIFICIAL INTELLIGENCE, 2013, 256 : 171 - 180
  • [5] Communicating centrality in policy network drawings
    Brandes, U
    Kenis, P
    Wagner, D
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2003, 9 (02) : 241 - 253
  • [6] Brandes U., 2009, International Symposium on Graph Drawing, P107
  • [7] STRUCTURE ACTIVITY RELATIONSHIP OF MUTAGENIC AROMATIC AND HETEROAROMATIC NITRO-COMPOUNDS - CORRELATION WITH MOLECULAR-ORBITAL ENERGIES AND HYDROPHOBICITY
    DEBNATH, AK
    DECOMPADRE, RLL
    DEBNATH, G
    SHUSTERMAN, AJ
    HANSCH, C
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 1991, 34 (02) : 786 - 797
  • [8] Strictly monotone and smooth nonparametric regression for two or more variables
    Dette, Holger
    Scheder, Regine
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2006, 34 (04): : 535 - 561
  • [9] A simple nonparametric estimator of a strictly monotone regression function
    Dette, Holger
    Neumeyer, Natalie
    Pilz, Kay F.
    [J]. BERNOULLI, 2006, 12 (03) : 469 - 490
  • [10] Exact computation of the halfspace depth
    Dyckerhoff, Rainer
    Mozharovskyi, Pavlo
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 98 : 19 - 30