New Approaches in Visualization of Categorical Data: R Package extracat

被引:0
|
作者
Pilhoefer, Alexander [1 ]
Unwin, Antony [1 ]
机构
[1] Univ Augsburg, Inst Math, Dept Comp Oriented Stat & Data Anal, D-86135 Augsburg, Germany
来源
JOURNAL OF STATISTICAL SOFTWARE | 2013年 / 53卷 / 07期
关键词
categorical data; multiple barcharts; parallel coordinates; R; DISPLAYS;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The R package extracat provides two new graphical methods for displaying categorical data extending the concepts of multiple barcharts and parallel coordinates plots. The first method called rmb plot uses a crossover of mosaicplots and multiple barcharts to display the frequencies of a data table split up into conditional relative frequencies of one target variable and the absolute frequencies of the corresponding combinations of the remaining explanatory variables. It provides a well-structured representation of the data which is easy to interpret and allows precise comparisons. The graphic can additionally be used as a generalization of spineplots or with barcharts for the conditional relative frequencies. Several options, including ceiling censored zooming, residual shadings and a choice of color palettes, are provided. An interactive version based on the R package iWidgets is also presented. The second graphic cpcp uses the interactive parallel coordinates plots in the iplots package to visualize categorical data. Sequences of points are used to represent each of the variable categories, while ordering algorithms are applied to represent a hierarchical structure in the data and keep the arrangement clear. This interactive graphic is well-suited for exploratory analysis and allows a visual interpretation even for a higher number of variables and a mixture of categorical and numeric scales.
引用
收藏
页码:1 / 25
页数:25
相关论文
共 50 条
  • [21] Processing Ecological Data in R with the mefa Package
    Solymos, Peter
    JOURNAL OF STATISTICAL SOFTWARE, 2009, 29 (08): : 1 - 28
  • [22] ggenealogy: An R Package for Visualizing Genealogical Data
    Rutter, Lindsay
    VanderPlas, Susan
    Cook, Dianne
    Graham, Michelle A.
    JOURNAL OF STATISTICAL SOFTWARE, 2019, 89 (13): : 1 - 31
  • [23] Anthropometry: An R Package for Analysis of Anthropometric Data
    Vinue, Guillermo
    JOURNAL OF STATISTICAL SOFTWARE, 2017, 77 (06): : 1 - 39
  • [24] A New Approach for Calculating Similarity of Categorical Data
    Jin, Cheng Hao
    Li, Xun
    Lee, Yang Koo
    Pok, Gouchol
    Ryu, Keun Ho
    CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, 2011, 206 : 584 - +
  • [25] Quantitative Evaluation of Big Data Categorical Variables through R
    Pandey, Rajiv
    Dhoundiyal, Manoj
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES, ICICT 2014, 2015, 46 : 582 - 588
  • [26] stab: An R package for drug stability data analysis
    Lee, Hsin-ya
    Wu, Pao- chu
    Lee, Yung-jin
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2010, 100 (02) : 140 - 148
  • [27] Simulation of Synthetic Complex Data: The R Package simPop
    Templ, Matthias
    Meindl, Bernhard
    Kowarik, Alexander
    Dupriez, Olivier
    JOURNAL OF STATISTICAL SOFTWARE, 2017, 79 (10): : 1 - 38
  • [28] ggroups: an R package for pedigree and genetic groups data
    Nilforooshan, Mohammad Ali
    Saavedra-Jimenez, Luis Antonio
    HEREDITAS, 2020, 157 (01)
  • [29] Analyzing Intraday Financial Data in R: The highfrequency Package
    Boudt, Kris
    Kleen, Onno
    Sjorup, Emil
    JOURNAL OF STATISTICAL SOFTWARE, 2022, 104 (08): : 1 - 36
  • [30] ggroups: an R package for pedigree and genetic groups data
    Mohammad Ali Nilforooshan
    Luis Antonio Saavedra-Jiménez
    Hereditas, 157