Examining data visualization pitfalls in scientific publications

被引:0
|
作者
Vinh T Nguyen
Kwanghee Jung
Vibhuti Gupta
机构
[1] TNU – University of Information and Communication Technology,Department of Information Technology
[2] Texas Tech University,Department of Educational Psychology, Leadership, and Counseling
[3] Meharry Medical College,Department of Computer Science and Data Science
来源
Visual Computing for Industry, Biomedicine, and Art | / 4卷
关键词
Data visualization; Graphical representations; Misinformation; Visual encodings; Association rule mining; Word cloud; Cochran’s Q test; McNemar’s test;
D O I
暂无
中图分类号
学科分类号
摘要
Data visualization blends art and science to convey stories from data via graphical representations. Considering different problems, applications, requirements, and design goals, it is challenging to combine these two components at their full force. While the art component involves creating visually appealing and easily interpreted graphics for users, the science component requires accurate representations of a large amount of input data. With a lack of the science component, visualization cannot serve its role of creating correct representations of the actual data, thus leading to wrong perception, interpretation, and decision. It might be even worse if incorrect visual representations were intentionally produced to deceive the viewers. To address common pitfalls in graphical representations, this paper focuses on identifying and understanding the root causes of misinformation in graphical representations. We reviewed the misleading data visualization examples in the scientific publications collected from indexing databases and then projected them onto the fundamental units of visual communication such as color, shape, size, and spatial orientation. Moreover, a text mining technique was applied to extract practical insights from common visualization pitfalls. Cochran’s Q test and McNemar’s test were conducted to examine if there is any difference in the proportions of common errors among color, shape, size, and spatial orientation. The findings showed that the pie chart is the most misused graphical representation, and size is the most critical issue. It was also observed that there were statistically significant differences in the proportion of errors among color, shape, size, and spatial orientation.
引用
收藏
相关论文
共 50 条
  • [31] Data visualization in environmental proteomics
    Mehlan, Henry
    Schmidt, Frank
    Weiss, Stefan
    Schueler, Julia
    Fuchs, Stephan
    Riedel, Katharina
    Bernhardt, Joerg
    PROTEOMICS, 2013, 13 (18-19) : 2805 - 2821
  • [32] Effective Data Visualization in Cybersecurity
    Adams, Chelsea Noelani
    Snider, Dallas H.
    IEEE SOUTHEASTCON 2018, 2018,
  • [33] Color Computing in Data Visualization
    Zeng Q.
    Wang Y.
    Tu C.
    Chen B.
    1600, Institute of Computing Technology (32): : 1549 - 1559
  • [34] A Data Provenance Visualization Approach
    Yazici, Ilkay Melek
    Karabulut, Erkan
    Aktas, Mehmet S.
    2018 14TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2018, : 84 - 91
  • [35] Aggregated Visualization of Playtesting Data
    Wallner, Gunter
    Halabi, Nour
    Mirza-Babaei, Pejman
    CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
  • [36] THE SHISHUO XINYU AS DATA VISUALIZATION
    Chen, Jack W.
    Borovsky, Zoe
    Kawano, Yoh
    Chen, Ryan
    EARLY MEDIEVAL CHINA, 2014, (20) : 23 - 59
  • [37] Visualization of univariate data for comparison
    Farago, Csaba
    ANNALES MATHEMATICAE ET INFORMATICAE, 2015, 45 : 39 - 53
  • [38] Data visualization for asymmetric relations
    Choi, Heeyoul
    NEUROCOMPUTING, 2014, 124 : 97 - 104
  • [39] Data Visualization for Digital Twins
    Comba, Joao L. D.
    Santos, Nicolau O.
    Rivera, Jonathan C.
    Romeu, Regis K.
    Abel, Mara
    COMPUTING IN SCIENCE & ENGINEERING, 2023, 25 (02) : 58 - 63
  • [40] Data visualization: A unique storyteller
    Dong, Xiaoxu
    TECHNOETIC ARTS, 2019, 17 (03) : 259 - 279