VICTOR: A visual analytics web application for comparing cluster sets

被引:8
|
作者
Karatzas, Evangelos [1 ]
Gkonta, Maria [1 ,2 ]
Hotova, Joana [1 ,2 ]
Baltoumas, Fotis A. [1 ]
Kontou, Panagiota, I [3 ]
Bobotsis, Christopher J. [4 ]
Bagos, Pantelis G. [3 ]
Pavlopoulos, Georgios A. [1 ]
机构
[1] Inst Fundamental Biomed Res BSRC Alexander Flemin, 34 Fleming St, Vari, Greece
[2] Univ Athens, Dept Biol, Athens, Greece
[3] Univ Thessaly, Dept Comp Sci & Biomed Informat, Lamia, Greece
[4] Univ Waterloo, Sch Comp Sci, Waterloo, ON, Canada
关键词
Cluster sets comparison; Interactive visualization; Cluster conductance; Counting pairs; Set overlaps; Mutual information; ALGORITHM;
D O I
10.1016/j.compbiomed.2021.104557
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Clustering is the process of grouping different data objects based on similar properties. Clustering has applications in various case studies from several fields such as graph theory, image analysis, pattern recognition, statistics and others. Nowadays, there are numerous algorithms and tools able to generate clustering results. However, different algorithms or parameterizations may produce quite dissimilar cluster sets. In this way, the user is often forced to manually filter and compare these results in order to decide which of them generate the ideal clusters. To automate this process, in this study, we present VICTOR, the first fully interactive and dependency-free visual analytics web application which allows the visual comparison of the results of various clustering algorithms. VICTOR can handle multiple cluster set results simultaneously and compare them using ten different metrics. Clustering results can be filtered and compared to each other with the use of data tables or interactive heatmaps, bar plots, correlation networks, sankey and circos plots. We demonstrate VICTOR's functionality using three examples. In the first case, we compare five different network clustering algorithms on a Yeast protein-protein interaction dataset whereas in the second example, we test four different parameters of the MCL clustering algorithm on the same dataset. Finally, as a third example, we compare four different metaanalyses with hierarchically clustered differentially expressed genes found to be involved in myocardial infarction. VICTOR is available at http://victor.pavlopouloslab.info or http://bib.fleming.gr:3838/VICTOR.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Graph-based techniques for visual analytics of scientific data sets
    Wang C.
    Computing in Science and Engineering, 2018, 20 (01): : 93 - 103
  • [22] Graph-Based Techniques for Visual Analytics of Scientific Data Sets
    Wang, Chaoli
    COMPUTING IN SCIENCE & ENGINEERING, 2018, 20 (01) : 93 - 103
  • [23] An application of visual analytics methods to cluster and categorize data processing jobs in high energy and nuclear physics experiments
    Galkin T.
    Grigoryeva M.
    Klimentov A.
    Korchuganova T.
    Milman I.
    Pilyugin V.
    Titov M.
    2018, National Research Nuclear University (10): : 32 - 44
  • [24] AppMine: Behavioral Analytics for Web Application Vulnerability Detection
    Jana, Indranil
    Oprea, Alina
    CCSW'19: PROCEEDINGS OF THE 2019 ACM SIGSAC CONFERENCE ON CLOUD COMPUTING SECURITY WORKSHOP, 2019, : 69 - 80
  • [25] Mountaineer: Topology-Driven Visual Analytics for Comparing Local Explanations
    Solunke, Parikshit
    Guardieiro, Vitoria
    Rulff, Joao
    Xenopoulos, Peter
    Chan, Gromit Yeuk-Yin
    Barr, Brian
    Nonato, Luis Gustavo
    Silva, Claudio
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (12) : 7763 - 7775
  • [26] A Family of Two-Dimensional Benchmark Data Sets and Its Application to Comparing Different Cluster Validation Indices
    Santos, Jorge M.
    Embrechts, Mark
    PATTERN RECOGNITION, MCPR 2014, 2014, 8495 : 41 - +
  • [27] Discrete perfect sets and their application in cluster analysis
    Agayan S.M.
    Bogoutdinov S.R.
    Dobrovolsky M.N.
    Cybernetics and Systems Analysis, 1600, Springer Science and Business Media, LLC (50): : 176 - 190
  • [28] Visual Analytics Toolkit for Cluster-Based Classification of Mobility Data
    Andrienko, Gennady
    Andrienko, Natalia
    Rinzivillo, Salvatore
    Nanni, Mirco
    Pedreschi, Dino
    ADVANCES IN SPATIAL AND TEMPORAL DATABASES, PROCEEDINGS, 2009, 5644 : 432 - +
  • [29] Application of Visual Analytics to Ocean Science: Case Studies
    Matsuoka, Daisuke
    Araki, Fumiaki
    Yamashita, Yumi
    2014 IEEE PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS), 2014, : 340 - 341
  • [30] Interactive Visual Analytics Application for Spatiotemporal Movement Data
    Guan Yifei
    Seong, Kam Tin
    2017 IEEE CONFERENCE ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY (VAST), 2017, : 195 - 196