Visualizing large knowledge graphs: A performance analysis

被引:22
作者
Gomez-Romero, Juan [1 ,2 ]
Molina-Solana, Miguel [2 ]
Oehmichen, Axel [2 ]
Guo, Yike [2 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, C Periodista Daniel Saucedo Aranda S-N, E-18071 Granada, Spain
[2] Imperial Coll London, Data Sci Inst, London, England
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2018年 / 89卷
关键词
Graphs; Visualization; Big data; Linked data; Performance analysis; LARGE-SCALE; LINKED DATA; EXPLORATION; ONTOLOGIES;
D O I
10.1016/j.future.2018.06.015
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Knowledge graphs are an increasingly important source of data and context information in Data Science. A first step in data analysis is data exploration, in which visualization plays a key role. Currently, Semantic Web technologies are prevalent for modeling and querying knowledge graphs; however, most visualization approaches in this area tend to be overly simplified and targeted to small-sized representations. In this work, we describe and evaluate the performance of a Big Data architecture applied to large-scale knowledge graph visualization. To do so, we have implemented a graph processing pipeline in the Apache Spark framework and carried out several experiments with real-world and synthetic graphs. We show that distributed implementations of the graph building, metric calculation and layout stages can efficiently manage very large graphs, even without applying partitioning or incremental processing strategies. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:224 / 238
页数:15
相关论文
共 77 条
  • [1] Statistical mechanics of complex networks
    Albert, R
    Barabási, AL
    [J]. REVIEWS OF MODERN PHYSICS, 2002, 74 (01) : 47 - 97
  • [2] [Anonymous], 2002, Information visualization in data mining and knowledge discovery
  • [3] Arleo Alessio, 2018, Graph Drawing and Network Visualization. 25th International Symposium, GD 2017. Revised Selected Papers: LNCS 10692, P256, DOI 10.1007/978-3-319-73915-1_21
  • [4] Large graph visualizations using a distributed computing platform
    Arleo, Alessio
    Didimo, Walter
    Liotta, Giuseppe
    Montecchiani, Fabrizio
    [J]. INFORMATION SCIENCES, 2017, 381 : 124 - 141
  • [5] Atemezing G. A., 2014, P C CONS LINK DAT CO, V1264, P1
  • [6] Baader F., 2010, DESCRIPTION LOGIC HD, V2nd
  • [7] Visualizing Populated Ontologies with OntoTrix
    Bach, Benjamin
    Pietriga, Emmanuel
    Liccardi, Ilaria
    [J]. INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2013, 9 (04) : 17 - 40
  • [8] Work efficient parallel algorithms for large graph exploration on emerging heterogeneous architectures
    Banerjee, Dip Sankar
    Kumar, Ashutosh
    Chaitanya, Meher
    Sharma, Shashank
    Kothapalli, Kishore
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2015, 76 : 81 - 93
  • [9] Bastian M., 2009, 3 INT AAAI C WEBLOGS, DOI [10.13140/2.1.1341.1520, DOI 10.1609/ICWSM.V3I1.13937]
  • [10] Bikakis N., 2016, P EDBT ICDT 2016 JT