Effective Tooling for Linked Data Publishing in Scientific Research

被引:6
|
作者
Purohit, Sumit [1 ]
Smith, William [1 ]
Chappell, Alan [1 ]
Stephan, Eric [1 ]
West, Patrick [2 ]
Lee, Benno [2 ]
Fox, Peter [2 ]
机构
[1] Pacific Northwest Natl Lab, Richland, WA 99354 USA
[2] Rensselaer Polytech Inst, Tetherless World Constellat, Troy, NY 12180 USA
来源
2016 IEEE TENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC) | 2016年
关键词
Linked Data Publishing; Semantic Data Curation; Data Publishing Tools; Data Discovery; BENCHMARK; ACCESS; SYSTEM;
D O I
10.1109/ICSC.2016.87
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Challenges that make it difficult to find, share, and combine published data, such as data heterogeneity and resource discovery, have led to increased adoption of semantic data standards and data publishing technologies. To make data more accessible, interconnected and discoverable, some domains are being encouraged to publish their data as Linked Data. Consequently, this trend greatly increases the amount of data that semantic web tools are required to process, store, and interconnect. In attempting to process and manipulate large data sets, tools-ranging from simple text editors to modern triplestores-eventually breakdown upon reaching undefined thresholds. This paper shares our experiences in curating metadata, primarily to illustrate the challenges, and resulting limitations that data publishers and consumers have in the current technological environment. This paper also provides a Linked Data based solution to the research problem of resource discovery, and offers a systematic approach that the data publishers can take to select suitable tools to meet their data publishing needs. We present a real-world use case, the Resource Discovery for Extreme Scale Collaboration (RDESC), which features a scientific dataset(maximum size of 1.4 billion triples) used to evaluate a toolbox for data publishing in climate research. This paper also introduces a semantic data publishing software suite developed for the RDESC project.
引用
收藏
页码:24 / 31
页数:8
相关论文
共 50 条
  • [1] Modeling, Generating, and Publishing Knowledge as Linked Data
    Dimou, Anastasia
    Heyvaert, Pieter
    Taelman, Ruben
    Verborgh, Ruben
    KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, 2017, 10180 : 3 - 14
  • [2] Logical Foundations of Privacy-Preserving Publishing of Linked Data
    Grau, Bernardo Cuenca
    Kostylev, Egor V.
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 943 - 949
  • [3] Publishing Eurac Research data on the GEOSS Platform
    Roncella, Roberto
    Ventura, Bartolomeo
    Vianello, Andrea
    Boldrini, Enrico
    Santoro, Mattia
    Mazzetti, Paolo
    Nativi, Stefano
    BIG EARTH DATA, 2023, 7 (02) : 428 - 450
  • [4] The strain on scientific publishing
    Hanson, Mark A.
    Barreiro, Pablo Gomez
    Crosetto, Paolo
    Brockington, Dan
    QUANTITATIVE SCIENCE STUDIES, 2024, 5 (04): : 823 - 843
  • [5] Ten guidelines for effective data visualization in scientific publications
    Kelleher, Christa
    Wagener, Thorsten
    ENVIRONMENTAL MODELLING & SOFTWARE, 2011, 26 (06) : 822 - 827
  • [6] Challenges as enablers for high quality Linked Data: insights from the Semantic Publishing Challenge
    Dimou, Anastasia
    Vahdati, Sahar
    Di Iorio, Angelo
    Lange, Christoph
    Verborgh, Ruben
    Mannens, Erik
    PEERJ COMPUTER SCIENCE, 2017,
  • [7] Analysis of Scientific Production of IoE Big Data Research
    Kaur, Jaswinder
    Wongthongtham, Pornpit
    Abu-Salih, Bilal
    Fathy, Sogand
    2018 32ND INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA), 2018, : 715 - 720
  • [8] Semantic Finlex: Transforming, Publishing, and Using Finnish Legislation and Case Law As Linked Open Data on the Web
    Oksanen, Arttu
    Tamper, Minna
    Tuominen, Jouni
    Makela, Eetu
    Hietanen, Aki
    Hyvonen, Eero
    KNOWLEDGE OF THE LAW IN THE BIG DATA AGE, 2019, 317 : 212 - 228
  • [9] Publishing publicly available interview data: an empirical example of the experience of publishing interview data
    Enriquez, Diana
    FRONTIERS IN SOCIOLOGY, 2024, 9
  • [10] Linked Data in Education: A Survey and a Synthesis of Actual Research and Future Challenges
    Pereira, Crystiam Kelle
    Matsui Siqueira, Sean Wolfgand
    Nunes, Bernardo Pereira
    Dietze, Stefan
    IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2018, 11 (03): : 400 - 412