Effective Tooling for Linked Data Publishing in Scientific Research

被引:6
|
作者
Purohit, Sumit [1 ]
Smith, William [1 ]
Chappell, Alan [1 ]
Stephan, Eric [1 ]
West, Patrick [2 ]
Lee, Benno [2 ]
Fox, Peter [2 ]
机构
[1] Pacific Northwest Natl Lab, Richland, WA 99354 USA
[2] Rensselaer Polytech Inst, Tetherless World Constellat, Troy, NY 12180 USA
来源
2016 IEEE TENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC) | 2016年
关键词
Linked Data Publishing; Semantic Data Curation; Data Publishing Tools; Data Discovery; BENCHMARK; ACCESS; SYSTEM;
D O I
10.1109/ICSC.2016.87
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Challenges that make it difficult to find, share, and combine published data, such as data heterogeneity and resource discovery, have led to increased adoption of semantic data standards and data publishing technologies. To make data more accessible, interconnected and discoverable, some domains are being encouraged to publish their data as Linked Data. Consequently, this trend greatly increases the amount of data that semantic web tools are required to process, store, and interconnect. In attempting to process and manipulate large data sets, tools-ranging from simple text editors to modern triplestores-eventually breakdown upon reaching undefined thresholds. This paper shares our experiences in curating metadata, primarily to illustrate the challenges, and resulting limitations that data publishers and consumers have in the current technological environment. This paper also provides a Linked Data based solution to the research problem of resource discovery, and offers a systematic approach that the data publishers can take to select suitable tools to meet their data publishing needs. We present a real-world use case, the Resource Discovery for Extreme Scale Collaboration (RDESC), which features a scientific dataset(maximum size of 1.4 billion triples) used to evaluate a toolbox for data publishing in climate research. This paper also introduces a semantic data publishing software suite developed for the RDESC project.
引用
收藏
页码:24 / 31
页数:8
相关论文
共 50 条
  • [31] OPPORTUNITIES FOR HISTORICAL AND PSYCHOLOGICAL RESEARCH OF SCIENTIFIC TRADITIONS
    Artem'eva, O. A.
    PSIKHOLOGICHESKII ZHURNAL, 2025, 46 (01) : 108 - 112
  • [32] Data-driven method for mobile game publishing revenue forecast
    Yanhui Su
    Per Backlund
    Henrik Engström
    Service Oriented Computing and Applications, 2022, 16 : 67 - 76
  • [33] Accelerating imaging research at large-scale scientific facilities through scientific computing
    Wang, Chunpeng
    Li, Xiaoyun
    Wan, Rongzheng
    Chen, Jige
    Ye, Jing
    Li, Ke
    Li, Aiguo
    Tai, Renzhong
    Sepe, Alessandro
    JOURNAL OF SYNCHROTRON RADIATION, 2024, 31 : 1317 - 1326
  • [34] Data-driven method for mobile game publishing revenue forecast
    Su, Yanhui
    Backlund, Per
    Engstrom, Henrik
    SERVICE ORIENTED COMPUTING AND APPLICATIONS, 2022, 16 (01) : 67 - 76
  • [35] A composable data management architecture for scientific applications
    Ma, Y
    Bramley, R
    CLADE 2005: CHALLENGES OF LARGE APPLICATIONS IN DISTRIBUTED ENVIRONMENTS, PROCEEDINGS, 2005, : 35 - 44
  • [36] Scientific Data: Increasing Transparency and Reducing the Grey
    Carroll, Bonnie C.
    Crowe, June
    Candlish, J. R.
    TRANSPARENCY IN GREY LITERATURE: GREY TECH APPROACHES TO HIGH TECH ISSUES, 2011, 12 : 83 - 89
  • [37] A Performance Evaluation of Hive for Scientific Data Management
    Liu, Taoying
    Liu, Jing
    Liu, Hong
    Li, Wei
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [38] A Conceptual Modelling Approach to Visualising Linked Data
    McBrien, Peter
    Poulovassilis, Alexandra
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2019 CONFERENCES, 2019, 11877 : 227 - 245
  • [39] Research on the Evaluation of Scientific Research Performance of Local Universities Based on Balanced Scorecard
    Yang De-shan
    Xu Ai-zhen
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON PUBLIC ADMINISTRATION (5TH), VOL III, 2009, : 579 - 585
  • [40] Where Big Data meets Linked Data: Applying standard data models to environmental data streams
    Leadbetter, Adam
    Smyth, Damian
    Fuller, Robert
    O'Grady, Eoin
    Shepherd, Adam
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 2929 - 2937