Integrated Access to Big Data Polystores through a Knowledge-driven Framework

被引:0
|
作者
McHugh, Justin [1 ]
Cuddihy, Paul E. [1 ]
Williams, Jenny Weisenberg [1 ]
Aggour, Kareem S. [1 ]
Kumar, Vijay S. [1 ]
Mulwad, Varish [1 ]
机构
[1] GE Global Res, AI & Machine Learning Knowledge Serv & Big Data, Niskayuna, NY 12309 USA
关键词
semantic modeling; knowledge representation; big data; data integration; query processing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent successes of commercial cognitive and AI applications have cast a spotlight on knowledge graphs and the benefits of consuming structured semantic data. Today, knowledge graphs are ubiquitous to the extent that organizations often view them as a "single source of truth" for all of their data and other digital artifacts. In most organizations, however, Big Data comes in many different forms including time series, images, and unstructured text, which often are not suitable for efficient storage within a knowledge graph. This paper presents the Semantics Toolkit (SemTK), a framework that enables access to polyglotpersistent Big Data stores while giving the appearance that all data is fully captured within a knowledge graph. SemTK allows data to be stored across multiple storage platforms (e.g., Big Data stores such as Hadoop, graph databases, and semantic triple stores) - with the best-suited platform adopted for each data type - while maintaining a single logical interface and point of access, thereby giving users a knowledge-driven veneer across their data. We describe the ease of use and benefits of constructing and querying polystore knowledge graphs with SemTK via four industrial use cases at GE.
引用
收藏
页码:1494 / 1503
页数:10
相关论文
共 50 条
  • [41] Knowledge-Driven Interpretation of Multi-View Data in Medicine
    Pillai, Parvathy Sudhir
    Feng, Lei
    Leong, Tze-Yun
    BUILDING CONTINENTS OF KNOWLEDGE IN OCEANS OF DATA: THE FUTURE OF CO-CREATED EHEALTH, 2018, 247 : 745 - 749
  • [42] Data and knowledge-driven named entity recognition for cyber security
    Chen Gao
    Xuan Zhang
    Hui Liu
    Cybersecurity, 4
  • [43] An integrated approach to knowledge-driven structure-based virtual screening
    Angela M. Henzler
    Sascha Urbaczek
    Matthias Hilbig
    Matthias Rarey
    Journal of Computer-Aided Molecular Design, 2014, 28 : 927 - 939
  • [44] Data and knowledge-driven named entity recognition for cyber security
    Gao, Chen
    Zhang, Xuan
    Liu, Hui
    CYBERSECURITY, 2021, 4 (01)
  • [45] A Data-Driven and Knowledge-Driven Method towards the IRP of Modern Logistics
    Wang, Tiexin
    Wu, Yi
    Lamothe, Jacques
    Benaben, Frederick
    Wang, Ruofan
    Liu, Wenjing
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [46] Framework of Integrated Big Data: A Review
    Chen, Zhikui
    Zhong, Fangming
    Yuan, Xu
    Hu, Yueming
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2016, : 158 - 162
  • [47] KNOWLEDGE-DRIVEN ANALYSIS AND DATA INTEGRATION FOR HIGH-THROUGHPUT BIOLOGICAL DATA
    Ochs, M. F.
    Quackenbush, J.
    Davuluri, R.
    Ressom, H.
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2009, 2009, : 353 - +
  • [48] A Knowledge-Driven Framework for a Decision Support Platform in Sustainable Viticulture: Integrating Climate Data and Supporting Stakeholder Collaboration
    Simeunovic, Marko
    Ratkovic, Kruna
    Kovac, Natasa
    Rackovic, Tamara
    Fernandes, Antonio
    SUSTAINABILITY, 2025, 17 (04)
  • [49] A Service-Oriented Framework for Big Data-Driven Knowledge Management Systems
    Thang Le Dinh
    Thuong-Cang Phan
    Trung Bui
    Manh Chien Vu
    EXPLORING SERVICES SCIENCE (IESS 2016), 2016, 247 : 509 - 521
  • [50] A Knowledge-Driven Network-Based Analytical Framework for the Identification of Rumen Metabolites
    Wang, Mengyuan
    Wang, Haiying
    Zheng, Huiru
    Dewhurst, Richard
    Roehe, Rainer
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2020, 19 (03) : 518 - 526