Integrated Access to Big Data Polystores through a Knowledge-driven Framework

被引:0
|
作者
McHugh, Justin [1 ]
Cuddihy, Paul E. [1 ]
Williams, Jenny Weisenberg [1 ]
Aggour, Kareem S. [1 ]
Kumar, Vijay S. [1 ]
Mulwad, Varish [1 ]
机构
[1] GE Global Res, AI & Machine Learning Knowledge Serv & Big Data, Niskayuna, NY 12309 USA
关键词
semantic modeling; knowledge representation; big data; data integration; query processing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent successes of commercial cognitive and AI applications have cast a spotlight on knowledge graphs and the benefits of consuming structured semantic data. Today, knowledge graphs are ubiquitous to the extent that organizations often view them as a "single source of truth" for all of their data and other digital artifacts. In most organizations, however, Big Data comes in many different forms including time series, images, and unstructured text, which often are not suitable for efficient storage within a knowledge graph. This paper presents the Semantics Toolkit (SemTK), a framework that enables access to polyglotpersistent Big Data stores while giving the appearance that all data is fully captured within a knowledge graph. SemTK allows data to be stored across multiple storage platforms (e.g., Big Data stores such as Hadoop, graph databases, and semantic triple stores) - with the best-suited platform adopted for each data type - while maintaining a single logical interface and point of access, thereby giving users a knowledge-driven veneer across their data. We describe the ease of use and benefits of constructing and querying polystore knowledge graphs with SemTK via four industrial use cases at GE.
引用
收藏
页码:1494 / 1503
页数:10
相关论文
共 50 条
  • [21] A Framework for Knowledge-Driven Innovation in Small and Medium Enterprises
    Lee, Jia En
    Rosdi, Intan Soraya
    Wai, Chew Kok
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INNOVATION AND ENTREPRENEURSHIP (ICIE 2017), 2017, : 69 - 77
  • [22] Structured reviews for data and knowledge-driven research
    Queralt-Rosinach, Nuria
    Stupp, Gregory S.
    Li, Tong Shu
    Mayers, Michael
    Hoatlin, Maureen E.
    Might, Matthew
    Good, Benjamin M.
    Su, Andrew I.
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2020,
  • [23] Data-driven tools for the optimization of a pharmaceutical process through its knowledge-driven model
    Castaldello, Christopher
    Facco, Pierantonio
    Bezzo, Fabrizio
    Georgakis, Christos
    Barolo, Massimiliano
    AICHE JOURNAL, 2023, 69 (04)
  • [24] Integrating knowledge-driven and data-driven approaches to modeling
    Todorovski, L
    Dzeroski, S
    ECOLOGICAL MODELLING, 2006, 194 (1-3) : 3 - 13
  • [25] A General Paradigm of Knowledge-driven and Data-driven Fusion
    Hu, Fei
    Zhong, Wei
    Ye, Long
    Duan, Danting
    Zhang, Qin
    2023 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE, ICACI, 2023,
  • [26] Knowledge-driven Analysis Framework of Anomaly Propagation in Manufacturing Workshop
    Wang Shengbo
    Guo Yu
    2023 INTERNATIONAL CONFERENCE ON ADVANCED ENTERPRISE INFORMATION SYSTEM, AEIS 2023, 2023, : 58 - 62
  • [27] A Knowledge-driven Data Warehouse Model for Analysis Evolution
    Favre, Cecile
    Bentayeb, Fadila
    Boussaid, Omar
    LEADING THE WEB IN CONCURRENT ENGINEERING: NEXT GENERATION CONCURRENT ENGINEERING, 2006, 143 : 271 - +
  • [28] A Knowledge-Driven Anomaly Detection Framework for Social Production System
    Li, Zheng
    Xu, Xiaolong
    Hang, Tian
    Xiang, Haolong
    Cui, Yan
    Qi, Lianyong
    Zhou, Xiaokang
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (03) : 3179 - 3192
  • [29] A knowledge-driven approach for designing data analytics platforms
    Bandara, Madhushi
    Rabhi, Fethi A.
    Bano, Muneera
    REQUIREMENTS ENGINEERING, 2023, 28 (02) : 195 - 212
  • [30] Semantic Water Data Translation: A Knowledge-driven Approach
    Shu, Yanfeng
    Ratcliffe, David
    Taylor, Kerry
    Wu, Jemma
    Ackland, Ross
    Terhorst, Andrew
    PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (IDEAS '10), 2010, : 52 - 60