Representation of conceptual ETL designs in natural language using Semantic Web technology

被引:23
|
作者
Simitsis, Alkis [1 ]
Skoutas, Dimitrios [2 ]
Castellanos, Malu [1 ]
机构
[1] IIML, Hewlett Packard Labs, Palo Alto, CA 94304 USA
[2] Natl Tech Univ Athens, Knowledge & Database Syst Lab, Athens, Greece
关键词
ETL; Data warehouses; Conceptual model; Natural language; Ontologies; Semantic Web; Metadata;
D O I
10.1016/j.datak.2009.08.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extract-Transform-Load (ETL) processes constitute the back stage of Data Warehouse architectures. Several studies characterize the ETL design as a time-consuming and error-prone procedure. A critical phase in the ETL lifecycle involves the early communications and design steps that aim at producing a conceptual ETL design. Various research approaches have dealt with the conceptual modeling of ETL processes, but all share two inconveniences: they require intensive human effort from the designers to create them, as well as technical knowledge from the business people to understand them. In this paper, we focus on the second aspect and provide a method for the representation of a conceptual ETL design as a narrative, which is the most natural means of communication and does not require particular technical skills or familiarity with any specific model. Specifically, this work builds upon previously proposed techniques that automate the conceptual design by leveraging Semantic Web technology. The key idea is to map the involved data stores, either source or target, to a domain ontology and then, to use a reasoner for producing the ETL design. We discuss how linguistic techniques can be used for the establishment of a common application vocabulary. We present a flexible and customizable template-based mechanism for the representation of the ETL design as a narrative. Finally, we discuss issues related to the production of meaningful reports and we provide implementation details. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:96 / 115
页数:20
相关论文
共 50 条
  • [31] A reusable commitment management service using Semantic Web technology
    Preece, Alun
    Chalmers, Stuart
    McKenzie, Craig
    KNOWLEDGE-BASED SYSTEMS, 2007, 20 (02) : 143 - 151
  • [32] An Exploration of Explainable Machine Learning Using Semantic Web Technology
    Procko, Tyler
    Elvira, Timothy
    Ochoa, Omar
    Del Rio, Nicholas
    16TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2022), 2022, : 143 - 146
  • [33] Building the search pattern of web users using conceptual semantic space model
    Xu, Zheng
    Wei, Xiao
    Liu, Yunhuai
    Mei, Lin
    Hu, Chuanping
    Choo, Kim-Kwang Raymond
    Zhu, Yiwei
    Sugumaran, Vijayan
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2016, 12 (03) : 328 - 347
  • [34] Filling the gap between Web 2.0 technologies and natural language processing pipelines with semantic web the semtoolbox approach
    Durville, Priscille
    Gandon, Fabien
    2009 THIRD INTERNATIONAL CONFERENCE ON ADVANCES IN SEMANTIC PROCESSING, 2009, : 109 - +
  • [36] Context awareness using semantic web technology in the ubiquitous learning service
    Kim, Tai-Jong
    Kim, Min-Cheol
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2008, PT 1, PROCEEDINGS, 2008, 5072 : 501 - 515
  • [37] Improving life sciences information retrieval using semantic web technology
    Quan, Dennis
    BRIEFINGS IN BIOINFORMATICS, 2007, 8 (03) : 172 - 182
  • [38] Route description using natural language generation technology
    Zhang, XueYing
    INFORMATION RETRIEVAL TECHNOLOGY, 2008, 4993 : 454 - 459
  • [39] Accommodating Negation in an Efficient Event-based Natural Language Query Interface to the Semantic Web
    Peelar, Shane
    Frost, Richard A.
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES (WEBIST), 2020, : 83 - 92
  • [40] Using Semantic Web and service oriented technologies to build loosely coupled systems - SWOAT - A Service and Semantic Web Oriented Architecture Technology
    Caires, Bruno
    Cardoso, Jorge
    ICEIS 2007: PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS: DATABASES AND INFORMATION SYSTEMS INTEGRATION, 2007, : 226 - +