Representation of conceptual ETL designs in natural language using Semantic Web technology

Cited by: 23
Authors
Simitsis, Alkis [1]
Skoutas, Dimitrios [2]
Castellanos, Malu [1]
Affiliations
[1] IIML, Hewlett Packard Labs, Palo Alto, CA 94304 USA
[2] Natl Tech Univ Athens, Knowledge & Database Syst Lab, Athens, Greece
Keywords
ETL; Data warehouses; Conceptual model; Natural language; Ontologies; Semantic Web; Metadata
DOI
10.1016/j.datak.2009.08.009
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Extract-Transform-Load (ETL) processes constitute the back stage of Data Warehouse architectures. Several studies characterize the ETL design as a time-consuming and error-prone procedure. A critical phase in the ETL lifecycle involves the early communications and design steps that aim at producing a conceptual ETL design. Various research approaches have dealt with the conceptual modeling of ETL processes, but all share two inconveniences: they require intensive human effort from the designers to create them, as well as technical knowledge from the business people to understand them. In this paper, we focus on the second aspect and provide a method for the representation of a conceptual ETL design as a narrative, which is the most natural means of communication and does not require particular technical skills or familiarity with any specific model. Specifically, this work builds upon previously proposed techniques that automate the conceptual design by leveraging Semantic Web technology. The key idea is to map the involved data stores, either source or target, to a domain ontology and then, to use a reasoner for producing the ETL design. We discuss how linguistic techniques can be used for the establishment of a common application vocabulary. We present a flexible and customizable template-based mechanism for the representation of the ETL design as a narrative. Finally, we discuss issues related to the production of meaningful reports and we provide implementation details. (C) 2009 Elsevier B.V. All rights reserved.
Pages: 96-115
Page count: 20
Related papers
50 in total
  • [1] A Controlled Natural Language for the Semantic Web
    Schwitter, Rolf
    JOURNAL OF INTELLIGENT SYSTEMS, 2008, 17 (1-3) : 125 - 141
  • [2] Sustainable building technology knowledge representation: Using Semantic Web techniques
    Tah, Joseph H. M.
    Abanda, Henry F.
    ADVANCED ENGINEERING INFORMATICS, 2011, 25 (03) : 547 - 558
  • [3] SWSNL: Semantic Web Search Using Natural Language
    Habernal, Ivan
    Konopik, Miloslav
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (09) : 3649 - 3664
  • [4] Analysis of natural language understanding technology based on Semantic Web ontology
    Wang, Yi
    Zhang, Jianming
    Cao, Zhenjie
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON MECHATRONICS, ELECTRONIC, INDUSTRIAL AND CONTROL ENGINEERING, 2015, 8 : 889 - 893
  • [5] Natural Language Processing Resources: Using Semantic Web Technologies
    Pohorec, Sandi
    Ceh, Ines
    Zorman, Milan
    Mernik, Marjan
    Kokol, Peter
    PROCEEDINGS OF THE ITI 2012 34TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES (ITI), 2012, : 397 - 402
  • [6] OOML (Object Oriented Semantic Web Markup Language): Improved markup language for semantic web using object oriented technology
    Lee, K
    Yoo, M
    Chung, I
    Min, J
    Park, K
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XIV, PROCEEDINGS: COMPUTER SCIENCE, ENGINEERING AND APPLICATIONS, 2003, : 259 - 264
  • [7] Language-Agnostic Knowledge Representation for a Truly Multilingual Semantic Web
    Jain, Sarika
    Kysliak, Anastasiia
    INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2022, 13 (01)
  • [8] The semantic web as a linguistic resource: Opportunities for natural language generation
    Mellish, Chris
    Sun, Xiantang
    KNOWLEDGE-BASED SYSTEMS, 2006, 19 (05) : 298 - 303
  • [9] Using a natural language understanding system to generate semantic web content
    Java, Akshay
    Nirenburg, Sergei
    McShane, Marjorie
    Finin, Timothy
    English, Jesse
    Joshi, Anupam
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2007, 3 (04) : 50 - 74
  • [10] Improved markup language for semantic web using object oriented technology
    Lee, KC
    Yoo, MW
    Chung, IJ
    Min, JH
    Park, K
    APCC 2003: 9TH ASIA-PACIFIC CONFERENCE ON COMMUNICATION, VOLS 1-3, PROCEEDINGS, 2003, : 330 - 334