Representation of conceptual ETL designs in natural language using Semantic Web technology

被引:23
|
作者
Simitsis, Alkis [1 ]
Skoutas, Dimitrios [2 ]
Castellanos, Malu [1 ]
机构
[1] IIML, Hewlett Packard Labs, Palo Alto, CA 94304 USA
[2] Natl Tech Univ Athens, Knowledge & Database Syst Lab, Athens, Greece
关键词
ETL; Data warehouses; Conceptual model; Natural language; Ontologies; Semantic Web; Metadata;
D O I
10.1016/j.datak.2009.08.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extract-Transform-Load (ETL) processes constitute the back stage of Data Warehouse architectures. Several studies characterize the ETL design as a time-consuming and error-prone procedure. A critical phase in the ETL lifecycle involves the early communications and design steps that aim at producing a conceptual ETL design. Various research approaches have dealt with the conceptual modeling of ETL processes, but all share two inconveniences: they require intensive human effort from the designers to create them, as well as technical knowledge from the business people to understand them. In this paper, we focus on the second aspect and provide a method for the representation of a conceptual ETL design as a narrative, which is the most natural means of communication and does not require particular technical skills or familiarity with any specific model. Specifically, this work builds upon previously proposed techniques that automate the conceptual design by leveraging Semantic Web technology. The key idea is to map the involved data stores, either source or target, to a domain ontology and then, to use a reasoner for producing the ETL design. We discuss how linguistic techniques can be used for the establishment of a common application vocabulary. We present a flexible and customizable template-based mechanism for the representation of the ETL design as a narrative. Finally, we discuss issues related to the production of meaningful reports and we provide implementation details. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:96 / 115
页数:20
相关论文
共 50 条
  • [21] Representation of product model in collaborative design based on semantic web technology
    Feng, Xiangzhong
    JOURNAL OF VACUUM SCIENCE & TECHNOLOGY B, 2009, 27 (03): : 1543 - 1547
  • [22] Data Sharing of Mechanical Design Formulas Using Semantic Web Technology
    Zhou, Jun
    Watanuki, Keiichi
    JOURNAL OF ADVANCED MECHANICAL DESIGN SYSTEMS AND MANUFACTURING, 2007, 1 (04): : 530 - 540
  • [23] Meta lingua: A Language to Mediate Communication with Semantic Web in Natural Languages
    Drugus, Ioachim
    ADVANCED INFORMATION TECHNOLOGY IN EDUCATION, 2012, 126 : 109 - 115
  • [24] An Application of the Semantic Web Inspired by Human Learning and Natural Language Processing
    Spiegel, R.
    INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2006, 1 (03):
  • [25] Using semantic web technology for e-learning
    Brussee, R
    Alberink, M
    Veenstra, M
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: CONCEPTS AND APPLICATIONS OF SYSTEMICS, CYBERNETICS AND INFORMATICS I, 2002, : 448 - 451
  • [26] Aggregation of bioinformatics data using Semantic Web technology
    Stephens, Susie
    LaVigna, David
    DiLascio, Mike
    Luciano, Joanne
    JOURNAL OF WEB SEMANTICS, 2006, 4 (03): : 216 - 221
  • [27] A semantic representation of EO data for image retrieval based on natural language queries
    Polignano, Marco
    de Gemmis, Marco
    Kopsacheilis, Vasilis
    Vaitis, Michail
    Malig, Jenny
    Grether, Dominik
    Ioannou, Ilias
    Sarelli, Anastasia
    De Pasquale, Vito
    Samarelli, Sergio
    Kolokoussis, Pol
    Karamvasis, Kleanthis
    Miltiadou, Milto
    Papoutsa, Christiana
    Regniers, Olivier
    Lafon, Virginie
    Topouzelis, Konstantinos
    Despotov, Bogdan
    SIXTH INTERNATIONAL CONFERENCE ON REMOTE SENSING AND GEOINFORMATION OF THE ENVIRONMENT (RSCY2018), 2018, 10773
  • [28] Towards a semantic PACS: Using Semantic Web technology to represent imaging data
    Van Soest, Johan
    Lustberg, Tim
    Grittner, Detlef
    Marshall, M. Scott
    Persoon, Lucas
    Nijsten, Bas
    Fel Tens, Peter
    Dekker, Andre
    E-HEALTH - FOR CONTINUITY OF CARE, 2014, 205 : 166 - 170
  • [29] Semantic expansion of geographic web queries based on natural language positioning expressions
    Department of Computer Science, Federal University of Minas Gerais, Minas Gerais, Brazil
    不详
    不详
    Trans. GIS, 2007, 3 (377-397):
  • [30] A novel semantic representation approach for web documents using deep entity linking and multidocument support
    Urkude, Giridhar
    Pandey, Manju
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2022, 35 (08)