A family of experiments to validate measures for UML activity diagrams of ETL processes in data warehouses

被引:19
作者
Munoz, Lilia [1 ]
Mazon, Jose-Norberto [2 ]
Trujillo, Juan [2 ]
机构
[1] Technol Univ Panama, Dept Informat Syst Control Evaluat & Comp Resourc, Lucentia Res Grp, Panama City, Panama
[2] Univ Alicante, Dept Software & Comp Syst, Lucentia Res Grp, San Vicente Del Raspeig 03080, Spain
关键词
ETL processes; Measure validation; Activity diagrams; Empirical software engineering; Data warehouse conceptual modeling; Quality; DESIGN;
D O I
10.1016/j.infsof.2010.06.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In data warehousing, Extract, Transform, and Load (ETL) processes are in charge of extracting the data from the data sources that will be contained in the data warehouse. Their design and maintenance is thus a cornerstone in any data warehouse development project. Due to their relevance, the quality of these processes should be formally assessed early in the development in order to avoid populating the data warehouse with incorrect data. To this end, this paper presents a set of measures with which to evaluate the structural complexity of ETL process models at the conceptual level. This study is, moreover, accompanied by the application of formal frameworks and a family of experiments whose aim is to theoretical and empirically validate the proposed measures, respectively. Our experiments show that the use of these measures can aid designers to predict the effort associated with the maintenance tasks of ETL processes and to make ETL process models more usable. Our work is based on Unified Modeling Language (UML) activity diagrams for modeling ETL processes, and on the Framework for the Modeling and Evaluation of Software Processes (FMESP) framework for the definition and validation of the measures. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:1188 / 1203
页数:16
相关论文
共 52 条
[1]  
[Anonymous], 1996, ISERN9610 U STRATHCL
[2]  
[Anonymous], P 6 INT C EMP ASS SO
[3]  
[Anonymous], 2008, OMG
[4]  
[Anonymous], 1995, ISERN9503
[5]  
BALTA M, 1978, USING SHANNON ENTROP, P151
[6]   THE TAME PROJECT - TOWARDS IMPROVEMENT-ORIENTED SOFTWARE ENVIRONMENTS [J].
BASILI, VR ;
ROMBACH, HD .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1988, 14 (06) :758-773
[7]   Building knowledge through families of experiments [J].
Basili, VR ;
Lanubile, F .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1999, 25 (04) :456-473
[8]  
Berenguer G, 2005, LECT NOTES COMPUT SC, V3589, P95, DOI 10.1007/11546849_10
[9]  
Briand L., 1999, EMPIR SOFTW ENG, V4, P387
[10]  
Briand L., 1998, International Software Engineering Research Network technical report ISERN-98-29