Multidimensional Model Design using Data Mining: A Rapid Prototyping Methodology

被引:11
作者
Bimonte, Sandro [1 ]
Sautot, Lucile [2 ]
Journaux, Ludovic [3 ]
Faivre, Bruno [4 ]
机构
[1] IRSTEA, Clermont Ferrand, France
[2] AgroParisTech, TETIS, Montpellier, France
[3] AgroSupDijon, LE21, Dijon, France
[4] Univ Burgundy Franche Comte, UMR CNRS BioGeoSci, Dijon, France
关键词
Data Mining; Data Warehouse; Methodologies and Tools; OLAP; DATA WAREHOUSES; UML PROFILE; CONCEPTUAL-MODEL;
D O I
10.4018/IJDWM.2017010101
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Designing and building a Data Warehouse (DW), and associated OLAP cubes, are long processes, during which decision-maker requirements play an important role. But decision-makers are not OLAP experts and can find it difficult to deal with the concepts behind DW and OLAP. To support DW design in this context, we propose: (i) a new rapid prototyping methodology, integrating two different DM algorithms, to define dimension hierarchies according to decision-maker knowledge; (ii) a complete UML Profile, to define a DW schema that integrates both the DM algorithms; (iii) a mapping process to transform multidimensional schemata according to the results of the DM algorithms; (iv) a tool implementing the proposed methodology; (v) a full validation, based on a real case study concerning bird biodiversity. In conclusion, we confirm the rapidity and efficacy of our methodology and tool in providing a multidimensional schema to satisfy decision-maker analytical needs.
引用
收藏
页码:1 / 35
页数:35
相关论文
共 61 条
[41]   Contextualizing data warehouses with documents [J].
Perez-Martinez, Juan Manuel ;
Berlanga-Llavori, Rafael ;
Aramburu-Cabo, Maria Jose ;
Pedersen, Torben Bach .
DECISION SUPPORT SYSTEMS, 2008, 45 (01) :77-94
[42]  
Phipps C., 2002, P 4 INT WORKSH DES M
[43]  
Ralph C.J., 1981, Estimating the numbers of terrestrial birds
[44]  
Rehman N. U., 2012, ISMIS 12, P425
[45]  
Rizzi S, 2004, P INT WORKSH PATT RE
[46]   Automatic validation of requirements to support multidimensional design [J].
Romero, Oscar ;
Abello, Alberto .
DATA & KNOWLEDGE ENGINEERING, 2010, 69 (09) :917-942
[47]   A Survey of Multidimensional Modeling Methodologies [J].
Romero, Oscar ;
Abello, Alberto .
INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2009, 5 (02) :1-23
[48]   The hierarchical agglomerative clustering with Gower index: A methodology for automatic design of OLAP cube in ecological data processing context [J].
Sautot, Lucile ;
Faivre, Bruno ;
Journaux, Ludovic ;
Molin, Paul .
ECOLOGICAL INFORMATICS, 2015, 26 :217-230
[49]   A methodology and tool for rapid prototyping of data warehouses using data mining: Application to birds biodiversity [J].
Sautot, Lucile ;
Bimonte, Sandro ;
Journaux, Ludovic ;
Faivre, Bruno .
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8748 :250-257
[50]   Face detection using discriminating feature analysis and Support Vector Machine [J].
Shih, PC ;
Liu, CJ .
PATTERN RECOGNITION, 2006, 39 (02) :260-276