A relational data harmonization approach to XML

被引:10
作者
Niemi, Timo [1 ]
Nappila, Turkka [1 ]
Jarvelin, Kalervo [2 ]
机构
[1] Univ Tampere, Dept Comp Sci, FI-33014 Tampere, Finland
[2] Univ Tampere, Dept Informat Studies, FI-33014 Tampere, Finland
基金
芬兰科学院;
关键词
data harmonization; data heterogeneity; XML; OLAP;
D O I
10.1177/0165551509104231
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There are numerous approaches for integrating data from heterogeneous data sources. A common background assumption is that the data sources remain quite stable and are known in advance. Hence an integration system can be built to manipulate them. In practice there is, however, often a demand for supporting ad hoc information needs concerning unexpected autonomous data sources containing volatile data. A different approach is therefore needed. We propose that semantically similar data are harmonized when extracting data from XML-based data sources. We introduce a constructor algebra, which is a powerful tool in the harmonization of XML data. This algebra is able to form for any XML data source a unique relational representation, called an XML relation. We demonstrate that the XML relation representation supports grouping and aggregation of data needed, for example, in OLAP (online analytical processing) -style applicationsn
引用
收藏
页码:571 / 601
页数:31
相关论文
共 43 条
[1]  
ABITEBOUL S, 2001, IEEE DATA ENG B, V24, P3
[2]   Views in a large-scale XML repository [J].
Aguilera, V ;
Cluet, S ;
Milo, T ;
Veltri, P ;
Vodislav, D .
VLDB JOURNAL, 2002, 11 (03) :238-255
[3]  
[Anonymous], P 13 INT C VER LARG
[4]  
[Anonymous], 2007, XQuery 1.0: An XML Query Language
[5]  
[Anonymous], P VLDB
[6]  
Arenas Marcelo., 2005, PODS, P13
[7]  
BEYER K., 2005, P 2005 ACM SIGMOD IN, P347
[8]  
Beyer K., 2005, SIGMOD 05 P 2005 ACM, P503
[9]  
Bray T., Extensible markup language (xml) 1.0
[10]  
BREITBART Y, 1986, P IEEE INT C DAT ENG, P301