Contextualizing data warehouses with documents

Cited by: 22
Authors
Perez-Martinez, Juan Manuel [1]
Berlanga-Llavori, Rafael [1]
Aramburu-Cabo, Maria Jose [1]
Pedersen, Torben Bach [2]
Affiliations
[1] Jaume I Univ, Castellon de La Plana, Spain
[2] Aalborg Univ, Aalborg, Denmark
Keywords
OLAP; text-rich XML documents; information retrieval
DOI
10.1016/j.dss.2006.12.005
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Current data warehouse and OLAP technologies are applied to analyze the structured data that companies store in databases. The context that helps to understand data over time is usually described separately in text-rich documents. This paper proposes to integrate the traditional corporate data warehouse with a document warehouse, resulting in a contextualized warehouse. Thus, the user first selects an analysis context by supplying some keywords. Then, the analysis is performed on a novel type of OLAP cube, called an R-cube, which is materialized by retrieving and ranking the documents and corporate facts related to the selected context. (C) 2006 Elsevier B.V. All rights reserved.
Pages: 77-94
Number of pages: 18