Refreshing data warehouses with near real-time updates

被引:0
|
作者
Rahman, Nayem [1 ]
机构
[1] Intel Corp, Business Intelligence Serv, Aloha, OR 97002 USA
关键词
data warehouse; near real-time; real-time; observation timestamp; metadata; incremental updates;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In traditional decision support systems, data warehouses have been used to analyze historical information. In the past it was relatively easy to keep data acquisition and maintenance activities to an as-needed basis by using batch windows at night when the business users went home. Now, however, decision makers need up-to-date information to make strategic business decisions, requiring data warehouses to be refreshed several times a day. This paper presents a technical outline for a near real-time decision support system where data warehouses are refreshed using a metadata model and incremental refreshes to increase the frequency of batch cycle runs. We propose a staging area in the data warehouse to capture data updates from external sources. Based on new data in the staging tables, we propose to load the actual analytical tables in the data warehouse using the database system as a transformation engine. We also propose making the database transformation tasks, such as stored procedures execution, metadata driven. The metadata model lets the stored procedures in different business and analytical subject areas run only when source data changes in the source subject area tables, and then implements a delta refresh of tables for which new data has arrived from the operational databases. Skipping unnecessary loads via this metadata-driven approach allows for faster cycle refreshes. The cycle refresh time statistics captured from an actual production data warehouse demonstrate the excellent reductions in cycle times achieved by our batch technique.
引用
收藏
页码:71 / 80
页数:10
相关论文
共 50 条
  • [41] Dealing with query contention issue in real-time data warehouses by dynamic multi-level caches
    Lin, Ziyu
    Yang, Dongqing
    Song, Guojie
    Wang, Tengjiao
    2007 CIT: 7TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, : 122 - +
  • [42] Triggered Updates for Temporal Consistency in Real-Time Databases
    Quazi N. Ahmed
    Susan V. Vrbsky
    Real-Time Systems, 2000, 19 : 209 - 243
  • [43] A New Method for Real-Time PPP Correction Updates
    Gao, Yang
    Zhang, Wentao
    Li, Yihe
    INTERNATIONAL SYMPOSIUM ON EARTH AND ENVIRONMENTAL SCIENCES FOR FUTURE GENERATIONS, 2018, 147 : 223 - 228
  • [44] Propagating updates in real-time search:: HLRTA*(k)
    Hernandez, Carlos
    Meseguer, Pedro
    CURRENT TOPICS IN ARTIFICIAL INTELLIGENCE, 2006, 4177 : 379 - 388
  • [45] Triggered updates for temporal consistency in real-time databases
    Ahmed, QN
    Vrbsky, SV
    REAL-TIME SYSTEMS, 2000, 19 (03) : 209 - 243
  • [46] Real-time traffic updates in moving objects Databases
    Trajcevski, G
    Wolfson, O
    Xu, B
    Nelson, P
    13TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2002, : 698 - 702
  • [47] REAL-TIME STOCK CONTROL SYSTEM - WAREHOUSES OF MILES-DRUCE-GROUP ARE LINKED BY A REAL-TIME COMPUTING NETWORK
    不详
    DATA PROCESSING, 1968, 10 (03): : 162 - 166
  • [48] A Dynamic Jamming Game for Real-Time Status Updates
    Xiao, Yuanzhang
    Sun, Yin
    IEEE INFOCOM 2018 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2018, : 354 - 360
  • [49] Propagating updates in real-time search:: FALCONS(k)
    Hernández, C
    Meseguer, P
    SCCC 2005: XXV International Conference of the Chilean Computer Science Society, Proceedings, 2005, : 37 - 44
  • [50] Real-Time Data ETL Framework for Big Real-Time Data Analysis
    Li, Xiaofang
    Mao, Yingchi
    2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 1289 - 1294