Refreshing data warehouses with near real-time updates

被引:0
|
作者
Rahman, Nayem [1 ]
机构
[1] Intel Corp, Business Intelligence Serv, Aloha, OR 97002 USA
关键词
data warehouse; near real-time; real-time; observation timestamp; metadata; incremental updates;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In traditional decision support systems, data warehouses have been used to analyze historical information. In the past it was relatively easy to keep data acquisition and maintenance activities to an as-needed basis by using batch windows at night when the business users went home. Now, however, decision makers need up-to-date information to make strategic business decisions, requiring data warehouses to be refreshed several times a day. This paper presents a technical outline for a near real-time decision support system where data warehouses are refreshed using a metadata model and incremental refreshes to increase the frequency of batch cycle runs. We propose a staging area in the data warehouse to capture data updates from external sources. Based on new data in the staging tables, we propose to load the actual analytical tables in the data warehouse using the database system as a transformation engine. We also propose making the database transformation tasks, such as stored procedures execution, metadata driven. The metadata model lets the stored procedures in different business and analytical subject areas run only when source data changes in the source subject area tables, and then implements a delta refresh of tables for which new data has arrived from the operational databases. Skipping unnecessary loads via this metadata-driven approach allows for faster cycle refreshes. The cycle refresh time statistics captured from an actual production data warehouse demonstrate the excellent reductions in cycle times achieved by our batch technique.
引用
收藏
页码:71 / 80
页数:10
相关论文
共 50 条
  • [21] Differential Encoding for Real-Time Status Updates
    Bhambay, Sanidhay
    Poojary, Sudheer
    Parag, Parimal
    2017 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2017,
  • [22] Scheduling Updates in a Real-Time Stream Warehouse
    Golab, Lukasz
    Johnson, Theodore
    Shkapenyuk, Vladislav
    ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 1207 - 1210
  • [23] Real-Time Status Updates for Correlated Source
    Poojary, Sudheer
    Bhambay, Sanidhay
    Parag, Parimal
    2017 IEEE INFORMATION THEORY WORKSHOP (ITW), 2017, : 274 - 278
  • [24] Real-Time Status Updates for Markov Source
    Poojary, Sudheer
    Bhambay, Sanidhay
    Parag, Parimal
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2019, 65 (09) : 5737 - 5749
  • [25] The Use of Real-Time Online Updates for Physicians
    Gaudette, Renee
    Yarcusko, John
    Saif, Muhammad Wasif
    JOURNAL OF THE PANCREAS, 2009, 10 (04): : 351 - 351
  • [26] Scalable Scheduling of Updates in Streaming Data Warehouses
    Golab, Lukasz
    Johnson, Theodore
    Shkapenyuk, Vladislav
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (06) : 1092 - 1105
  • [27] Real-Time or Near Real-Time Persisting Daily Healthcare Data Into HDFS and ElasticSearch Index Inside a Big Data Platform
    Chen, Dequan
    Chen, Yi
    Brownlow, Brian N.
    Kanjamala, Pradip P.
    Arredondo, Carlos A. Garcia
    Radspinner, Bryan L.
    Raveling, Matthew A.
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2017, 13 (02) : 595 - 606
  • [28] FAST: Near Real-time Searchable Data Analytics for the Cloud
    Hua, Yu
    Jiang, Hong
    Feng, Dan
    SC14: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2014, : 754 - 765
  • [29] Balsam: Near Real-time Experimental Data Analysis on Supercomputers
    Salim, Michael A.
    Uram, Thomas D.
    Childers, J. Taylor
    Vishwanath, Venkatram
    Papka, Michael E.
    PROCEEDINGS OF XLOOP 2019: IEEE/ACM 1ST ANNUAL WORKSHOP ON LARGE-SCALE EXPERIMENT-IN-THE-LOOP COMPUTING (XLOOP), 2019, : 26 - 31
  • [30] Near Real-Time Big Data Analysis on Vehicular Networks
    Daniel, Alfred
    Paul, Anand
    Ahmad, Awais
    PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON SOFT-COMPUTING AND NETWORKS SECURITY (ICSNS 2015), 2015,