Scheduling to Minimize Staleness and Stretch in Real-Time Data Warehouses

Cited by: 0
Authors
MohammadHossein Bateni
Lukasz Golab
MohammadTaghi Hajiaghayi
Howard Karloff
Affiliations
[1] Princeton University
[2] AT&T Labs–Research
Source
Theory of Computing Systems, 2011, 49 (04)
Keywords
On-line scheduling; Data warehouse maintenance; Competitive analysis
DOI
Not available
Abstract
We study scheduling algorithms for loading data feeds into real-time data warehouses, which are used in applications such as IP network monitoring, online financial trading, and credit card fraud detection. In these applications, the warehouse collects a large number of streaming data feeds that are generated by external sources and arrive asynchronously. Data for each table in the warehouse are generated at a constant rate, though the rate may differ from table to table. For each data feed, the arrival of new data triggers an update that seeks to append the new data to the corresponding table; if multiple updates are pending for the same table, they are batched together before being loaded. At time τ, if a table has been updated with information up to time r ≤ τ, its staleness is defined as τ − r.
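The staleness definition and the batching rule in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the `Table` class, its field names, and the assumption that a batch loads instantaneously are all hypothetical and chosen only to make the definition concrete.

```python
from dataclasses import dataclass, field


@dataclass
class Table:
    """Hypothetical model of one warehouse table (illustrative names only)."""
    loaded_up_to: float = 0.0                    # r: table reflects data up to this time
    pending: list = field(default_factory=list)  # data timestamps of updates not yet loaded

    def staleness(self, now: float) -> float:
        # Staleness at time tau is tau - r, as defined in the abstract.
        return now - self.loaded_up_to

    def arrive(self, data_time: float) -> None:
        # A new update arrives and waits until the table is next loaded.
        self.pending.append(data_time)

    def load_batch(self) -> None:
        # All pending updates for the table are batched and loaded together.
        # Assumption: loading takes zero time, which the abstract does not state.
        if self.pending:
            self.loaded_up_to = max(self.loaded_up_to, max(self.pending))
            self.pending.clear()


t = Table()
t.arrive(2.0)
t.arrive(5.0)
before = t.staleness(6.0)  # nothing loaded yet, so r = 0.0
t.load_batch()
after = t.staleness(6.0)   # both updates loaded in one batch, so r = 5.0
```

In this sketch, loading the two pending updates as one batch moves r from 0.0 to 5.0, so the staleness measured at time 6.0 drops from 6.0 to 1.0.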
Pages: 757 - 780
Page count: 23
Related papers
50 in total
  • [1] Scheduling to Minimize Staleness and Stretch in Real-Time Data Warehouses
    Bateni, MohammadHossein
    Golab, Lukasz
    Hajiaghayi, MohammadTaghi
    Karloff, Howard
    THEORY OF COMPUTING SYSTEMS, 2011, 49 (04) : 757 - 780
  • [2] Scheduling to Minimize Staleness and Stretch in Real-Time Data Warehouses
    Bateni, MohammadHossein
    Golab, Lukasz
    Hajiaghayi, MohammadTaghi
    Karloff, Howard
    SPAA'09: PROCEEDINGS OF THE TWENTY-FIRST ANNUAL SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2009 : 29 - 38
  • [3] Multi-objective scheduling for real-time data warehouses
    Thiele, Maik
    Bader, Andreas
    Lehner, Wolfgang
    COMPUTER SCIENCE-RESEARCH AND DEVELOPMENT, 2009, 24 (03) : 137 - 151
  • [4] Requirement-Based Query and Update Scheduling in Real-Time Data Warehouses
    Leng, Fangling
    Bao, Yubin
    Yu, Ge
    Shi, Jingang
    Cai, Xiaoyan
    WEB-AGE INFORMATION MANAGEMENT, 2011, 6897 : 379 - 389
  • [5] Query optimisation in real-time data warehouses
    Hamdi I.
    Bouazizi E.
    Feki J.
    International Journal of Intelligent Information and Database Systems, 2019, 12 (04) : 245 - 278
  • [6] Real-time scheduling to minimize machine busy times
    Rohit Khandekar
    Baruch Schieber
    Hadas Shachnai
    Tami Tamir
    Journal of Scheduling, 2015, 18 : 561 - 573
  • [7] Real-time scheduling to minimize machine busy times
    Khandekar, Rohit
    Schieber, Baruch
    Shachnai, Hadas
    Tamir, Tami
    JOURNAL OF SCHEDULING, 2015, 18 (06) : 561 - 573
  • [8] Refreshing data warehouses with near real-time updates
    Rahman, Nayem
    JOURNAL OF COMPUTER INFORMATION SYSTEMS, 2007, 47 (03) : 71 - 80
  • [9] Dynamic Management of Materialized Views in Real-Time Data Warehouses
    Hamdi, Issam
    Bouazizi, Emna
    Feki, Jamel
    2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014 : 168 - 173
  • [10] Scheduling Real-Time Parallel Applications in Cloud to Minimize Energy Consumption
    Hu, Biao
    Cao, Zhengcai
    Zhou, Mengchu
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (01) : 662 - 674