IPSO: A Scaling Model for Data-Intensive Applications

被引:1
|
作者
Li, Zhongwei [1 ]
Duan, Feng [1 ]
Minh Nguyen [1 ]
Che, Hao [1 ]
Lei, Yu [1 ]
Jiang, Hong [1 ]
机构
[1] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
关键词
scale-out workload; cloud computing; speedup; performance evaluation; Amdahl's Law; Gustafson's Law; AMDAHLS LAW;
D O I
10.1109/ICDCS.2019.00032
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Today's data center applications are predominantly data-intensive, calling for scaling out the workload to a large number of servers for parallel processing. Unfortunately, the existing scaling laws, notably, Amdahl's and Gustafson's laws are inadequate to characterize the scaling properties of dataintensive workloads. To fill this void, in this paper, we put forward a new scaling model, called In-Proportion and Scale-Out-induced scaling model (IPSO). IPSO generalizes the existing scaling models in two important aspects. First, it accounts for the possible in-proportion scaling, i.e., the scaling of the serial portion of the workload in proportion to the scaling of the parallelizable portion of the workload. Second, it takes into account the possible scaleout-induced scaling, i.e., the scaling of the collective overhead or workload induced by scaling out. IPSO exposes scaling properties of data-intensive workloads, rendering the existing scaling laws its special cases. In particular, IPSO reveals two new pathological scaling properties. Namely, the speedup may level off even in the case of the fixed-time workload underlying Gustafson's law, and it may peak and then fall as the system scales out. Extensive MapReduce and Spark-based case studies demonstrate that IPSO successfully captures diverse scaling properties of data-intensive applications. As a result, it can serve as a diagnostic tool to gain insights on or even uncover counter-intuitive root causes of observed scaling behaviors, especially pathological ones, for data-intensive applications. Finally, preliminary results also demonstrate the promising prospects of IPSO to facilitate effective resource provisioning to achieve the best speedup-versus-cost tradeoffs for data-intensive applications.
引用
收藏
页码:238 / 248
页数:11
相关论文
共 50 条
  • [21] Citus: Distributed PostgreSQL for Data-Intensive Applications
    Cubukcu, Umur
    Erdogan, Ozgun
    Pathak, Sumedh
    Sannakkayala, Sudhakar
    Slot, Marco
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 2490 - 2502
  • [22] Understanding performance of distributed data-intensive applications
    Miceli, Christopher
    Miceli, Michael
    Rodriguez-Milla, Bety
    Jha, Shantenu
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2010, 368 (1926): : 4089 - 4102
  • [23] GORDON:. AN IMPROVED ARCHITECTURE FOR DATA-INTENSIVE APPLICATIONS
    Caulfield, Adrian M.
    Grupp, Laura M.
    Swanson, Steven
    IEEE MICRO, 2010, 30 (01) : 121 - 130
  • [24] System dynamics simulations for data-intensive applications
    Neuwirth, Christian
    ENVIRONMENTAL MODELLING & SOFTWARE, 2017, 96 : 140 - 145
  • [25] Enhancing Parallelism of Data-Intensive Bioinformatics Applications
    Xie, Zheng
    Han, Liangxiu
    Baldock, Richard
    2013 8TH EUROSIM CONGRESS ON MODELLING AND SIMULATION (EUROSIM), 2013, : 519 - 524
  • [26] Conceptual modeling of data-intensive Web applications
    Ceri, S
    Fraternali, P
    Matera, M
    IEEE INTERNET COMPUTING, 2002, 6 (04) : 20 - 30
  • [27] Privacy-Aware Data-Intensive Applications
    Guerriero, Michele
    PROCEEDINGS OF THE 2017 32ND IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE'17), 2017, : 1030 - 1033
  • [28] Memory Hotspot Optimization for Data-Intensive Applications
    2019 28TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT 2019), 2019, : 466 - 467
  • [29] Probabilistic advisory systems for data-intensive applications
    Quinn, A
    Ettler, P
    Jirsa, L
    Nagy, I
    Nedoma, P
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2003, 17 (02) : 133 - 148
  • [30] A dynamically reconfigurable IP for data-intensive applications
    Miyamoto, N
    Karnan, L
    Kotani, K
    Ohmi, T
    PROCEEDINGS OF 2004 IEEE ASIA-PACIFIC CONFERENCE ON ADVANCED SYSTEM INTEGRATED CIRCUITS, 2004, : 404 - 405