Auto-Scaling Cloud-Based Memory-Intensive Applications

被引:4
|
作者
Novak, Joe [1 ]
Kasera, Sneha Kumar [1 ]
Stutsman, Ryan [1 ]
机构
[1] Univ Utah, Salt Lake City, UT 84112 USA
来源
2020 IEEE 13TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2020) | 2020年
基金
美国国家科学基金会;
关键词
auto-scaling; memory-intensive; miss ratio curve;
D O I
10.1109/CLOUD49709.2020.00042
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Today, Cloud providers offer simplistic scaling policies that rely on thresholds that force tenants to have a priori knowledge of their workloads. We develop a new method for scaling memory-intensive workloads that needs no thresholds. This makes it worry-free for tenants, and it adapts even as workloads evolve. This is especially hard for memory-bound applications where even a small decrease in the amount of memory available can have a dramatic, almost unbounded impact on performance. Hence, sizing a machine's physical memory correctly is critical to application performance and operating cost. To determine a natural threshold for memory-intensive applications, our approach automatically analyzes an application's miss ratio curve (MRC) and models it as a hyperbola. Intuitively, a memory scaling policy should operate at the point where the curve flattens: that is, at its intersection with its latus rectum (LR). Our system uses a new approach to constructing and analyzing MRCs at run time that captures memory references from a slice of any scalable application as it executes on standard virtual machines from any major Cloud provider. We demonstrate with multiple applications running on Amazon Web Services (AWS) and Microsoft Azure. Our implementation and evaluation show that, though the LR doesn't require tenants to set thresholds, it is effective in scaling memory-intensive workloads to save on operating costs while avoiding queuing, thrashing, or collapse. It increases throughput by 1.5 x and reduces queuing delay by 2 x in our evaluation.
引用
收藏
页码:229 / 237
页数:9
相关论文
共 50 条
  • [1] MultiScaler: A Multi-Loop Auto-Scaling Approach for Cloud-Based Applications
    Al-Dulaimy, Auday
    Taheri, Javid
    Kassler, Andreas
    HoseinyFarahabady, M. Reza
    Deng, Shuiguang
    Zomaya, Albert
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (04) : 2769 - 2786
  • [2] Performance modelling and verification of cloud-based auto-scaling policies
    Evangelidis, Alexandros
    Parker, David
    Bahsoon, Rami
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 87 : 629 - 638
  • [3] Performance Modelling and Verification of Cloud-based Auto-Scaling Policies
    Evangelidis, Alexandros
    Parker, David
    Bahsoon, Rami
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 355 - 364
  • [4] An Autonomic Auto-scaling Controller for Cloud Based Applications
    Londono-Peldaez, Jorge M.
    Florez-Samur, Carlos A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2013, 4 (09) : 1 - 6
  • [5] Auto-Scaling Web Applications in Hybrid Cloud Based on Docker
    Li, Yunchun
    Xia, Yumeng
    PROCEEDINGS OF 2016 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2016, : 75 - 79
  • [6] Auto-Scaling Approach for Cloud based Mobile Learning Applications
    Almutlaq, Amani Nasser
    Daadaa, Yassine
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (01) : 472 - 479
  • [7] Auto-scaling techniques for IoT-based cloud applications: a review
    Verma, Shveta
    Bala, Anju
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (03): : 2425 - 2459
  • [8] Auto-Scaling Method in Hybrid Cloud for Scientific Applications
    Ahn, Younsun
    Choi, Jieun
    Jeong, Sol
    Kim, Yoonhee
    2014 16TH ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS), 2014,
  • [9] Auto-scaling techniques for IoT-based cloud applications: a review
    Shveta Verma
    Anju Bala
    Cluster Computing, 2021, 24 : 2425 - 2459
  • [10] Optimal Cloud Resource Auto-Scaling for Web Applications
    Jiang, Jing
    Lu, Jie
    Zhang, Guangquan
    Long, Guodong
    PROCEEDINGS OF THE 2013 13TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID 2013), 2013, : 58 - 65