Performance Interference-Aware Vertical Elasticity for Cloud-hosted Latency-Sensitive Applications

被引:30
作者
Shekhar, Shashank [1 ]
Abdel-Aziz, Hamzah [1 ]
Bhattacharjee, Anirban [1 ]
Gokhale, Aniruddha [1 ]
Koutsoukos, Xenofon [1 ]
机构
[1] Vanderbilt Univ, Dept EECS, 221 Kirkland Hall, Nashville, TN 37235 USA
来源
PROCEEDINGS 2018 IEEE 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD) | 2018年
基金
美国国家科学基金会;
关键词
Cloud computing; Data center; Multi-tenancy; Workload variability; Latency sensitive; Performance interference; Vertical elasticity; Virtualization; Linux containers; Docker; Online predictive models; Gaussian processes;
D O I
10.1109/CLOUD.2018.00018
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Elastic auto-scaling in cloud platforms has primarily used horizontal scaling by assigning application instances to distributed resources. Owing to rapid advances in hardware, cloud providers are now seeking vertical elasticity before attempting horizontal scaling to provide elastic auto-scaling for applications. Vertical elasticity solutions must, however, be cognizant of performance interference that stems from multi-tenant collocated applications since interference significantly impacts application quality-of-service (QoS) properties, such as latency. The problem becomes more pronounced for latency-sensitive applications that demand strict QoS properties. Further exacerbating the problem are variations in workloads, which make it hard to determine the right kinds of timely resource adaptations for latency-sensitive applications. To address these challenges and overcome limitations in existing offline approaches, we present an online, data-driven approach which utilizes Gaussian Processes-based machine learning techniques to build runtime predictive models of the performance of the system under different levels of interference. The predictive online models are then used in dynamically adapting to the workload variability by vertically auto-scaling co-located applications such that performance interference is minimized and QoS properties of latency-sensitive applications are met.
引用
收藏
页码:82 / 89
页数:8
相关论文
共 35 条
  • [1] Ahmad M., 2011, EDBT, P449
  • [2] A survey on virtual machine migration and server consolidation frameworks for cloud data centers
    Ahmad, Raja Wasim
    Gani, Abdullah
    Ab Hamid, Siti Hafizah
    Shiraz, Muhammad
    Yousafzai, Abdullah
    Xia, Feng
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2015, 52 : 11 - 25
  • [3] Al-Dhuraibi Y., 2017, 10 IEEE INT C CLOUD
  • [4] [Anonymous], HELP MEAS PERF
  • [5] [Anonymous], 2011, P 2 ACM S CLOUD COMP
  • [6] [Anonymous], 2013, P 40 ANN INT S COMP
  • [7] Intelligent, Performance Interference-aware Resource Management for IoT Cloud Backends
    Caglar, Faruk
    Shekhar, Shashank
    Gokhale, Aniruddha
    Koutsoukos, Xenofon
    [J]. PROCEEDINGS 2016 IEEE FIRST INTERNATIONAL CONFERENCE ON INTERNET-OF-THINGS DESIGN AND IMPLEMENTATION IOTDI 2016, 2016, : 95 - 105
  • [8] Paragon: QoS-Aware Scheduling for Heterogeneous Datacenters
    Delimitrou, Christina
    Kozyrakis, Christos
    [J]. ACM SIGPLAN NOTICES, 2013, 48 (04) : 77 - 88
  • [9] Hartigan J. A., 1979, Applied Statistics, V28, P100, DOI 10.2307/2346830
  • [10] Self-adaptive workload classification and forecasting for proactive resource provisioning
    Herbst, Nikolas Roman
    Huber, Nikolaus
    Kounev, Samuel
    Amrehn, Erich
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (12) : 2053 - 2078