A Deep Recurrent-Reinforcement Learning Method for Intelligent AutoScaling of Serverless Functions

被引:4
|
作者
Agarwal, Siddharth [1 ]
Rodriguez, Maria A. [1 ]
Buyya, Rajkumar [1 ]
机构
[1] Univ Melbourne, Sch Comp & Informat Syst, Cloud Comp & Distributed Syst CLOUDS Lab, Melbourne, Vic 3010, Australia
关键词
Serverless computing; function-as-a-service; AutoScaling; reinforcement learning; constraint-awareness;
D O I
10.1109/TSC.2024.3387661
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Function-as-a-Service (FaaS) introduces a lightweight, function-based cloud execution model that finds its relevance in a range of applications like IoT-edge data processing and anomaly detection. While cloud service providers (CSPs) offer a near-infinite function elasticity, these applications often experience fluctuating workloads and stricter performance constraints. A typical CSP strategy is to empirically determine and adjust desired function instances or resources, known as autoscaling, based on monitoring-based thresholds such as CPU or memory, to cope with demand and performance. However, threshold configuration either requires expert knowledge, historical data or a complete view of the environment, making autoscaling a performance bottleneck that lacks an adaptable solution. Reinforcement learning (RL) algorithms are proven to be beneficial in analysing complex cloud environments and result in an adaptable policy that maximizes the expected objectives. Most realistic cloud environments usually involve operational interference and have limited visibility, making them partially observable. A general solution to tackle observability in highly dynamic settings is to integrate Recurrent units with model-free RL algorithms and model a decision process as a Partially Observable Markov Decision Process (POMDP). Therefore, in this article, we investigate model-free Recurrent RL agents for function autoscaling and compare them against the model-free Proximal Policy Optimisation (PPO) algorithm. We explore the integration of a Long-Short Term Memory (LSTM) network with the state-of-the-art PPO algorithm to find that under our experimental and evaluation settings, recurrent policies were able to capture the environment parameters and show promising results for function autoscaling. We further compare a PPO-based autoscaling agent with commercially used threshold-based function autoscaling and posit that a LSTM-based autoscaling agent is able to improve throughput by 18%, function execution by 13% and account for 8.4% more function instances.
引用
收藏
页码:1899 / 1910
页数:12
相关论文
共 50 条
  • [11] Deep reinforcement learning for application scheduling in resource-constrained, multi-tenant serverless computing environments
    Mampage, Anupama
    Karunasekera, Shanika
    Buyya, Rajkumar
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 143 : 277 - 292
  • [12] Def-DReL: Towards a sustainable serverless functions deployment strategy for fog-cloud environments using deep reinforcement learning
    Dehury, Chinmaya Kumar
    Poojara, Shivananda
    Srirama, Satish Narayana
    APPLIED SOFT COMPUTING, 2024, 152
  • [13] A survey of reinforcement and deep reinforcement learning for coordination in intelligent traffic light control
    Saadi, Aicha
    Abghour, Noureddine
    Chiba, Zouhair
    Moussaid, Khalid
    Ali, Saadi
    JOURNAL OF BIG DATA, 2025, 12 (01)
  • [14] Using Deep Reinforcement Learning to Build Intelligent Tutoring Systems
    Paduraru, Ciprian
    Paduraru, Miruna
    Iordache, Stefan
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES (ICSOFT), 2022, : 288 - 298
  • [15] A multi-agent deep reinforcement learning approach for optimal resource management in serverless computing
    Singh, Ashutosh Kumar
    Kumar, Satender
    Jain, Sarika
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2025, 28 (02):
  • [16] Intelligent Adapted e-Learning System based on Deep Reinforcement Learning
    El Fouki, Mohammed
    Aknin, Noura
    El Kadiri, K. Ed
    ICCWCS'17: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING AND WIRELESS COMMUNICATION SYSTEMS, 2017,
  • [17] Intelligent Anti-Jamming Based on Deep Reinforcement Learning and Transfer Learning
    Janiar, Siavash Barqi
    Wang, Ping
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (06) : 8825 - 8834
  • [18] GeoPM-DMEIRL: A deep inverse reinforcement learning security trajectory generation framework with serverless computing
    Huang, Yi-rui
    Zhang, Jing
    Hou, Hong-ming
    Ye, Xiu-cai
    Chen, Yi
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 154 : 123 - 139
  • [19] Towards Intelligent Adaptive Edge Caching Using Deep Reinforcement Learning
    Wang, Ting
    Deng, Yuxiang
    Mao, Jiawei
    Chen, Mingsong
    Liu, Gang
    Di, Jieming
    Li, Keqin
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (10) : 9289 - 9303
  • [20] Intelligent Residential Energy Management System Using Deep Reinforcement Learning
    Mathew, Alwyn
    Roy, Abhijit
    Mathew, Jimson
    IEEE SYSTEMS JOURNAL, 2020, 14 (04): : 5362 - 5372