A Deep Recurrent-Reinforcement Learning Method for Intelligent AutoScaling of Serverless Functions

被引：4

作者：

Agarwal, Siddharth ^{[1
]}

Rodriguez, Maria A. ^{[1
]}

Buyya, Rajkumar ^{[1
]}

机构：

[1] Univ Melbourne, Sch Comp & Informat Syst, Cloud Comp & Distributed Syst CLOUDS Lab, Melbourne, Vic 3010, Australia

来源：

IEEE TRANSACTIONS ON SERVICES COMPUTING | 2024年 / 17卷 / 05期

关键词：

Serverless computing; function-as-a-service; AutoScaling; reinforcement learning; constraint-awareness;

D O I：

10.1109/TSC.2024.3387661

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Function-as-a-Service (FaaS) introduces a lightweight, function-based cloud execution model that finds its relevance in a range of applications like IoT-edge data processing and anomaly detection. While cloud service providers (CSPs) offer a near-infinite function elasticity, these applications often experience fluctuating workloads and stricter performance constraints. A typical CSP strategy is to empirically determine and adjust desired function instances or resources, known as autoscaling, based on monitoring-based thresholds such as CPU or memory, to cope with demand and performance. However, threshold configuration either requires expert knowledge, historical data or a complete view of the environment, making autoscaling a performance bottleneck that lacks an adaptable solution. Reinforcement learning (RL) algorithms are proven to be beneficial in analysing complex cloud environments and result in an adaptable policy that maximizes the expected objectives. Most realistic cloud environments usually involve operational interference and have limited visibility, making them partially observable. A general solution to tackle observability in highly dynamic settings is to integrate Recurrent units with model-free RL algorithms and model a decision process as a Partially Observable Markov Decision Process (POMDP). Therefore, in this article, we investigate model-free Recurrent RL agents for function autoscaling and compare them against the model-free Proximal Policy Optimisation (PPO) algorithm. We explore the integration of a Long-Short Term Memory (LSTM) network with the state-of-the-art PPO algorithm to find that under our experimental and evaluation settings, recurrent policies were able to capture the environment parameters and show promising results for function autoscaling. We further compare a PPO-based autoscaling agent with commercially used threshold-based function autoscaling and posit that a LSTM-based autoscaling agent is able to improve throughput by 18%, function execution by 13% and account for 8.4% more function instances.

引用

页码：1899 / 1910

页数：12

共 50 条

[11] Deep reinforcement learning for application scheduling in resource-constrained, multi-tenant serverless computing environments
Mampage, Anupama
Karunasekera, Shanika
Buyya, Rajkumar
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 143 : 277 - 292
[12] Def-DReL: Towards a sustainable serverless functions deployment strategy for fog-cloud environments using deep reinforcement learning
Dehury, Chinmaya Kumar
Poojara, Shivananda
Srirama, Satish Narayana
APPLIED SOFT COMPUTING, 2024, 152
[13] A survey of reinforcement and deep reinforcement learning for coordination in intelligent traffic light control
Saadi, Aicha
Abghour, Noureddine
Chiba, Zouhair
Moussaid, Khalid
Ali, Saadi
JOURNAL OF BIG DATA, 2025, 12 (01)
[14] Using Deep Reinforcement Learning to Build Intelligent Tutoring Systems
Paduraru, Ciprian
Paduraru, Miruna
Iordache, Stefan
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES (ICSOFT), 2022, : 288 - 298
[15] A multi-agent deep reinforcement learning approach for optimal resource management in serverless computing
Singh, Ashutosh Kumar
Kumar, Satender
Jain, Sarika
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2025, 28 (02):
[16] Intelligent Adapted e-Learning System based on Deep Reinforcement Learning
El Fouki, Mohammed
Aknin, Noura
El Kadiri, K. Ed
ICCWCS'17: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING AND WIRELESS COMMUNICATION SYSTEMS, 2017,
[17] Intelligent Anti-Jamming Based on Deep Reinforcement Learning and Transfer Learning
Janiar, Siavash Barqi
Wang, Ping
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (06) : 8825 - 8834
[18] GeoPM-DMEIRL: A deep inverse reinforcement learning security trajectory generation framework with serverless computing
Huang, Yi-rui
Zhang, Jing
Hou, Hong-ming
Ye, Xiu-cai
Chen, Yi
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 154 : 123 - 139
[19] Towards Intelligent Adaptive Edge Caching Using Deep Reinforcement Learning
Wang, Ting
Deng, Yuxiang
Mao, Jiawei
Chen, Mingsong
Liu, Gang
Di, Jieming
Li, Keqin
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (10) : 9289 - 9303
[20] Intelligent Residential Energy Management System Using Deep Reinforcement Learning
Mathew, Alwyn
Roy, Abhijit
Mathew, Jimson
IEEE SYSTEMS JOURNAL, 2020, 14 (04): : 5362 - 5372

← 1 2 3 4 5 →