A meta reinforcement learning-based virtual machine placement algorithm in mobile edge computing

被引:5
作者
Xu, Hao [1 ]
Jian, Chengfeng [1 ]
机构
[1] Zhejiang Univ Technol, Comp Sci & Technol Coll, Hangzhou 310023, Peoples R China
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2024年 / 27卷 / 02期
基金
中国国家自然科学基金;
关键词
Mobile edge computing; Deep reinforcement learning; Meta reinforcement learning; Virtual machine placement;
D O I
10.1007/s10586-023-04030-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mobile edge computing requires more and more high-performance servers, resulting in increased energy consumption. As an effective means to reduce energy consumption, virtual machine placement (VMP) has been widely studied. In the edge computing environment, as the number of terminal device requests continues to increase, the scale of VMP becomes larger and larger, and existing research algorithms may take a long time to converge. The reason is that as the number of VMs increases, the search space of the policy becomes larger and the agent needs to interact with the environment for a longer time to make the best decision. In addition, existing research methods only consider reducing energy consumption, rarely consider the response latency of virtual machines, and almost ignore the dynamic changes of the edge environment. To overcome these drawbacks, we propose a virtual machine placement algorithm based on meta-reinforcement learning, which consists of an inner and outer loop. The inner loop designs a deep reinforcement learning algorithm combined with the order exchange and migration mechanism to generate the best decision, and the outer loop provides meta-strategy parameters for the inner loop based on meta-learning to accelerate the convergence capability of the inner loop, thereby obtaining efficient virtual machine placement decisions quickly from a new environment. Through simulation experiments, we demonstrate that our approach effectively reduces the energy consumption of the edge server and the response latency of VMs at different problem sizes compared to the three baseline algorithms. At the same time, it quickly adapts to the new environment with only a small number of gradient updates.
引用
收藏
页码:1883 / 1896
页数:14
相关论文
共 32 条
[1]   A thermal-aware energy-efficient virtual machine placement algorithm based on fuzzy controlled binary gravitational search algorithm (FC-BGSA) [J].
Aghasi, Ali ;
Jamshidi, Kamal ;
Bohlooli, Ali .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (02) :1015-1033
[2]  
Ahmed A, 2016, PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO'16)
[3]   An Improved Ant Colony Algorithm for Solving a Virtual Machine Placement Problem in a Cloud Computing Environment [J].
Alharbe, Nawaf ;
Rakrouki, Mohamed Ali ;
Aljohani, Abeer .
IEEE ACCESS, 2022, 10 :44869-44880
[4]   Amulti-objective krill herd algorithm for virtual machine placement in cloud computing [J].
Baalamurugan, K. M. ;
Bhanu, S. Vijay .
JOURNAL OF SUPERCOMPUTING, 2020, 76 (06) :4525-4542
[5]   Energy-aware resource allocation heuristics for efficient management of data centers for Cloud computing [J].
Beloglazov, Anton ;
Abawajy, Jemal ;
Buyya, Rajkumar .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2012, 28 (05) :755-768
[6]   Reinforcement Learning, Fast and Slow [J].
Botvinick, Matthew ;
Ritter, Sam ;
Wang, Jane X. ;
Kurth-Nelson, Zeb ;
Blundell, Charles ;
Hassabis, Demis .
TRENDS IN COGNITIVE SCIENCES, 2019, 23 (05) :408-422
[7]  
Caili Wen, 2019, 2019 IEEE 21st International Conference on High Performance Computing and Communications
[8]  
IEEE 17th International Conference on Smart City
[9]  
IEEE 5th International Conference on Data Science and Systems (HPCC/SmartCity/DSS). Proceedings, P1343, DOI 10.1109/HPCC/SmartCity/DSS.2019.00187
[10]   Deep reinforcement learning for multi-objective placement of virtual machines in cloud datacenters [J].
Caviglione, Luca ;
Gaggero, Mauro ;
Paolucci, Massimo ;
Ronco, Roberto .
SOFT COMPUTING, 2021, 25 (19) :12569-12588