DDQP: A Double Deep Q-Learning Approach to Online Fault-Tolerant SFC Placement

被引:37
|
作者
Wang, Lei [1 ,2 ]
Mao, Weixi [1 ,2 ]
Zhao, Jin [1 ,2 ]
Xu, Yuedong [3 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200438, Peoples R China
[2] Shanghai Key Lab Intelligent Informat Proc, Shanghai 200438, Peoples R China
[3] Fudan Univ, Sch Informat Sci & Technol, Shanghai 200438, Peoples R China
来源
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT | 2021年 / 18卷 / 01期
基金
中国国家自然科学基金;
关键词
Fault tolerant systems; Fault tolerance; Software; Routing; Real-time systems; Software reliability; Reinforcement learning; Deep reinforcement learning; service function chain; network function virtualization; fault tolerance; NETWORK; ALLOCATION; GAME; GO;
D O I
10.1109/TNSM.2021.3049298
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Since Network Function Virtualization (NFV) decouples network functions (NFs) from the underlying dedicated hardware and realizes them in the form of software called Virtual Network Functions (VNFs), they are enabled to run in any resource-sufficient virtual machines. A service function chain (SFC) is composed of a sequential set of VNFs. As VNFs are vulnerable to various faults such as software failures, we consider how to deploy both active and standby SFC instances. Given the complexity and unpredictability of the network state, we propose a double deep Q-networks based online SFC placement scheme DDQP. Specifically, DDQP uses deep neural networks to deal with large continuous network state space. In the case of stateful VNFs, we offer constant generated state updates from active instances to standby instances to guarantee seamless redirection after failures. With the goal of balancing the waste of resources and ensuring service reliability, we introduce five progressive schemes of resource reservations to meet different customer needs. Our experimental results demonstrate that DDQP responds rapidly to arriving requests and reaches near-optimal performance. Specifically, DDQP outweighs the state-of-the-art method by 16.30% and 38.51% higher acceptance ratio under different schemes with 82x speedup on average. In order to enhance the integrity of the SFC state transition, we further proposed DDQP+, which extends DDQP by adding the delayed placement mechanism. Compared with DDQP, the design of the DDQP+ algorithm is more reasonable and comprehensive. The experiment results also show that DDQP+ achieved further improvement in multiple performance indicators.
引用
收藏
页码:118 / 132
页数:15
相关论文
共 50 条
  • [21] Double Deep Q-Learning Based Irrigation and Chemigation Control
    Song, Jianfeng
    Porter, Dana
    Hu, Jiang
    Marek, Thomas
    PROCEEDINGS OF THE TWENTY THIRD INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2022), 2022, : 414 - 419
  • [22] An Adaptive Learning Approach for Fault-Tolerant Routing in Internet of Things
    Misra, Sudip
    Krishna, P. Venkata
    Agarwal, Harshit
    Gupta, Anshima
    Obaidat, Mohammad S.
    2012 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2012,
  • [23] Fault-tolerant pattern formation by multiple robots: a learning approach
    Wang, Jia
    Cao, Jiannong
    Jiang, Shan
    2017 IEEE 36TH INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS), 2017, : 268 - 269
  • [24] SHIELDeNN: Online Accelerated Framework for Fault-Tolerant Deep Neural Network Architectures
    Khoshavi, Navid
    Roohi, Arman
    Broyles, Connor
    Sargolzaei, Saman
    Bi, Yu
    Pan, David Z.
    PROCEEDINGS OF THE 2020 57TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2020,
  • [25] Deep reinforcement learning for fault-tolerant workflow scheduling in cloud environment
    Tingting Dong
    Fei Xue
    Hengliang Tang
    Chuangbai Xiao
    Applied Intelligence, 2023, 53 : 9916 - 9932
  • [26] Deep reinforcement learning for fault-tolerant workflow scheduling in cloud environment
    Dong, Tingting
    Xue, Fei
    Tang, Hengliang
    Xiao, Chuangbai
    APPLIED INTELLIGENCE, 2023, 53 (09) : 9916 - 9932
  • [27] Deep Learning Based Active Fault-Tolerant Control for Missile Actuators
    Jin, Luohuan
    Du, Mingjian
    Ma, Jianjun
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5760 - 5766
  • [28] Deep Reinforcement Learning-Based Approach for Fault-Tolerant Control of PV Systems in Smart Grids
    Karaki, Tala
    Saied, Majd
    Shraim, Hassan
    2022 10TH INTERNATIONAL CONFERENCE ON SYSTEMS AND CONTROL (ICSC), 2022, : 283 - 288
  • [29] A Deep Q-Learning based approach applied to the Snake game
    Sebastianelli, Alessandro
    Tipaldi, Massimo
    Ullo, Silvia Liberata
    Glielmo, Luigi
    2021 29TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2021, : 348 - 353
  • [30] A Deep Q-Learning Approach for Dynamic Management of Heterogeneous Processors
    Gupta, Ujjwal
    Mandal, Sumit K.
    Mao, Manqing
    Chakrabarti, Chaitali
    Ogras, Umit Y.
    IEEE COMPUTER ARCHITECTURE LETTERS, 2019, 18 (01) : 14 - 17