DRL-QOR: Deep Reinforcement Learning-Based QoS/QoE-Aware Adaptive Online Orchestration in NFV-Enabled Networks

被引：43

作者：

Chen, Jing ^{[1
]}

Chen, Jia ^{[1
]}

Zhang, Hongke ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Elect & Informat Engn, Beijing 100044, Peoples R China

来源：

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT | 2021年 / 18卷 / 02期

关键词：

Quality of service; Quality of experience; Adaptation models; System performance; Optimization; Delays; Servers; Network functions virtualization (NFV); service function chain (SFC); quality of service (QoS); quality of experience (QoE); deep reinforcement learning; orchestration; RESOURCE OPTIMIZATION; FUNCTION PLACEMENT; ALGORITHM;

D O I：

10.1109/TNSM.2021.3055494

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Faced with fluctuating network traffic and unknown underlying network traffic dynamics, developing an effective orchestration model with low network cost is still a critical issue in Network Functions Virtualization (NFV)-enabled networks. Thus we propose a Deep Reinforcement Learning based Quality of Service (QoS)/Quality of Experience (QoE)-Aware Adaptive Online Orchestration (DRL-QOR) approach to adapt to the real- time network variations. We formulate the stochastic resource optimization as a Parameterized Action Markov Decision Process (PAMDP), with QoE and specific QoS requirements as key factors in formulating the reward function, aiming to maximize QoE while satisfying QoS constraints. Then we propose DRL-QOR to solve the Non-deterministic Polynomial hard (NP-hard) problem with consideration of improving the long-term profits, where deep neural network combinatorial optimization theory is extended under the constraints of the binary integer programming model. Extensive experimental results in real USANET topology demonstrate that our proposed DRL-QOR converges fast during the training process. Compared with other benchmarks that only consider the current system performance, it shows good performance in QoE provisioning and QoS requirements maintenance for orchestrating SFCs.

引用

页码：1758 / 1774

页数：17

共 43 条

[11] The Next Generation Heterogeneous Satellite Communication Networks: Integration of Resource Management and Deep Reinforcement Learning [J].

Deng, Boyu ;

Jiang, Chunxiao ;

Yao, Haipeng ;

Guo, Song ;

Zhao, Shanghong .

IEEE WIRELESS COMMUNICATIONS, 2020, 27 (02) :105-111

[12] An Approach for Service Function Chain Routing and Virtual Function Network Instance Migration in Network Function Virtualization Architectures [J].

Eramo, Vincenzo ;

Miucci, Emanuele ;

Ammar, Mostafa ;

Lavacca, Francesco Giacinto .

IEEE-ACM TRANSACTIONS ON NETWORKING, 2017, 25 (04) :2008-2025

[13]

Fei XC, 2018, IEEE INFOCOM SER, P486, DOI 10.1109/INFOCOM.2018.8486320

[14] A generic quantitative relationship between quality of experience and quality of service [J].

Fiedler, Markus ;

Hossfeld, Tobias ;

Tran-Gia, Phuoc .

IEEE NETWORK, 2010, 24 (02) :36-41

[15] Virtual Network Embedding: A Survey [J].

Fischer, Andreas ;

Botero, Juan Felipe ;

Beck, Michael Till ;

de Meer, Hermann ;

Hesselbach, Xavier .

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2013, 15 (04) :1888-1906

[16] Dynamic Service Function Chain Embedding for NFV-Enabled IoT: A Deep Reinforcement Learning Approach [J].

Fu, Xiaoyuan ;

Yu, F. Richard ;

Wang, Jingyu ;

Qi, Qi ;

Liao, Jianxin .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (01) :507-519

[17] Cooperating with the future [J].

Hauser, Oliver P. ;

Rand, David G. ;

Peysakhovich, Alexander ;

Nowak, Martin A. .

NATURE, 2014, 511 (7508) :220-+

[18] Resource Allocation in NFV: A Comprehensive Survey [J].

Herrera, Juliver Gil ;

Botero, Juan Felipe .

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2016, 13 (03) :518-532

[19]

Kikuchi H, 2015, ANN CONF PRIV SECUR, P14, DOI 10.1109/PST.2015.7232949

[20]

Lee Giwon., 2015, The 10th International Conference on Future Internet, P17, DOI DOI 10.1145/2775088.2775103

← 1 2 3 4 5 →