Virtual Network Function Placement Optimization With Deep Reinforcement Learning

被引：97

作者：

Solozabal, Ruben ^{[1
]}

Ceberio, Josu ^{[2
]}

Sanchoyerto, Aitor ^{[1
]}

Zabala, Luis ^{[1
]}

Blanco, Bego ^{[3
]}

Liberal, Fidel ^{[1
]}

机构：

[1] Univ Basque Country, Networking Qual & Secur Dept, Bilbao, Spain

[2] Univ Basque Country, Dept Comp Sci & Artificial Intelligence, Donostia San Sebastian 20080, Spain

[3] Univ Basque Country UPV EHU, Dept Comp Languages & Syst, Bilbao 48013, Spain

来源：

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS | 2020年 / 38卷 / 02期

基金：

欧盟地平线“2020”;

关键词：

Constrained combinatorial optimization; Reinforcement Leaning; 5G; NFV;

D O I：

10.1109/JSAC.2019.2959183

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Network Function Virtualization (NFV) introduces a new network architecture framework that evolves network functions, traditionally deployed over dedicated equipment, to software implementations that run on general-purpose hardware. One of the main challenges for deploying NFV is the optimal resource placement of demanded network services in the NFV infrastructure. The virtual network function placement and network embedding can be formulated as a mathematical optimization problem concerned with a set of feasibility constraints that express the restrictions of the network infrastructure and the services contracted. This problem has been reported to be NP-hard, as a result most of the optimization work carried out in the area has focused on designing heuristic and metaheuristic algorithms. Nevertheless, in highly constrained problems, as in this case, inferring a competitive heuristic can be a daunting task that requires expertise. Consequently, an interesting solution is the use of Reinforcement Learning to model an optimization policy. The work presented here extends the Neural Combinatorial Optimization theory by considering constraints in the definition of the problem. The resulting agent is able to learn placement decisions by exploring the NFV infrastructure with the aim of minimizing the overall power consumption. The experiments conducted demonstrate that when the proposed strategy is also combined with heuristics, highly competitive results are achieved using relatively simple algorithms.

引用

页码：292 / 303

页数：12

共 27 条

[1]

Addis B, 2015, IEEE INT CONF CL NET, P171, DOI 10.1109/CloudNet.2015.7335301

[2]

[Anonymous], 2014, P EUR C NETW COMM

[3]

[Anonymous], [No title captured]

[4]

[Anonymous], 2014, ETSI GS NFV 002

[5]

[Anonymous], 2015, CORR

[6]

[Anonymous], 1998, INTRO REINFORCEMENT

[7]

Bahdanau D, 2014, 3 INT C LEARN REPR

[8]

Bari MF, 2015, INT CONF NETW SER, P50, DOI 10.1109/CNSM.2015.7367338

[9]

Bello I., 2016, ARXIV

[10]

Bertsekas D. P., 1997, J. Oper. Res. Soc., V48, P334, DOI DOI 10.1057/PALGRAVE.JORS.2600425

← 1 2 3 →