Multi-Agent Reinforcement Learning Framework in SDN-IoT for Transient Load Detection and Prevention

Cited by: 23

Authors:

Dake, Delali Kwasi [1]

Gadze, James Dzisi [1]

Klogo, Griffith Selorm [1]

Nunoo-Mensah, Henry [1]

Affiliations:

[1] Kwame Nkrumah University of Science and Technology (KNUST), Faculty of Electrical and Computer Engineering, AK-0395028 Kumasi, Ghana

Source:

TECHNOLOGIES | 2021 / Vol. 9 / Issue 03

Keywords:

MADDPG; SDN; IoT; routing; reinforcement learning; DDoS; SECURITY; CHALLENGES; MANAGEMENT; NETWORKS;

DOI:

10.3390/technologies9030044

Chinese Library Classification (CLC) number:

T [Industrial Technology];

Discipline classification code:

08 ;

Abstract:

The rapid emergence of IoT devices and their accompanying large, complex data has necessitated a recent shift from traditional networking architecture to software-defined networks (SDNs). Routing optimization and DDoS protection have become necessities for mobile network operators in maintaining good QoS and QoE for customers. Inspired by recent advances in machine learning and deep reinforcement learning (DRL), we propose a novel MADDPG-integrated multi-agent framework in SDN for efficient multipath routing optimization and for the detection and prevention of malicious DDoS traffic in the network. The two MARL agents cooperate within the same environment to accomplish the network optimization task in a shorter time. The state, action, and reward of the proposed framework were modelled mathematically as a Markov Decision Process (MDP) and then integrated into the MADDPG algorithm. We compared the proposed MADDPG-based framework with DDPG on the network metrics of delay, jitter, packet loss rate, bandwidth usage, and intrusion detection. The results show a significant improvement in these network metrics with the two agents.

References

Pages: 22

46 references in total

