Multi-Agent Reinforcement Learning Framework in SDN-IoT for Transient Load Detection and Prevention

被引:23
作者
Dake, Delali Kwasi [1 ]
Gadze, James Dzisi [1 ]
Klogo, Griffith Selorm [1 ]
Nunoo-Mensah, Henry [1 ]
机构
[1] Kwame Nkrumah Univ Sci & Technol KNUST, Fac Elect & Comp Engn, AK-0395028 Kumasi, Ghana
关键词
MADDPG; SDN; IoT; routing; reinforcement learning; DDoS; SECURITY; CHALLENGES; MANAGEMENT; NETWORKS;
D O I
10.3390/technologies9030044
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The fast emergence of IoT devices and its accompanying big and complex data has necessitated a shift from the traditional networking architecture to software-defined networks (SDNs) in recent times. Routing optimization and DDoS protection in the network has become a necessity for mobile network operators in maintaining a good QoS and QoE for customers. Inspired by the recent advancement in Machine Learning and Deep Reinforcement Learning (DRL), we propose a novel MADDPG integrated Multiagent framework in SDN for efficient multipath routing optimization and malicious DDoS traffic detection and prevention in the network. The two MARL agents cooperate within the same environment to accomplish network optimization task within a shorter time. The state, action, and reward of the proposed framework were further modelled mathematically using the Markov Decision Process (MDP) and later integrated into the MADDPG algorithm. We compared the proposed MADDPG-based framework to DDPG for network metrics: delay, jitter, packet loss rate, bandwidth usage, and intrusion detection. The results show a significant improvement in network metrics with the two agents.
引用
收藏
页数:22
相关论文
共 46 条
  • [1] Akbari I., 2020, P NOMS 2020 2020 IEE, DOI [10.1109/NOMS47738.2020.9110426, DOI 10.1109/NOMS47738.2020.9110426]
  • [2] Almseidin M, 2017, I S INTELL SYST INFO, P277, DOI 10.1109/SISY.2017.8080566
  • [3] Amruthnath N, 2018, 2018 5TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND APPLICATIONS (ICIEA), P355, DOI 10.1109/IEA.2018.8387124
  • [4] [Anonymous], 2018, ADV NEURAL DYN
  • [5] Asadollahi S., 2018, P 2018 IEEE INT C CU, P3
  • [6] Controller exploitation-exploration reinforcement learning architecture for computing near-optimal policies
    Asiain, Erick
    Clempner, Julio B.
    Poznyak, Alexander S.
    [J]. SOFT COMPUTING, 2019, 23 (11) : 3591 - 3604
  • [7] Optimal Voltage Control Strategy for Voltage Regulators in Active Unbalanced Distribution Systems Using Multi-Agents
    Bedawy, Ahmed
    Yorino, Naoto
    Mahmoud, Karar
    Zoka, Yoshifumi
    Sasaki, Yutaka
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2020, 35 (02) : 1023 - 1035
  • [8] Bhunia SS, 2017, 2017 27TH INTERNATIONAL TELECOMMUNICATION NETWORKS AND APPLICATIONS CONFERENCE (ITNAC), P84
  • [9] Cruz Santos Antonio Fernando., 2018, Advances in Intelligent Systems and Computing, V558, P501, DOI [10.1007/978-3-319-54978-164, DOI 10.1007/978-3-319-54978-164]
  • [10] Dharmadhikari C., 2019, IRJET, V6, P448