Minimizing Malware Propagation in Internet of Things Networks: An Optimal Control Using Feedback Loop Approach

被引：6

作者：

Tayseer Jafar, Mousa ^{[1
]}

Yang, Lu-Xing ^{[1
]}

Li, Gang ^{[1
]}

Zhu, Qingyi ^{[2
]}

Gan, Chenquan ^{[2
]}

机构：

[1] Deakin Univ, Sch Informat Technol, Melbourne, Vic 3125, Australia

[2] Chongqing Univ Posts & Telecommun, Sch Cyber Secur & Informat Law, Chongqing 400065, Peoples R China

来源：

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY | 2024年 / 19卷

关键词：

Malware; Internet of Things; Optimal control; Prevention and mitigation; Resource management; Epidemics; Costs; hybrid framework; feedback loop; closed-loop; model predictive control; reinforcement learning; IoT; VIRUS PROPAGATION; DYNAMICAL ANALYSIS; COMPUTER VIRUS; MODEL; INFORMATION; STABILITY; EPIDEMICS;

D O I：

10.1109/TIFS.2024.3463965

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Despite extensive research on optimal control formulations for cyber threat mitigation, a significant gap persists between theoretical and practical implementation in real-time scenarios. The open-loop structure of the optimal control framework is insufficiently robust for effectively addressing cyber threats. To overcome this, adopting a model learning process that iteratively updates the optimal control strategy is proposed. This paper proposes an innovative approach to addressing cybersecurity attacks in the Internet of Things (IoT) networks by integrating reinforcement learning (RL) and model predictive control (MPC) in a hybrid framework to optimize control parameters and enhance system effectiveness in combating malware. This novel approach aims to overcome the limitations of the previous approaches and establish superior control strategies for IoT network security. This approach enhances the adaptability and responsiveness of the mitigation process, improving the handling of evolving cyber threats in real-world applications. This framework enhances the security and resilience of IoT networks against malicious activities, offering a robust solution for mitigating cyber threats by leveraging RL algorithms and the proactive capabilities of MPC. A comprehensive evaluation demonstrates the effectiveness and efficiency of the hybrid framework, highlighting its potential to protect IoT networks from evolving cybersecurity risks. The primary aim extends beyond using an RL agent solely for computing control actions to optimize closed-loop performance and stability. It also leverages RL to estimate model parameters that are currently unknown but within known bounds. Our main objective in using the RL agent is to accurately estimate unidentified model parameters within specified limits. The simulation results provide compelling evidence supporting the effectiveness of this methodology in mitigating malware propagation, highlighting its superior performance compared to state-of-the-art methods. RLMPC rapidly initiated recovery, achieving full network restoration in 8 seconds and recovering 60 IoT devices. Also, the evaluation focused on average speed, scalability, and performance under various cyber-attack scenarios.

引用

页码：9682 / 9697

页数：16

共 74 条

[31] Optimal Quarantining of Wireless Malware Through Reception Gain Control [J].

Khouzani, M. H. R. ;

Altman, Eitan ;

Sarkar, Saswati .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2012, 57 (01) :49-61

[32]

Kirk D., 1998, OPTIMAL CONTROL THEO, DOI DOI 10.1007/978-3-319-98237-3

[33] Exploring the behavior of malware propagation on mobile wireless sensor networks: Stability and control analysis [J].

Kumari, Sangeeta ;

Upadhyay, Ranjit Kumar .

MATHEMATICS AND COMPUTERS IN SIMULATION, 2021, 190 :246-269

[34] Deep Reinforcement Learning for Large-Scale Epidemic Control [J].

Libin, Pieter J. K. ;

Moonens, Arno ;

Verstraeten, Timothy ;

Perez-Sanjines, Fabian ;

Hens, Niel ;

Lemey, Philippe ;

Nowe, Ann .

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2020, PT V, 2021, 12461 :155-170

[35] Analysis and Control of Malware Mutation Model in Wireless Rechargeable Sensor Network with Charging Delay [J].

Liu, Guiyun ;

Peng, Zhimin ;

Liang, Zhongwei ;

Zhong, Xiaojing ;

Xia, Xinhai .

MATHEMATICS, 2022, 10 (14)

[36] A Novel Epidemic Model Base on Pulse Charging in Wireless Rechargeable Sensor Networks [J].

Liu, Guiyun ;

Su, Xiaokai ;

Hong, Fenghuo ;

Zhong, Xiaojing ;

Liang, Zhongwei ;

Wu, Xilai ;

Huang, Ziyi .

ENTROPY, 2022, 24 (02)

[37] Dynamics Analysis of a Wireless Rechargeable Sensor Network for Virus Mutation Spreading [J].

Liu, Guiyun ;

Peng, Zhimin ;

Liang, Zhongwei ;

Li, Junqiang ;

Cheng, Lefeng .

ENTROPY, 2021, 23 (05)

[38] Web malware spread modelling and optimal control strategies [J].

Liu, Wanping ;

Zhong, Shouming .

SCIENTIFIC REPORTS, 2017, 7

[39] Optimal Control of Malware Spreading Model with Tracing and Patching in Wireless Sensor Networks [J].

Muthukrishnan, Senthilkumar ;

Muthukumar, Sumathi ;

Chinnadurai, Veeramani .

WIRELESS PERSONAL COMMUNICATIONS, 2021, 117 (03) :2061-2083

[40] Spread of epidemic disease on networks [J].

Newman, MEJ .

PHYSICAL REVIEW E, 2002, 66 (01) :1-016128

← 1 2 3 4 5 6 7 8 →