Enhancing IoT Intelligence: A Transformer-based Reinforcement Learning Methodology

被引：2

作者：

Rjoub, Gaith ^{[1
,2
]}

Islam, Saidul ^{[2
]}

Bentahar, Jamal ^{[2
,3
]}

Almaiah, Mohammed Amin ^{[4
,5
]}

Alrawashdeh, Rana ^{[6
]}

机构：

[1] Aqaba Univ Technol, Fac Informat Technol, Aqaba, Jordan

[2] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ, Canada

[3] Khalifa Univ, Dept Comp Sci, Abu Dhabi, U Arab Emirates

[4] Univ Jordan, King Abdullah II Sch Informat Technol, Amman, Jordan

[5] Appl Sci Private Univ, Fac Informat Technol, Amman, Jordan

[6] King Fand Univ Petr & Minerals, Dept Informat & Comp Sci, Dhahran, Saudi Arabia

来源：

20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024 | 2024年

关键词：

Internet of Things (IoT); Reinforcement Learning (RL); Proximal Policy Optimization (PPO); Transformers;

D O I：

10.1109/IWCMC61514.2024.10592607

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The proliferation of the Internet of Things (IoT) has led to an explosion of data generated by interconnected devices, presenting both opportunities and challenges for intelligent decision-making in complex environments. Traditional Reinforcement Learning (RL) approaches often struggle to fully harness this data due to their limited ability to process and interpret the intricate patterns and dependencies inherent in IoT applications. This paper introduces a novel framework that integrates transformer architectures with Proximal Policy Optimization (PPO) to address these challenges. By leveraging the self-attention mechanism of transformers, our approach enhances RL agents' capacity for understanding and acting within dynamic IoT environments, leading to improved decision-making processes. We demonstrate the effectiveness of our method across various IoT scenarios, from smart home automation to industrial control systems, showing marked improvements in decision-making efficiency and adaptability. Our contributions include a detailed exploration of the transformer's role in processing heterogeneous IoT data, a comprehensive evaluation of the framework's performance in diverse environments, and a benchmark against traditional RL methods. The results indicate significant advancements in enabling RL agents to navigate the complexities of IoT ecosystems, highlighting the potential of our approach to revolutionize intelligent automation and decision-making in the IoT landscape.

引用

页码：1418 / 1423

页数：6

共 22 条

[1] Machine learning and data analytics for the IoT
Adi, Erwin
Anwar, Adnan
Baig, Zubair
Zeadally, Sherali
[J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (20) : 16205 - 16233
[2] Stabilizing Reinforcement Learning in Dynamic Environment with Application to Online Recommendation
Chen, Shi-Yong
Yu, Yang
Da, Qing
Tan, Jun
Huang, Hai-Kuan
Tang, Hai-Hong
[J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1187 - 1196
[3] Ensemble machine learning approach for classification of IoT devices in smart home
Cvitic, Ivan
Perakovic, Dragan
Perisa, Marko
Gupta, Brij
[J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (11) : 3179 - 3202
[4] A comprehensive survey on applications of transformers for deep learning tasks
Islam, Saidul
Elmekki, Hanae
Elsebai, Ahmed
Bentahar, Jamal
Drawel, Nagat
Rjoub, Gaith
Pedrycz, Witold
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 241
[5] A new method of hybrid time window embedding with transformer-based traffic data classification in IoT-networked environment
Kozik, Rafal
Pawlicki, Marek
Choras, Michal
[J]. PATTERN ANALYSIS AND APPLICATIONS, 2021, 24 (04) : 1441 - 1449
[6] Dynamic Path Planning of Unknown Environment Based on Deep Reinforcement Learning
Lei, Xiaoyun
Zhang, Zhian
Dong, Peifang
[J]. JOURNAL OF ROBOTICS, 2018, 2018
[7] Melo L. C., 2022, PMLR, V162, p15 340
[8] Mirowski P, 2017, Arxiv, DOI arXiv:1611.03673
[9] Parisotto E., 2020, PMLR, P7487
[10] A multi-head attention-based transformer model for traffic flow forecasting with a comparative analysis to recurrent neural networks
Reza, Selim
Ferreira, Marta Campos
Machado, J. J. M.
Tavares, Joao Manuel R. S.
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 202

← 1 2 3 →