Enhancing IoT Intelligence: A Transformer-based Reinforcement Learning Methodology

被引:2
作者
Rjoub, Gaith [1 ,2 ]
Islam, Saidul [2 ]
Bentahar, Jamal [2 ,3 ]
Almaiah, Mohammed Amin [4 ,5 ]
Alrawashdeh, Rana [6 ]
机构
[1] Aqaba Univ Technol, Fac Informat Technol, Aqaba, Jordan
[2] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ, Canada
[3] Khalifa Univ, Dept Comp Sci, Abu Dhabi, U Arab Emirates
[4] Univ Jordan, King Abdullah II Sch Informat Technol, Amman, Jordan
[5] Appl Sci Private Univ, Fac Informat Technol, Amman, Jordan
[6] King Fand Univ Petr & Minerals, Dept Informat & Comp Sci, Dhahran, Saudi Arabia
来源
20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024 | 2024年
关键词
Internet of Things (IoT); Reinforcement Learning (RL); Proximal Policy Optimization (PPO); Transformers;
D O I
10.1109/IWCMC61514.2024.10592607
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The proliferation of the Internet of Things (IoT) has led to an explosion of data generated by interconnected devices, presenting both opportunities and challenges for intelligent decision-making in complex environments. Traditional Reinforcement Learning (RL) approaches often struggle to fully harness this data due to their limited ability to process and interpret the intricate patterns and dependencies inherent in IoT applications. This paper introduces a novel framework that integrates transformer architectures with Proximal Policy Optimization (PPO) to address these challenges. By leveraging the self-attention mechanism of transformers, our approach enhances RL agents' capacity for understanding and acting within dynamic IoT environments, leading to improved decision-making processes. We demonstrate the effectiveness of our method across various IoT scenarios, from smart home automation to industrial control systems, showing marked improvements in decision-making efficiency and adaptability. Our contributions include a detailed exploration of the transformer's role in processing heterogeneous IoT data, a comprehensive evaluation of the framework's performance in diverse environments, and a benchmark against traditional RL methods. The results indicate significant advancements in enabling RL agents to navigate the complexities of IoT ecosystems, highlighting the potential of our approach to revolutionize intelligent automation and decision-making in the IoT landscape.
引用
收藏
页码:1418 / 1423
页数:6
相关论文
共 22 条
  • [1] Machine learning and data analytics for the IoT
    Adi, Erwin
    Anwar, Adnan
    Baig, Zubair
    Zeadally, Sherali
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (20) : 16205 - 16233
  • [2] Stabilizing Reinforcement Learning in Dynamic Environment with Application to Online Recommendation
    Chen, Shi-Yong
    Yu, Yang
    Da, Qing
    Tan, Jun
    Huang, Hai-Kuan
    Tang, Hai-Hong
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1187 - 1196
  • [3] Ensemble machine learning approach for classification of IoT devices in smart home
    Cvitic, Ivan
    Perakovic, Dragan
    Perisa, Marko
    Gupta, Brij
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (11) : 3179 - 3202
  • [4] A comprehensive survey on applications of transformers for deep learning tasks
    Islam, Saidul
    Elmekki, Hanae
    Elsebai, Ahmed
    Bentahar, Jamal
    Drawel, Nagat
    Rjoub, Gaith
    Pedrycz, Witold
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 241
  • [5] A new method of hybrid time window embedding with transformer-based traffic data classification in IoT-networked environment
    Kozik, Rafal
    Pawlicki, Marek
    Choras, Michal
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2021, 24 (04) : 1441 - 1449
  • [6] Dynamic Path Planning of Unknown Environment Based on Deep Reinforcement Learning
    Lei, Xiaoyun
    Zhang, Zhian
    Dong, Peifang
    [J]. JOURNAL OF ROBOTICS, 2018, 2018
  • [7] Melo L. C., 2022, PMLR, V162, p15 340
  • [8] Mirowski P, 2017, Arxiv, DOI arXiv:1611.03673
  • [9] Parisotto E., 2020, PMLR, P7487
  • [10] A multi-head attention-based transformer model for traffic flow forecasting with a comparative analysis to recurrent neural networks
    Reza, Selim
    Ferreira, Marta Campos
    Machado, J. J. M.
    Tavares, Joao Manuel R. S.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 202