RLProph: a dynamic programming based reinforcement learning approach for optimal routing in opportunistic IoT networks

被引:27
|
作者
Sharma, Deepak Kumar [1 ]
Rodrigues, Joel J. P. C. [2 ,3 ]
Vashishth, Vidushi [1 ]
Khanna, Anirudh [4 ]
Chhabra, Anshuman [4 ,5 ]
机构
[1] Netaji Subhas Univ Technol, Dept Informat Technol, New Delhi, India
[2] Fed Univ Piaui UFPI, Campus Petronio Portela, Teresina, PI, Brazil
[3] Inst Telecomunicacoes, Covilha, Portugal
[4] Netaji Subhas Univ Technol, Div Elect & Commun Engn, New Delhi, India
[5] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
关键词
Opportunistic networks; Internet of Things; Reinforcement learning; Markov decision process; Dynamic programming; ONE simulator; Machine learning; Policy iteration; ALGORITHM; DESIGN;
D O I
10.1007/s11276-020-02331-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Routing in Opportunistic Internet of Things networks (OppIoTs) is a challenging task because of intermittent connectivity between devices and the lack of a fixed path between the source and destination of messages. Recently, machine learning (ML) and reinforcement learning (RL) have been used with great success to automate processes in a number of different problem domains. In this paper, we seek to fully automate the OppIoT routing process by using the Policy Iteration algorithm to maximize the possibility of message delivery. Moreover, we model the OppIoT environment as a Markov decision process (MDP) replete with states, actions, rewards, and transition probabilities. The proposed routing protocol, RLProph, is able to optimize the routing process via the optimal policy obtained by solving the MDP using Policy Iteration. Through extensive simulations, we show that RLProph outperforms a number of ML-based and context-aware routing protocols on a multitude of performance criteria.
引用
收藏
页码:4319 / 4338
页数:20
相关论文
共 50 条
  • [41] Optimized Routing in Software Defined Networks - A Reinforcement Learning Approach
    Mahboob, Tahira
    Jung, Young Rok
    Chung, Min Young
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM) 2019, 2019, 935 : 267 - 278
  • [42] Bandwidth and Storage Efficient Caching Based on Dynamic Programming and Reinforcement Learning
    Lin, Zhiyuan
    Huang, Wei
    Chen, Wei
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2020, 9 (02) : 206 - 209
  • [43] Reinforcement Learning based on Stochastic Dynamic Programming for Condition-based Maintenance of Deteriorating Production Processes
    Rasay, Hasan
    Naderkhani, Farnoosh
    Golmohammadi, Amir Mohammad
    2022 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM), 2022, : 17 - 24
  • [44] Reinforcement Learning-based approach for dynamic vehicle routing problem with stochastic demand
    Zhou, Chenhao
    Ma, Jingxin
    Douge, Louis
    Chew, Ek Peng
    Lee, Loo Hay
    COMPUTERS & INDUSTRIAL ENGINEERING, 2023, 182
  • [45] AOR: Adaptive opportunistic routing based on reinforcement learning for planetary surface exploration
    Wang, Yijie
    Yu, Ziping
    Zhao, Zhongliang
    Cao, Xianbin
    COMPUTER COMMUNICATIONS, 2023, 211 : 134 - 146
  • [46] Random forest classifier-based safe and reliable routing for opportunistic IoT networks
    Kandhoul, Nisha
    Dhurandher, Sanjay K.
    Woungang, Isaac
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2021, 34 (01)
  • [47] Cooperative Reinforcement Learning Aided Dynamic Routing in UAV Swarm Networks
    Wang, Zunliang
    Yao, Haipeng
    Mai, Tianle
    Xiong, Zehui
    Yu, F. Richard
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022,
  • [48] Searching for optimal process routes: A reinforcement learning approach
    Khan, Ahmad
    Lapkin, Alexei
    COMPUTERS & CHEMICAL ENGINEERING, 2020, 141
  • [49] Competitive Algorithms and Reinforcement Learning for NOMA in IoT Networks
    Mlika, Zoubeir
    Cherkaoui, Soumaya
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [50] An Optimal Stopping Decision Method for Routing in Opportunistic Networks
    Huang, Di
    Zhang, Sanfeng
    Chen, Zhou
    2013 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2013, : 2074 - 2079