RLProph: a dynamic programming based reinforcement learning approach for optimal routing in opportunistic IoT networks

被引:27
|
作者
Sharma, Deepak Kumar [1 ]
Rodrigues, Joel J. P. C. [2 ,3 ]
Vashishth, Vidushi [1 ]
Khanna, Anirudh [4 ]
Chhabra, Anshuman [4 ,5 ]
机构
[1] Netaji Subhas Univ Technol, Dept Informat Technol, New Delhi, India
[2] Fed Univ Piaui UFPI, Campus Petronio Portela, Teresina, PI, Brazil
[3] Inst Telecomunicacoes, Covilha, Portugal
[4] Netaji Subhas Univ Technol, Div Elect & Commun Engn, New Delhi, India
[5] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
关键词
Opportunistic networks; Internet of Things; Reinforcement learning; Markov decision process; Dynamic programming; ONE simulator; Machine learning; Policy iteration; ALGORITHM; DESIGN;
D O I
10.1007/s11276-020-02331-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Routing in Opportunistic Internet of Things networks (OppIoTs) is a challenging task because of intermittent connectivity between devices and the lack of a fixed path between the source and destination of messages. Recently, machine learning (ML) and reinforcement learning (RL) have been used with great success to automate processes in a number of different problem domains. In this paper, we seek to fully automate the OppIoT routing process by using the Policy Iteration algorithm to maximize the possibility of message delivery. Moreover, we model the OppIoT environment as a Markov decision process (MDP) replete with states, actions, rewards, and transition probabilities. The proposed routing protocol, RLProph, is able to optimize the routing process via the optimal policy obtained by solving the MDP using Policy Iteration. Through extensive simulations, we show that RLProph outperforms a number of ML-based and context-aware routing protocols on a multitude of performance criteria.
引用
收藏
页码:4319 / 4338
页数:20
相关论文
共 50 条
  • [31] Opportunistic Fair Scheduling in Wireless Networks: An Approximate Dynamic Programming Approach
    Zhang, Zhi
    Moola, Sudhir
    Chong, Edwin K. P.
    MOBILE NETWORKS & APPLICATIONS, 2010, 15 (05) : 710 - 728
  • [32] Reinforcement learning based dynamic distributed routing scheme for mega LEO satellite networks
    Huang, Yixin
    Wu, Shufan
    Kang, Zeyu
    Mu, Zhongcheng
    Huang, Hai
    Wu, Xiaofeng
    Tang, Andrew Jack
    Cheng, Xuebin
    CHINESE JOURNAL OF AERONAUTICS, 2023, 36 (02) : 284 - 291
  • [33] Distributed probability density based multi-objective routing for Opp-IoT networks enabled by machine learning
    Kumar, S. P. Ajith
    Banyal, Siddhant
    Bhardwaj, Kartik Krishna
    Thakur, Hardeo Kumar
    Sharma, Deepak Kumar
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (02) : 1199 - 1211
  • [34] HiLSeR: Hierarchical learning-based sectionalised routing paradigm for pervasive communication and Resource efficiency in opportunistic IoT network
    Banyal, Siddhant
    Bharadwaj, Kartik Krishna
    Sharma, Deepak Kumar
    Khanna, Ashish
    Rodrigues, Joel J. P. C.
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2021, 30
  • [35] CLORP: Cross-Layer Opportunistic Routing Protocol for Underwater Sensor Networks Based on Multiagent Reinforcement Learning
    Liu, Shuai
    Wang, Jingjing
    Shi, Wei
    Han, Guangjie
    Yan, Shefeng
    Li, Jiaheng
    IEEE SENSORS JOURNAL, 2024, 24 (10) : 17243 - 17258
  • [36] Reinforcement learning based routing in delay tolerant networks
    Rezaei, Parisa
    Derakhshanfard, Nahideh
    WIRELESS NETWORKS, 2025, 31 (03) : 2909 - 2923
  • [37] Routing in Reinforcement Learning based Cognitive Radio Networks
    Patel, Jitisha R.
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2017, : 591 - 596
  • [38] An improved PRoPHET - Random forest based optimized multi-copy routing for opportunistic IoT networks
    Srinidhi, N. N.
    Sagar, C. S.
    Chethan, Deepak S.
    Shreyas, J.
    Kumar, Dilip S. M.
    INTERNET OF THINGS, 2020, 11
  • [39] Resource Allocation Approach for Optimal Routing in IoT Wireless Mesh Networks
    Nurlan, Zhanserik
    Kokenovna, Tamara Zhukabayeva
    Othman, Mohamed
    Adamova, Aigul
    IEEE ACCESS, 2021, 9 (09): : 153926 - 153942
  • [40] Reinforcement learning based routing in wireless mesh networks
    Mustapha Boushaba
    Abdelhakim Hafid
    Abdeltouab Belbekkouche
    Michel Gendreau
    Wireless Networks, 2013, 19 : 2079 - 2091