RLProph: a dynamic programming based reinforcement learning approach for optimal routing in opportunistic IoT networks

Cited by: 27
Authors
Sharma, Deepak Kumar [1]
Rodrigues, Joel J. P. C. [2, 3]
Vashishth, Vidushi [1]
Khanna, Anirudh [4]
Chhabra, Anshuman [4, 5]
Affiliations
[1] Netaji Subhas Univ Technol, Dept Informat Technol, New Delhi, India
[2] Fed Univ Piaui UFPI, Campus Petronio Portela, Teresina, PI, Brazil
[3] Inst Telecomunicacoes, Covilha, Portugal
[4] Netaji Subhas Univ Technol, Div Elect & Commun Engn, New Delhi, India
[5] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
Keywords
Opportunistic networks; Internet of Things; Reinforcement learning; Markov decision process; Dynamic programming; ONE simulator; Machine learning; Policy iteration; ALGORITHM; DESIGN;
DOI
10.1007/s11276-020-02331-1
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Routing in Opportunistic Internet of Things networks (OppIoTs) is a challenging task because of intermittent connectivity between devices and the lack of a fixed path between the source and destination of messages. Recently, machine learning (ML) and reinforcement learning (RL) have been used with great success to automate processes in a number of different problem domains. In this paper, we seek to fully automate the OppIoT routing process by using the Policy Iteration algorithm to maximize the possibility of message delivery. Moreover, we model the OppIoT environment as a Markov decision process (MDP) replete with states, actions, rewards, and transition probabilities. The proposed routing protocol, RLProph, is able to optimize the routing process via the optimal policy obtained by solving the MDP using Policy Iteration. Through extensive simulations, we show that RLProph outperforms a number of ML-based and context-aware routing protocols on a multitude of performance criteria.
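The abstract names Policy Iteration as the solver for the routing MDP. As a rough illustration only, the sketch below runs textbook policy iteration on a tiny hypothetical MDP; it is not the RLProph formulation. The three states (message at source, at relay, delivered), the `hold` and `forward` actions, the transition probabilities, the reward of 1.0 on delivery, and the discount factor are all assumptions made up for this example.

```python
from typing import Dict, List, Tuple

# transitions[state][action] -> list of (probability, next_state, reward)
Transitions = Dict[int, Dict[str, List[Tuple[float, int, float]]]]

# Hypothetical toy MDP (not taken from the paper):
# state 0 = message at source, 1 = message at relay, 2 = delivered (absorbing).
TOY_MDP: Transitions = {
    0: {"hold": [(1.0, 0, 0.0)],
        "forward": [(0.6, 1, 0.0), (0.4, 0, 0.0)]},
    1: {"hold": [(1.0, 1, 0.0)],
        "forward": [(0.7, 2, 1.0), (0.3, 1, 0.0)]},
    2: {"hold": [(1.0, 2, 0.0)]},
}


def policy_iteration(transitions: Transitions, gamma: float = 0.9,
                     tol: float = 1e-10):
    """Alternate policy evaluation and greedy improvement until stable."""
    states = list(transitions)
    # Start from an arbitrary policy: the first action listed for each state.
    policy = {s: next(iter(transitions[s])) for s in states}
    values = {s: 0.0 for s in states}

    def q(s: int, a: str) -> float:
        # Expected return of taking action a in state s, then following values.
        return sum(p * (r + gamma * values[s2])
                   for p, s2, r in transitions[s][a])

    while True:
        # Policy evaluation: iterate the Bellman expectation backup.
        while True:
            delta = 0.0
            for s in states:
                v_new = q(s, policy[s])
                delta = max(delta, abs(v_new - values[s]))
                values[s] = v_new
            if delta < tol:
                break
        # Policy improvement: switch only on a strict gain to avoid cycling.
        stable = True
        for s in states:
            best = max(transitions[s], key=lambda a: q(s, a))
            if q(s, best) > q(s, policy[s]) + 1e-12:
                policy[s] = best
                stable = False
        if stable:
            return policy, values


policy, values = policy_iteration(TOY_MDP)
print(policy)
```

In this toy setting the loop stops once no state can gain by changing its action; with a discount factor below 1 the evaluation step converges even for the self-looping `hold` action.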
Pages: 4319 - 4338
Page count: 20
Related papers
50 in total
  • [1] RLProph: a dynamic programming based reinforcement learning approach for optimal routing in opportunistic IoT networks
    Sharma, Deepak Kumar
    Rodrigues, Joel J. P. C.
    Vashishth, Vidushi
    Khanna, Anirudh
    Chhabra, Anshuman
    WIRELESS NETWORKS, 2020, 26 : 4319 - 4338
  • [2] GMMR: A Gaussian mixture model based unsupervised machine learning approach for optimal routing in opportunistic IoT networks
    Vashishth, Vidushi
    Chhabra, Anshuman
    Sharma, Deepak Kumar
    COMPUTER COMMUNICATIONS, 2019, 134 : 138 - 148
  • [3] A Machine Learning Approach Using Classifier Cascades for Optimal Routing in Opportunistic Internet of Things Networks
    Vashishth, Vidushi
    Chhabra, Anshuman
    Sharma, Deepak Kumar
    2019 16TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING (SECON), 2019,
  • [4] A novel federated learning approach for routing optimisation in opportunistic IoT networks
    Bhardwaj, Moulik
    Singh, Jagdeep
    Gupta, Nitin
    Jadon, Kuldeep Singh
    Dhurandher, Sanjay Kumar
    INTERNATIONAL JOURNAL OF SENSOR NETWORKS, 2024, 46 (01) : 24 - 38
  • [5] Reinforcement Learning-Based Routing Protocol for Opportunistic Networks
    Dhurandher, Sanjay Kumar
    Singh, Jagdeep
    Obaidat, Mohammad S.
    Woungang, Isaac
    Srivastava, Samariddhi
    Rodrigues, Joel J. P. C.
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [6] Parallel Opportunistic Routing in IoT Networks
    Singh, Fateh
    Vijeth, J. K.
    Murthy, C. Siva Ram
    2016 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, 2016,
  • [7] RIATA: A Reinforcement Learning-Based Intelligent Routing Update Scheme for Future Generation IoT Networks
    Nain, Zulqar
    Musaddiq, Arslan
    Qadri, Yazdan Ahmad
    Nauman, Ali
    Afzal, Muhammad Khalil
    Kim, Sung Won
    IEEE ACCESS, 2021, 9 : 81161 - 81172
  • [8] Reinforcement learning-based fuzzy geocast routing protocol for opportunistic networks
    Khalid, Khuram
    Woungang, Isaac
    Dhurandher, Sanjay K.
    Singh, Jagdeep
    INTERNET OF THINGS, 2021, 14
  • [9] Caching in Dynamic IoT Networks by Deep Reinforcement Learning
    Yao, Jingjing
    Ansari, Nirwan
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) : 3268 - 3275
  • [10] Reinforcement Learning for Optimal Path Length of Nanobots Using Dynamic Programming
    Lambe, Amruta
    2017 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2017, : 414 - 419