Q-learning-based algorithms for dynamic transmission control in IoT equipment

被引:3
|
作者
Malekijou, Hanieh [1 ]
Hakami, Vesal [1 ]
Javan, Nastooh Taheri [2 ]
Malekijoo, Amirhossein [3 ]
机构
[1] Iran Univ Sci & Technol, Sch Comp Engn, Tehran, Iran
[2] Imam Khomeini Int Univ, Comp Engn Dept, Qazvin, Iran
[3] Semnan Univ, Dept Elect & Comp Engn, Semnan, Iran
来源
JOURNAL OF SUPERCOMPUTING | 2023年 / 79卷 / 01期
关键词
Delay; Energy harvesting; Jitter; Transmission control; Markov decision process; Reinforcement learning; POWER ALLOCATION; ENERGY; COMPRESSION; COMMUNICATION; POLICY;
D O I
10.1007/s11227-022-04643-9
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We investigate an energy-harvesting IoT device transmitting (delay/jitter)-sensitive data over a wireless fading channel. The sensory module on the device injects captured event packets into its transmission buffer and relies on the random supply of the energy harvested from the environment to transmit them. Given the limited harvested energy, our goal is to compute optimal transmission control policies that decide on how many packets of data should be transmitted from the buffer's head-of-line at each discrete timeslot such that a long-run criterion involving the average delay/jitter is either minimized or never exceeds a pre-specified threshold. We realistically assume that no advance knowledge is available regarding the random processes underlying the variations in the channel, captured events, or harvested energy dynamics. Instead, we utilize a suite of Q-learning-based techniques (from the reinforcement learning theory) to optimize the transmission policy in a model-free fashion. In particular, we come up with three Q-learning algorithms: a constrained Markov decision process (CMDP)-based algorithm for optimizing energy consumption under a delay constraint, an MDP-based algorithm for minimizing the average delay under the limitations imposed by the energy harvesting process, and finally, a variance-penalized MDP-based algorithm to minimize a linearly combined cost function consisting of both delay and delay variation. Extensive numerical results are presented for performance evaluation.
引用
收藏
页码:75 / 108
页数:34
相关论文
共 50 条
  • [21] Q-Learning-Based Dynamic Spectrum Access in Cognitive Industrial Internet of Things
    Feng Li
    Kwok-Yan Lam
    Zhengguo Sheng
    Xinggan Zhang
    Kanglian Zhao
    Li Wang
    Mobile Networks and Applications, 2018, 23 : 1636 - 1644
  • [22] Optimizing Q-Learning-Based Access Control Scheme Based on Q-Table Compression Method
    Ojetunde, Babatunde
    Yano, Kazuto
    2022 IEEE 33RD ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (IEEE PIMRC), 2022,
  • [23] Q-Learning-Based Spectrum Access for Multimedia Transmission Over Cognitive Radio Networks
    Huang, Xin-Lin
    Li, Yu-Xuan
    Gao, Yu
    Tang, Xiao-Wei
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (01) : 110 - 119
  • [24] A Q-learning-based Multipath Scheduler for Data Transmission Optimization in Heterogeneous Wireless Networks
    Nguyen, Thanh Trung
    Vu, Minh Hai
    Nguyen, Phi Le
    Do, Phan Thuan
    Nguyen, Kien
    2023 IEEE 20TH CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2023,
  • [25] A deep q-learning-based optimization of the inventory control in a linear process chain
    M.-A. Dittrich
    S. Fohlmeister
    Production Engineering, 2021, 15 : 35 - 43
  • [26] Hierarchical Distributed Q-learning-based resource allocation and UBS control in SATIN
    Jeon, Kakyeom
    Lee, Howon
    2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024, : 1094 - 1095
  • [27] Q-learning-based practical disturbance compensation control for hypersonic flight vehicle
    Li, Xu
    Zhang, Ziyi
    Ji, Yuehui
    Liu, Junjie
    Gao, Qiang
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2023, 237 (08) : 1916 - 1929
  • [28] Q-Learning-Based Model Predictive Control for Energy Management in Residential Aggregator
    Ojand, Kianoosh
    Dagdougui, Hanane
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 19 (01) : 70 - 81
  • [29] Q-learning-based congestion control strategy for information-centric networking
    Meng, Wei
    Zhang, Lingling
    INTERNET TECHNOLOGY LETTERS, 2021, 4 (05)
  • [30] Q-Learning-Based Multi-Rate Optimal Control for Process Industries
    Xia, Zhenxing
    Hu, Mengjie
    Dai, Wei
    Yan, Huaicheng
    Ma, Xiaoping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (06) : 2006 - 2010