Deep Reinforcement Learning Based MAC Protocol for Underwater Acoustic Networks

被引:18
|
作者
Ye, Xiaowen [1 ,2 ]
Yu, Yiding [3 ]
Fu, Liqun [1 ,2 ]
机构
[1] Xiamen Univ, Sch Informat, Xiamen 361005, Fujian, Peoples R China
[2] Xiamen Univ, Key Lab Underwater Acoust Commun & Marine Informa, Minist Educ, Xiamen 361005, Fujian, Peoples R China
[3] Huawei Technol Co Ltd, Labs 2012, Theory Lab, Shatin, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Propagation delay; Media Access Protocol; Throughput; Training; Reinforcement learning; Underwater acoustics; Long propagation delay; medium access control; underwater acoustic networks; deep reinforcement learning; ACCESS; TDMA;
D O I
10.1109/TMC.2020.3029844
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Long propagation delay that causes throughput degradation of underwater acoustic networks (UWANs) is a critical issue in the medium access control (MAC) protocol design in UWANs. This paper develops a deep reinforcement learning (DRL) based MAC protocol for UWANs, referred to as delayed-reward deep-reinforcement learning multiple access (DR-DLMA), to maximize the network throughput by judiciously utilizing the available time slots resulted from propagation delays or not used by other nodes. In the DR-DLMA design, we first put forth a new DRL algorithm, termed as delayed-reward deep Q-network (DR-DQN). Then we formulate the multiple access problem in UWANs as a reinforcement learning (RL) problem by defining state, action, and reward in the parlance of RL, and thereby realizing the DR-DLMA protocol. In traditional DRL algorithms, e.g., the original DQN algorithm, the agent can get access to the "reward" from the environment immediately after taking an action. In contrast, in our design, the "reward" (i.e., the ACK packet) is only available after twice the one-way propagation delay after the agent takes an action (i.e., to transmit a data packet). The essence of DR-DQN is to incorporate the propagation delay into the DRL framework and modify the DRL algorithm accordingly. In addition, in order to reduce the cost of online training deep neural network (DNN), we provide a nimble training mechanism for DR-DQN. The optimal network throughputs in various cases are given as a benchmark. Simulation results show that our DR-DLMA protocol with nimble training mechanism can: (i) find the optimal transmission strategy when coexisting with other protocols in a heterogeneous environment; (ii) outperform state-of-the-art MAC protocols (e.g., slotted FAMA and DOTS) in a homogeneous environment; and (iii) greatly reduce energy consumption and run-time compared with DR-DLMA with traditional DNN training mechanism.
引用
收藏
页码:1625 / 1638
页数:14
相关论文
共 50 条
  • [1] MAC Protocol for Underwater Acoustic Networks Based on Deep Reinforcement Learning
    Geng, Xuan
    Zheng, Y. Rosa
    WUWNET'19: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON UNDERWATER NETWORKS & SYSTEMS, 2019,
  • [2] Deep Reinforcement Learning Based MAC Protocol for Underwater Acoustic Networks
    Ye, Xiaowen
    Fu, Liqun
    WUWNET'19: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON UNDERWATER NETWORKS & SYSTEMS, 2019,
  • [3] Packet Flow Based Reinforcement Learning MAC Protocol for Underwater Acoustic Sensor Networks
    Alhassan, Ibrahim B.
    Mitchell, Paul D.
    SENSORS, 2021, 21 (07)
  • [4] Reinforcement Learning Based MAC Protocol (UW-ALOHA-Q) for Underwater Acoustic Sensor Networks
    Park, Sung Hyun
    Mitchell, Paul Daniel
    Grace, David
    IEEE ACCESS, 2019, 7 (165531-165542) : 165531 - 165542
  • [5] Reinforcement Learning-Based MAC Protocol for Underwater Multimedia Sensor Networks
    Gazi, Firoj
    Ahmed, Nurzaman
    Misra, Sudip
    Wei, Wei
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2022, 18 (03)
  • [6] MAC Protocol for Underwater Acoustic Multi-Cluster Networks Based on Multi-Agent Reinforcement Learning
    Huang, Jiajie
    Ye, Xiaowen
    Fu, Liqun
    17TH ACM INTERNATIONAL CONFERENCE ON UNDERWATER NETWORKS & SYSTEMS, WUWNET 2023, 2024,
  • [7] Reinforcement Learning Based MAC Protocol (UW-ALOHA-QM) for Mobile Underwater Acoustic Sensor Networks
    Park, Sung Hyun
    Mitchell, Paul Daniel
    Grace, David
    IEEE ACCESS, 2021, 9 : 5906 - 5919
  • [8] Meta-learning Based Hyperparameter Reweighting MAC Protocol for Underwater Acoustic Networks
    Lu, Ge
    Tu, Shen'ao
    Dong, Yanhan
    Chen, Yougan
    Wan, Lei
    17TH ACM INTERNATIONAL CONFERENCE ON UNDERWATER NETWORKS & SYSTEMS, WUWNET 2023, 2024,
  • [9] An Improved R-MAC Based MAC Protocol for Underwater Acoustic Networks
    Liu Meiqin
    Huang Weikun
    Qian Liangfang
    Zhang Senlin
    2016 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2016,
  • [10] Requirement Learning and Matching MAC Protocol for Underwater Acoustic Sensor Networks
    Cen, Yi
    Liu, Minliu
    Li, Deshi
    WUWNET'21: THE 15TH ACM INTERNATIONAL CONFERENCE ON UNDERWATER NETWORKS & SYSTEMS, 2021,