Deep Reinforcement Learning Based MAC Protocol for Underwater Acoustic Networks

Cited by: 18

Authors:

Ye, Xiaowen [1,2]

Yu, Yiding [3]

Fu, Liqun [1,2]

Affiliations:

[1] Xiamen Univ, Sch Informat, Xiamen 361005, Fujian, Peoples R China

[2] Xiamen Univ, Key Lab Underwater Acoust Commun & Marine Informa, Minist Educ, Xiamen 361005, Fujian, Peoples R China

[3] Huawei Technol Co Ltd, Labs 2012, Theory Lab, Shatin, Hong Kong, Peoples R China

Source:

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2022 / Vol. 21 / No. 05

Funding:

National Natural Science Foundation of China;

Keywords:

Propagation delay; Media Access Protocol; Throughput; Training; Reinforcement learning; Underwater acoustics; Long propagation delay; medium access control; underwater acoustic networks; deep reinforcement learning; ACCESS; TDMA;

DOI:

10.1109/TMC.2020.3029844

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Long propagation delay, which degrades the throughput of underwater acoustic networks (UWANs), is a critical issue in medium access control (MAC) protocol design for UWANs. This paper develops a deep reinforcement learning (DRL) based MAC protocol for UWANs, referred to as delayed-reward deep-reinforcement learning multiple access (DR-DLMA), to maximize the network throughput by judiciously utilizing the available time slots resulting from propagation delays or left unused by other nodes. In the DR-DLMA design, we first put forth a new DRL algorithm, termed delayed-reward deep Q-network (DR-DQN). Then we formulate the multiple access problem in UWANs as a reinforcement learning (RL) problem by defining state, action, and reward in the parlance of RL, thereby realizing the DR-DLMA protocol. In traditional DRL algorithms, e.g., the original DQN algorithm, the agent can access the "reward" from the environment immediately after taking an action. In contrast, in our design, the "reward" (i.e., the ACK packet) becomes available only after twice the one-way propagation delay has elapsed since the agent took an action (i.e., transmitted a data packet). The essence of DR-DQN is to incorporate the propagation delay into the DRL framework and modify the DRL algorithm accordingly. In addition, to reduce the cost of training the deep neural network (DNN) online, we provide a nimble training mechanism for DR-DQN. The optimal network throughputs in various cases are given as a benchmark. Simulation results show that our DR-DLMA protocol with the nimble training mechanism can: (i) find the optimal transmission strategy when coexisting with other protocols in a heterogeneous environment; (ii) outperform state-of-the-art MAC protocols (e.g., slotted FAMA and DOTS) in a homogeneous environment; and (iii) greatly reduce energy consumption and run-time compared with DR-DLMA with the traditional DNN training mechanism.
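
The delayed-reward idea in the abstract can be illustrated with a small sketch. This is not the authors' DR-DQN implementation. It only shows one way to hold an action until its feedback arrives two one-way propagation delays later, and then form an experience tuple. The class name DelayedRewardBuffer, the slotted-time model, the integer delay in slots, the action encoding (1 = transmit, 0 = wait), and the reward values are all assumptions made for illustration.

```python
from collections import deque
from dataclasses import dataclass
from typing import Deque, List, Optional, Tuple

State = Tuple[int, ...]
Experience = Tuple[State, int, float, State]


@dataclass
class Pending:
    """An action whose feedback has not arrived yet (hypothetical helper)."""
    slot: int
    state: State
    action: int  # assumed encoding: 1 = transmit, 0 = wait


class DelayedRewardBuffer:
    """Pairs each action with the feedback that arrives one round trip later.

    Assumes slotted time, one action per slot, and a one-way propagation
    delay that is a whole number of slots.
    """

    def __init__(self, one_way_delay_slots: int) -> None:
        if one_way_delay_slots < 0:
            raise ValueError("delay must be non-negative")
        # Per the abstract, the ACK is available after twice the one-way delay.
        self.round_trip_slots = 2 * one_way_delay_slots
        self.pending: Deque[Pending] = deque()
        self.experiences: List[Experience] = []

    def act(self, slot: int, state: State, action: int) -> None:
        """Record the action taken in this slot; its reward is not known yet."""
        self.pending.append(Pending(slot, state, action))

    def observe(self, slot: int, ack_received: bool,
                next_state: State) -> Optional[Experience]:
        """Match feedback seen in this slot to the action taken a round trip ago."""
        if not self.pending:
            return None
        if self.pending[0].slot + self.round_trip_slots != slot:
            return None  # feedback for the oldest action is still in flight
        p = self.pending.popleft()
        # Assumed reward: 1.0 for an acknowledged transmission, otherwise 0.0.
        reward = 1.0 if (p.action == 1 and ack_received) else 0.0
        experience = (p.state, p.action, reward, next_state)
        self.experiences.append(experience)
        return experience
```

In a sketch like this, a learner would sample from `experiences` only after the round trip has elapsed, so the Q-value update for an action uses the ACK that the action actually produced rather than whatever feedback happened to arrive in the same slot.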


Pages: 1625 - 1638

Page count: 14

50 records in total

[1] MAC Protocol for Underwater Acoustic Networks Based on Deep Reinforcement Learning
Geng, Xuan
Zheng, Y. Rosa
WUWNET'19: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON UNDERWATER NETWORKS & SYSTEMS, 2019,
[2] Deep Reinforcement Learning Based MAC Protocol for Underwater Acoustic Networks
Ye, Xiaowen
Fu, Liqun
WUWNET'19: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON UNDERWATER NETWORKS & SYSTEMS, 2019,
[3] Packet Flow Based Reinforcement Learning MAC Protocol for Underwater Acoustic Sensor Networks
Alhassan, Ibrahim B.
Mitchell, Paul D.
SENSORS, 2021, 21 (07)
[4] Reinforcement Learning Based MAC Protocol (UW-ALOHA-Q) for Underwater Acoustic Sensor Networks
Park, Sung Hyun
Mitchell, Paul Daniel
Grace, David
IEEE ACCESS, 2019, 7 : 165531 - 165542
[5] Reinforcement Learning-Based MAC Protocol for Underwater Multimedia Sensor Networks
Gazi, Firoj
Ahmed, Nurzaman
Misra, Sudip
Wei, Wei
ACM TRANSACTIONS ON SENSOR NETWORKS, 2022, 18 (03)
[6] MAC Protocol for Underwater Acoustic Multi-Cluster Networks Based on Multi-Agent Reinforcement Learning
Huang, Jiajie
Ye, Xiaowen
Fu, Liqun
17TH ACM INTERNATIONAL CONFERENCE ON UNDERWATER NETWORKS & SYSTEMS, WUWNET 2023, 2024,
[7] Reinforcement Learning Based MAC Protocol (UW-ALOHA-QM) for Mobile Underwater Acoustic Sensor Networks
Park, Sung Hyun
Mitchell, Paul Daniel
Grace, David
IEEE ACCESS, 2021, 9 : 5906 - 5919
[8] Meta-learning Based Hyperparameter Reweighting MAC Protocol for Underwater Acoustic Networks
Lu, Ge
Tu, Shen'ao
Dong, Yanhan
Chen, Yougan
Wan, Lei
17TH ACM INTERNATIONAL CONFERENCE ON UNDERWATER NETWORKS & SYSTEMS, WUWNET 2023, 2024,
[9] An Improved R-MAC Based MAC Protocol for Underwater Acoustic Networks
Liu Meiqin
Huang Weikun
Qian Liangfang
Zhang Senlin
2016 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2016,
[10] Requirement Learning and Matching MAC Protocol for Underwater Acoustic Sensor Networks
Cen, Yi
Liu, Minliu
Li, Deshi
WUWNET'21: THE 15TH ACM INTERNATIONAL CONFERENCE ON UNDERWATER NETWORKS & SYSTEMS, 2021,
