Multi-objective Game Learning Algorithm Based on Multi-armed Bandit in Underwater Acoustic Communication Networks

被引：1

作者：

Wang, Hui ^{[1
]}

Yang, Liejun ^{[2
,3
]}

机构：

[1] Minnan Normal Univ, Sch Phys & Informat Engn, 36, Xianqianzhi St, Zhangzhou 363000, Peoples R China

[2] Ningde Normal Univ, Sch Informat & Mech & Elect Engn, 1, Coll Rd, Ningde 352000, Peoples R China

[3] Fujian Prov Univ, Ningde Normal Univ, Key Lab Intelligent Ecotourism & Leisure Agr, Ningde 352100, Peoples R China

来源：

SENSORS AND MATERIALS | 2023年 / 35卷 / 05期

基金：

中国国家自然科学基金;

关键词：

underwater acoustic communication; reinforcement learning; power allocation; multi-armed bandit; POWER ALLOCATION; PROTOCOL;

D O I：

10.18494/SAM4305

中图分类号：

TH7 [仪器、仪表];

学科分类号：

0804 ; 080401 ; 081102 ;

摘要：

To address the challenges of interference in underwater multi-node communication and enhance the efficiency of underwater acoustic communication, we propose a multi-objective game learning algorithm based on the multi-armed bandit framework. Firstly, the multi-objective optimization problem is constructed as a multi-node multi-armed bandit (MAB) game model. Secondly, we incorporate the overall network interference level and nodes' power cost in the utility function to achieve the desired optimization objectives. Thirdly, we establish the existence and uniqueness of the Nash equilibrium point of the game model and introduce an improved greedy strategy MAB learning algorithm to determine the equilibrium solution. Finally, our simulation results demonstrate that the proposed algorithm effectively optimizes interference management while enhancing the nodes' adaptive capabilities.

引用

页码：1619 / 1630

页数：12

共 25 条

[1] [Anonymous], 2007, ACM SIGMOBILE MOBILE
[2] On the Achievable Rate of a Class of Acoustic Channels and Practical Power Allocation Strategies for OFDM Systems
Aval, Yashar M.
Wilson, Sarah Kate
Stojanovic, Milica
[J]. IEEE JOURNAL OF OCEANIC ENGINEERING, 2015, 40 (04) : 785 - 795
[3] Multi-Access Communications With Energy Harvesting: A Multi-Armed Bandit Model and the Optimality of the Myopic Policy
Blasco, Pol
Guenduez, Deniz
[J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2015, 33 (03) : 585 - 597
[4] Protocol design issues in underwater acoustic networks
Casari, Paolo
Zorzi, Michele
[J]. COMPUTER COMMUNICATIONS, 2011, 34 (17) : 2013 - 2025
[5] A Survey on MAC Protocols for Underwater Wireless Sensor Networks
Chen, Keyu
Ma, Maode
Cheng, En
Yuan, Fei
Su, Wei
[J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2014, 16 (03): : 1433 - 1447
[6] Cooperative Authentication in Underwater Acoustic Sensor Networks
Diamant, Roee
Casari, Paolo
Tomasin, Stefano
[J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (02) : 954 - 968
[7] Covariance Matrix Adaptation for Multiobjective Multiarmed Bandits
Drugan, Madalina M.
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (08) : 2493 - 2502
[8] Distributed Stochastic Online Learning Policies for Opportunistic Spectrum Access
Gai, Yi
Krishnamachari, Bhaskar
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (23) : 6184 - 6193
[9] On Joint Frequency and Power Allocation in a Cross-Layer Protocol for Underwater Acoustic Networks
Jornet, Josep Miquel
Stojanovic, Milica
Zorzi, Michele
[J]. IEEE JOURNAL OF OCEANIC ENGINEERING, 2010, 35 (04) : 936 - 947
[10] Luo C., 2010, CCECE 2010, P1

← 1 2 3 →