Reinforcement Learning-Based Trajectory Optimization for Data Muling With Underwater Mobile Nodes

被引:3
作者
Fu, Qiang [1 ]
Song, Aijun [1 ]
Zhang, Fuming [2 ]
Pan, Miao [3 ]
机构
[1] Univ Alabama, Dept Elect & Comp Engn, Tuscaloosa, AL 35487 USA
[2] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
[3] Univ Houston, Dept Elect & Comp Engn, Houston, TX 77204 USA
基金
美国国家科学基金会;
关键词
Underwater acoustic communications; data muling; autonomous underwater vehicles; autonomous surface vehicles; trajectory optimization; reinforcement learning; WIRELESS SENSOR NETWORKS; DATA-COLLECTION; ROUTING PROTOCOL; COOPERATION; ALGORITHM; COVERAGE; DELAY;
D O I
10.1109/ACCESS.2022.3165046
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses trajectory optimization problems for underwater data muling with mobile nodes. In the underwater data muling scenario, multiple autonomous underwater vehicles (AUVs) sample a mission area, and autonomous surface vehicles (ASVs) visit the navigating AUVs to retrieve the collected data. The optimization objectives are to simultaneously maximize fairness in data transmissions and minimize the travel distance of the surface nodes. We propose an nearest-K reinforcement learning algorithm, which chooses only from the nearest-K AUVs as candidates for the next node for data transmissions. We use the distance between AUVs and the ASV as the state, selected AUVs as the action. A reward is designed as the function of both the data volume transmitted and the ASV travel distance. In the scenario with multiple ASVs, an AUV association strategy is presented to support the use of multiple surface nodes. We conduct computer simulations for performance evaluation. The effects from the number of AUVs, the size of the mission area, and the state number are investigated. The simulation results show that the proposed algorithm outperforms traditional methods in terms of the fairness and ASV travel distance.
引用
收藏
页码:38774 / 38784
页数:11
相关论文
共 47 条
[1]  
[Anonymous], 2011, EURASIP J WIRLESS CO
[2]  
Busquets J., 2013, 2013 MTS/IEEE OCEANS-Bergen, P1
[3]  
Chen S., 2020, 2020 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC), P1
[4]   ACOA-AFSA Fusion Dynamic Coded Cooperation Routing for Different Scale Multi-Hop Underwater Acoustic Sensor Networks [J].
Chen, Yougan ;
Zhu, Jianying ;
Wan, Lei ;
Huang, Shenqin ;
Zhang, Xinhai ;
Xu, Xiaomei .
IEEE ACCESS, 2020, 8 :186773-186788
[5]  
Cicek C.T., 2019, 2019 1 INT C UNM
[6]   Exploiting Mobility to Improve Underwater Sensor Networks [J].
Coutinho, Rodolfo W. L. ;
Boukerche, Azzedine .
PROCEEDINGS OF THE 16TH ACM INTERNATIONAL SYMPOSIUM ON MOBILITY MANAGEMENT AND WIRELESS ACCESS (MOBIWAC'18), 2018, :89-94
[7]   CARMA: Channel-Aware Reinforcement Learning-Based Multi-Path Adaptive Routing for Underwater Wireless Sensor Networks [J].
Di Valerio, Valerio ;
Lo Presti, Francesco ;
Petrioli, Chiara ;
Picari, Luigi ;
Spaccini, Daniele ;
Basagni, Stefano .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (11) :2634-2647
[8]  
Doniec M, AUTONOMOUS UNDERWATE
[9]   Delay-aware data fusion in duty-cycled wireless sensor networks: A Q-learning approach [J].
Donta, Praveen Kumar ;
Amgoth, Tarachand ;
Annavarapu, Chandra Sekhara Rao .
SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2022, 33
[10]   Data Collection and Path Determination Strategies for Mobile Sink in 3D WSNs [J].
Donta, Praveen Kumar ;
Rao, Banoth Sanjai Prasada ;
Amgoth, Tarachand ;
Annavarapu, Chandra Sekhara Rao ;
Swain, Silpamayee .
IEEE SENSORS JOURNAL, 2020, 20 (04) :2224-2233