Variational model-based Deep Reinforcement Learning for Non-Homogeneous Patrolling aquatic environments with multiple unmanned surface vehicles

Cited by: 1

Authors:

Luis, Samuel Yanes ^{[1]}

Basilico, Nicola ^{[2]}

Antonazzi, Michele ^{[2]}

Gutierrez-Reina, Daniel ^{[1]}

Marin, Sergio Toral ^{[1]}

Affiliations:

[1] Univ Seville, Dept Elect Engn, Camino Ave Descubrimientos s-n, Seville 41005, Spain

[2] Univ Milan, Dept Comp Sci, Via Celoria 18, I-20133 Milan, Italy

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2025 / Vol. 270

Keywords:

Deep Reinforcement Learning; Environmental patrolling; Multi-agent path planning; Model-based decision making;

DOI:

10.1016/j.eswa.2025.126483

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper addresses the challenge of Non-Homogeneous Patrolling for Autonomous Surface Vehicles in non-homogeneous importance water environments with a dissimilar biological monitorization criterion. Traditional monitoring methods fail, especially in expansive areas such as Lake Ypacaraí in Paraguay. The proposed solution employs a cooperative Deep Reinforcement Learning framework, specifically a multi-agent version of the Double Deep Q-Learning algorithm based on safe-consensus decision making. This framework optimizes adaptive policies for such vehicles by simultaneously modeling the environment and patrolling high-importance zones. The incorporation of a Variational Auto-Encoder based on the U-Network architecture directly addresses the non-observability of the environment by predicting biological importance from partial observations. The methodology is validated in a realistic algae bloom contamination scenario, demonstrating superior performance and computational efficiency compared to traditional approaches like Gaussian Processes and K-Nearest-Neighbors. The Deep Reinforcement Learning framework, coupled with the Variational Auto-Encoder model, showcases flexibility and efficiency in addressing multi-agent cooperation and long-term objective optimization for water quality monitoring. The results reveal significant improvements, with the proposed model exceeding well-founded approaches with a 30% faster minimization of the patrolling score compared to these methods.


Pages: 13

Related papers: 50 entries in total

[31] Autonomous Control of Combat Unmanned Aerial Vehicles to Evade Surface-to-Air Missiles Using Deep Reinforcement Learning
Lee, Gyeong Taek
Kim, Chang Ouk
IEEE ACCESS, 2020, 8 : 226724 - 226736
[32] Development of deep reinforcement learning-based fault diagnosis method for actuator faults in unmanned aerial vehicles
Saied, M.
Tahan, N.
Chreif, K.
Francis, C.
Noun, Z.
AERONAUTICAL JOURNAL, 2025,
[33] Learning to see via epiretinal implant stimulation in silico with model-based deep reinforcement learning
Lavoie, Jacob
Besrour, Marwan
Lemaire, William
Rouat, Jean
Fontaine, Rejean
Plourde, Eric
BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2024, 10 (02)
[34] CarAware: A Deep Reinforcement Learning Platform for Multiple Autonomous Vehicles Based on CARLA Simulation Framework
Araujo, Tulio Oliveira
Netto, Marcio Lobo
Justo, Joao Francisco
2023 8TH INTERNATIONAL CONFERENCE ON MODELS AND TECHNOLOGIES FOR INTELLIGENT TRANSPORTATION SYSTEMS, MT-ITS, 2023,
[35] Design of formation control algorithm for multiple autonomous underwater vehicles based on deep reinforcement learning
Yan J.
Xu L.
Cao W.-Q.
Yang X.
Luo X.-Y.
Kongzhi yu Juece/Control and Decision, 2023, 38 (05) : 1457 - 1463
[36] Risk-based implementation of COLREGs for autonomous surface vehicles using deep reinforcement learning
Heiberg, Amalie
Larsen, Thomas Nakken
Meyer, Eivind
Rasheed, Adil
San, Omer
Varagnolo, Damiano
NEURAL NETWORKS, 2022, 152 : 17 - 33
[37] Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments
Xie, Ronglei
Meng, Zhijun
Wang, Lifeng
Li, Haochen
Wang, Kaipeng
Wu, Zhe
IEEE ACCESS, 2021, 9 : 24884 - 24900
[38] A sample efficient model-based deep reinforcement learning algorithm with experience replay for robot manipulation
Zhang, Cheng
Ma, Liang
Schmitz, Alexander
INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS, 2020, 4 (02) : 217 - 228
[39] A sample efficient model-based deep reinforcement learning algorithm with experience replay for robot manipulation
Cheng Zhang
Liang Ma
Alexander Schmitz
International Journal of Intelligent Robotics and Applications, 2020, 4 : 217 - 228
[40] Modeling and energy dynamic control for a ZEH via hybrid model-based deep reinforcement learning
Li, Yanxue
Wang, Zixuan
Xu, Wenya
Gao, Weijun
Xu, Yang
Xiao, Fu
ENERGY, 2023, 277
