Variational model-based Deep Reinforcement Learning for Non-Homogeneous Patrolling aquatic environments with multiple unmanned surface vehicles

被引:1
|
作者
Luis, Samuel Yanes [1 ]
Basilico, Nicola [2 ]
Antonazzi, Michele [2 ]
Gutierrez-Reina, Daniel [1 ]
Marin, Sergio Toral [1 ]
机构
[1] Univ Seville, Dept Elect Engn, Camino Ave Descubrimientos s-n, Seville 41005, Spain
[2] Univ Milan, Dept Comp Sci, Via Celoria 18, I-20133 Milan, Italy
关键词
Deep Reinforcement Learning; Environmental patrolling; Multi-agent path planning; Model-based decision making;
D O I
10.1016/j.eswa.2025.126483
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the challenge of Non-Homogeneous Patrolling for Autonomous Surface Vehicles in non- homogeneous importance water environments with a dissimilar biological monitorization criterion. Traditional monitoring methods fail, especially in expansive areas such as Lake Ypacaraiin Paraguay. The proposed solution employs a cooperative Deep Reinforcement Learning framework, specifically a multi-agent version of the Double Deep Q-Learning algorithm based on safe-consensus decision making. This framework optimizes adaptive policies for such vehicles by simultaneously modeling the environment and patrolling high-importance zones. The incorporation of a Variational Auto-Encoder based on the U-Network architecture directly addresses the non-observability of the environment by predicting biological importance from partial observations. The methodology is validated in a realistic algae bloom contamination scenario, demonstrating superior performance and computational efficiency compared to traditional approaches like Gaussian Processes and K-Nearest-Neighbors. The Deep Reinforcement Learning framework, coupled with the Variational Auto-Encoder model, showcases flexibility and efficiency in addressing multi-agent cooperation and long-term objective optimization for water quality monitoring. The results reveal significant improvements, with the proposed model exceeding well-founded approaches with a 30% faster minimization of the patrolling score compared to these methods.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Autonomous Control of Combat Unmanned Aerial Vehicles to Evade Surface-to-Air Missiles Using Deep Reinforcement Learning
    Lee, Gyeong Taek
    Kim, Chang Ouk
    IEEE ACCESS, 2020, 8 : 226724 - 226736
  • [32] Development of deep reinforcement learning-based fault diagnosis method for actuator faults in unmanned aerial vehicles
    Saied, M.
    Tahan, N.
    Chreif, K.
    Francis, C.
    Noun, Z.
    AERONAUTICAL JOURNAL, 2025,
  • [33] Learning to see via epiretinal implant stimulation in silico with model-based deep reinforcement learning
    Lavoie, Jacob
    Besrour, Marwan
    Lemaire, William
    Rouat, Jean
    Fontaine, Rejean
    Plourde, Eric
    BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2024, 10 (02)
  • [34] CarAware: A Deep Reinforcement Learning Platform for Multiple Autonomous Vehicles Based on CARLA Simulation Framework
    Araujo, Tulio Oliveira
    Netto, Marcio Lobo
    Justo, Joao Francisco
    2023 8TH INTERNATIONAL CONFERENCE ON MODELS AND TECHNOLOGIES FOR INTELLIGENT TRANSPORTATION SYSTEMS, MT-ITS, 2023,
  • [35] Design of formation control algorithm for multiple autonomous underwater vehicles based on deep reinforcement learning
    Yan J.
    Xu L.
    Cao W.-Q.
    Yang X.
    Luo X.-Y.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (05): : 1457 - 1463
  • [36] Risk-based implementation of COLREGs for autonomous surface vehicles using deep reinforcement learning
    Heiberg, Amalie
    Larsen, Thomas Nakken
    Meyer, Eivind
    Rasheed, Adil
    San, Omer
    Varagnolo, Damiano
    NEURAL NETWORKS, 2022, 152 : 17 - 33
  • [37] Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments
    Xie, Ronglei
    Meng, Zhijun
    Wang, Lifeng
    Li, Haochen
    Wang, Kaipeng
    Wu, Zhe
    IEEE ACCESS, 2021, 9 : 24884 - 24900
  • [38] A sample efficient model-based deep reinforcement learning algorithm with experience replay for robot manipulation
    Zhang, Cheng
    Ma, Liang
    Schmitz, Alexander
    INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS, 2020, 4 (02) : 217 - 228
  • [39] A sample efficient model-based deep reinforcement learning algorithm with experience replay for robot manipulation
    Cheng Zhang
    Liang Ma
    Alexander Schmitz
    International Journal of Intelligent Robotics and Applications, 2020, 4 : 217 - 228
  • [40] Modeling and energy dynamic control for a ZEH via hybrid model-based deep reinforcement learning
    Li, Yanxue
    Wang, Zixuan
    Xu, Wenya
    Gao, Weijun
    Xu, Yang
    Xiao, Fu
    ENERGY, 2023, 277