Variational model-based Deep Reinforcement Learning for Non-Homogeneous Patrolling aquatic environments with multiple unmanned surface vehicles

Cited by: 1

Authors:

Luis, Samuel Yanes ^{[1]}

Basilico, Nicola ^{[2]}

Antonazzi, Michele ^{[2]}

Gutierrez-Reina, Daniel ^{[1]}

Marin, Sergio Toral ^{[1]}

Affiliations:

[1] Univ Seville, Dept Elect Engn, Camino Ave Descubrimientos s-n, Seville 41005, Spain

[2] Univ Milan, Dept Comp Sci, Via Celoria 18, I-20133 Milan, Italy

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2025 / Vol. 270

Keywords:

Deep Reinforcement Learning; Environmental patrolling; Multi-agent path planning; Model-based decision making;

DOI:

10.1016/j.eswa.2025.126483

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper addresses the challenge of Non-Homogeneous Patrolling for Autonomous Surface Vehicles in water environments of non-homogeneous importance with a dissimilar biological monitoring criterion. Traditional monitoring methods fail, especially in expansive areas such as Lake Ypacarai in Paraguay. The proposed solution employs a cooperative Deep Reinforcement Learning framework, specifically a multi-agent version of the Double Deep Q-Learning algorithm based on safe-consensus decision making. This framework optimizes adaptive policies for such vehicles by simultaneously modeling the environment and patrolling high-importance zones. The incorporation of a Variational Auto-Encoder based on the U-Network architecture directly addresses the non-observability of the environment by predicting biological importance from partial observations. The methodology is validated in a realistic algae bloom contamination scenario, demonstrating superior performance and computational efficiency compared to traditional approaches like Gaussian Processes and K-Nearest-Neighbors. The Deep Reinforcement Learning framework, coupled with the Variational Auto-Encoder model, showcases flexibility and efficiency in addressing multi-agent cooperation and long-term objective optimization for water quality monitoring. The results reveal significant improvements, with the proposed model outperforming well-founded approaches by minimizing the patrolling score 30% faster.

Pages: 13

50 items in total

[41] Intelligent Reflective Surface Resource Allocation Algorithm Based on Deep Reinforcement Learning in Internet of Vehicles System
Zeng, Rong
Feng, Zhenhui
Zhang, Zaichen
Ying, Na
Wang, Hao
Yao, Yingbiao
IEEE COMMUNICATIONS LETTERS, 2024, 28 (08) : 1885 - 1888
[42] Deep Reinforcement Learning-Based Motion Control for Unmanned Vehicles from the Perspective of Multi-Sensor Data Fusion
Wei, Hongbo
Cui, Xuerong
Zhang, Yucheng
Chen, Haihua
Zhang, Jingyao
JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (10)
[43] Proximal Policy Optimization Through a Deep Reinforcement Learning Framework for Multiple Autonomous Vehicles at a Non-Signalized Intersection
Duy Quang Tran
Bae, Sang-Hoon
APPLIED SCIENCES-BASEL, 2020, 10 (16)
[44] Residual Deep Reinforcement Learning With Model-Based Optimization for Inverter-Based Volt-Var Control
Liu, Qiong
Guo, Ye
Deng, Lirong
Liu, Haotian
Li, Dongyu
Sun, Hongbin
IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2025, 16 (01) : 269 - 283
[45] Design Optimization of a Pneumatic Soft Robotic Actuator Using Model-Based Optimization and Deep Reinforcement Learning
Raeisinezhad, Mahsa
Pagliocca, Nicholas
Koohbor, Behrad
Trkov, Mitja
FRONTIERS IN ROBOTICS AND AI, 2021, 8
[46] A model-based deep reinforcement learning approach to the nonblocking coordination of modular supervisors of discrete event systems
Yang, Junjun
Tan, Kaige
Feng, Lei
Li, Zhiwu
INFORMATION SCIENCES, 2023, 630 : 305 - 321
[47] Deep Reinforcement Learning-Based Intelligent Reflecting Surface for Cooperative Jamming Model Design
Lu, Shaofang
Shen, Xianhao
Zhang, Panfeng
Wu, Zhen
Chen, Yi
Wang, Li
Xie, Xiaolan
IEEE ACCESS, 2023, 11 : 98764 - 98775
[48] Research on Path-Following Technology of a Single-Outboard-Motor Unmanned Surface Vehicle Based on Deep Reinforcement Learning and Model Predictive Control Algorithm
Cui, Bin
Chen, Yuanming
Hong, Xiaobin
Luo, Hao
Chen, Guanqiao
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (12)
[49] Model-free voltage control of active distribution system with PVs using surrogate model-based deep reinforcement learning
Cao, Di
Zhao, Junbo
Hu, Weihao
Ding, Fei
Yu, Nanpeng
Huang, Qi
Chen, Zhe
APPLIED ENERGY, 2022, 306
[50] A digital twin-driven dynamic path planning approach for multiple automatic guided vehicles based on deep reinforcement learning
Bao, Qiangwei
Zheng, Pai
Dai, Sheng
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2024, 238 (04) : 488 - 499