Variational model-based Deep Reinforcement Learning for Non-Homogeneous Patrolling aquatic environments with multiple unmanned surface vehicles

被引:1
|
作者
Luis, Samuel Yanes [1 ]
Basilico, Nicola [2 ]
Antonazzi, Michele [2 ]
Gutierrez-Reina, Daniel [1 ]
Marin, Sergio Toral [1 ]
机构
[1] Univ Seville, Dept Elect Engn, Camino Ave Descubrimientos s-n, Seville 41005, Spain
[2] Univ Milan, Dept Comp Sci, Via Celoria 18, I-20133 Milan, Italy
关键词
Deep Reinforcement Learning; Environmental patrolling; Multi-agent path planning; Model-based decision making;
D O I
10.1016/j.eswa.2025.126483
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the challenge of Non-Homogeneous Patrolling for Autonomous Surface Vehicles in non- homogeneous importance water environments with a dissimilar biological monitorization criterion. Traditional monitoring methods fail, especially in expansive areas such as Lake Ypacaraiin Paraguay. The proposed solution employs a cooperative Deep Reinforcement Learning framework, specifically a multi-agent version of the Double Deep Q-Learning algorithm based on safe-consensus decision making. This framework optimizes adaptive policies for such vehicles by simultaneously modeling the environment and patrolling high-importance zones. The incorporation of a Variational Auto-Encoder based on the U-Network architecture directly addresses the non-observability of the environment by predicting biological importance from partial observations. The methodology is validated in a realistic algae bloom contamination scenario, demonstrating superior performance and computational efficiency compared to traditional approaches like Gaussian Processes and K-Nearest-Neighbors. The Deep Reinforcement Learning framework, coupled with the Variational Auto-Encoder model, showcases flexibility and efficiency in addressing multi-agent cooperation and long-term objective optimization for water quality monitoring. The results reveal significant improvements, with the proposed model exceeding well-founded approaches with a 30% faster minimization of the patrolling score compared to these methods.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Intelligent Reflective Surface Resource Allocation Algorithm Based on Deep Reinforcement Learning in Internet of Vehicles System
    Zeng, Rong
    Feng, Zhenhui
    Zhang, Zaichen
    Ying, Na
    Wang, Hao
    Yao, Yingbiao
    IEEE COMMUNICATIONS LETTERS, 2024, 28 (08) : 1885 - 1888
  • [42] Deep Reinforcement Learning-Based Motion Control for Unmanned Vehicles from the Perspective of Multi-Sensor Data Fusion
    Wei, Hongbo
    Cui, Xuerong
    Zhang, Yucheng
    Chen, Haihua
    Zhang, Jingyao
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (10)
  • [43] Proximal Policy Optimization Through a Deep Reinforcement Learning Framework for Multiple Autonomous Vehicles at a Non-Signalized Intersection
    Duy Quang Tran
    Bae, Sang-Hoon
    APPLIED SCIENCES-BASEL, 2020, 10 (16):
  • [44] Residual Deep Reinforcement Learning With Model-Based Optimization for Inverter-Based Volt-Var Control
    Liu, Qiong
    Guo, Ye
    Deng, Lirong
    Liu, Haotian
    Li, Dongyu
    Sun, Hongbin
    IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2025, 16 (01) : 269 - 283
  • [45] Design Optimization of a Pneumatic Soft Robotic Actuator Using Model-Based Optimization and Deep Reinforcement Learning
    Raeisinezhad, Mahsa
    Pagliocca, Nicholas
    Koohbor, Behrad
    Trkov, Mitja
    FRONTIERS IN ROBOTICS AND AI, 2021, 8
  • [46] A model-based deep reinforcement learning approach to the nonblocking coordination of modular supervisors of discrete event systems
    Yang, Junjun
    Tan, Kaige
    Feng, Lei
    Li, Zhiwu
    INFORMATION SCIENCES, 2023, 630 : 305 - 321
  • [47] Deep Reinforcement Learning-Based Intelligent Reflecting Surface for Cooperative Jamming Model Design
    Lu, Shaofang
    Shen, Xianhao
    Zhang, Panfeng
    Wu, Zhen
    Chen, Yi
    Wang, Li
    Xie, Xiaolan
    IEEE ACCESS, 2023, 11 : 98764 - 98775
  • [48] Research on Path-Following Technology of a Single-Outboard-Motor Unmanned Surface Vehicle Based on Deep Reinforcement Learning and Model Predictive Control Algorithm
    Cui, Bin
    Chen, Yuanming
    Hong, Xiaobin
    Luo, Hao
    Chen, Guanqiao
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (12)
  • [49] Model-free voltage control of active distribution system with PVs using surrogate model-based deep reinforcement learning
    Cao, Di
    Zhao, Junbo
    Hu, Weihao
    Ding, Fei
    Yu, Nanpeng
    Huang, Qi
    Chen, Zhe
    APPLIED ENERGY, 2022, 306
  • [50] A digital twin-driven dynamic path planning approach for multiple automatic guided vehicles based on deep reinforcement learning
    Bao, Qiangwei
    Zheng, Pai
    Dai, Sheng
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2024, 238 (04) : 488 - 499