Generative adversarial interactive imitation learning for path following of autonomous underwater vehicle

被引:10
|
作者
Jiang, Dong [1 ]
Huang, Jie [1 ]
Fang, Zheng [1 ]
Cheng, Chunxi [1 ]
Sha, Qixin [1 ]
He, Bo [1 ]
Li, Guangliang [1 ]
机构
[1] Ocean Univ China, Coll Elect Engn, Qingdao, Peoples R China
关键词
Deep reinforcement learning; Autonomous control; Autonomous underwater vehicle; Imitation learning; Interactive reinforcement learning; LEVEL CONTROL; PID CONTROL; DEEP;
D O I
10.1016/j.oceaneng.2022.111971
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Autonomous underwater vehicle (AUV) is playing a more and more important role in marine scientific research and resource exploration due to its flexibility. Recently, deep reinforcement learning (DRL) has been used to improve the autonomy of AUV. However, it is very time-consuming and even unpractical to define efficient reward functions for DRL to learn control policies in various tasks. In this paper, we implemented the generative adversarial imitation learning (GAIL) algorithm learning from demonstrated trajectories and proposed GA2IL learning from demonstrations and additional human rewards for AUV path following. We evaluated GAIL and our GA2IL method in a straight line following task and a sinusoids curve following task on the Gazebo platform extended to simulated underwater environments with AUV simulator of our lab. Both methods were compared to PPO-a classic traditional deep reinforcement learning from a predefined reward function, and a well-tuned PID controller. In addition, to evaluate the generalization of GAIL and our GA2IL method, we tested the trained control policies of the previous two tasks via GAIL and GA2IL in a new complex comb scan following task and a different sinusoids curve following task respectively. Our simulation results show AUV path following with GA2IL and GAIL can obtain a performance at a similar level to PPO and PID controller in both tasks. Moreover, GA2IL can generalize as well as PPO, adapting better to complex and different tasks than traditional PID controller.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Deep Interactive Reinforcement Learning for Path Following of Autonomous Underwater Vehicle
    Zhang, Qilei
    Lin, Jinying
    Sha, Qixin
    He, Bo
    Li, Guangliang
    IEEE ACCESS, 2020, 8 : 24258 - 24268
  • [2] A Mixed Generative Adversarial Imitation Learning Based Vehicle Path Planning Algorithm
    Yang, Zan
    Nai, Wei
    Li, Dan
    Liu, Lu
    Chen, Ziyu
    IEEE ACCESS, 2024, 12 : 85859 - 85879
  • [3] Autonomous underwater vehicle formation control and obstacle avoidance using multi-agent generative adversarial imitation learning
    Fang, Zheng
    Jiang, Dong
    Huang, Jie
    Cheng, Chunxi
    Sha, Qixin
    He, Bo
    Li, Guangliang
    OCEAN ENGINEERING, 2022, 262
  • [4] Autonomous underwater vehicle formation control and obstacle avoidance using multi-agent generative adversarial imitation learning
    Fang, Zheng
    Jiang, Dong
    Huang, Jie
    Cheng, Chunxi
    Sha, Qixin
    He, Bo
    Li, Guangliang
    Ocean Engineering, 2022, 262
  • [5] Generative Adversarial Imitation Learning
    Ho, Jonathan
    Ermon, Stefano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [6] Urban Vehicle Trajectory Generation Based on Generative Adversarial Imitation Learning
    Wang, Min
    Cui, Jianqun
    Wong, Yew Wee
    Chang, Yanan
    Wu, Libing
    Jin, Jiong
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (12) : 18237 - 18249
  • [7] Unmanned surface vehicle navigation through generative adversarial imitation learning
    Chaysri, Piyabhum
    Spatharis, Christos
    Blekas, Konstantinos
    Vlachos, Kostas
    OCEAN ENGINEERING, 2023, 282
  • [8] Path following control of mini autonomous underwater vehicle
    Liu, Hui
    Xiang, Xianbo
    2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 7504 - 7508
  • [9] Quantum generative adversarial imitation learning
    Xiao, Tailong
    Huang, Jingzheng
    Li, Hongjing
    Fan, Jianping
    Zeng, Guihua
    NEW JOURNAL OF PHYSICS, 2023, 25 (03):
  • [10] Deterministic generative adversarial imitation learning
    Zuo, Guoyu
    Chen, Kexin
    Lu, Jiahao
    Huang, Xiangsheng
    NEUROCOMPUTING, 2020, 388 : 60 - 69