Generative adversarial interactive imitation learning for path following of autonomous underwater vehicle

Cited by: 10
Authors
Jiang, Dong [1]
Huang, Jie [1]
Fang, Zheng [1]
Cheng, Chunxi [1]
Sha, Qixin [1]
He, Bo [1]
Li, Guangliang [1]
Affiliation
[1] Ocean Univ China, Coll Elect Engn, Qingdao, Peoples R China
Keywords
Deep reinforcement learning; Autonomous control; Autonomous underwater vehicle; Imitation learning; Interactive reinforcement learning; LEVEL CONTROL; PID CONTROL; DEEP;
DOI
10.1016/j.oceaneng.2022.111971
Chinese Library Classification
U6 [Waterway transportation]; P75 [Ocean engineering]
Discipline classification codes
0814; 081505; 0824; 082401
Abstract
Autonomous underwater vehicles (AUVs) play an increasingly important role in marine scientific research and resource exploration because of their flexibility. Recently, deep reinforcement learning (DRL) has been used to improve AUV autonomy. However, defining efficient reward functions for DRL to learn control policies across various tasks is very time-consuming and can even be impractical. In this paper, we implemented the generative adversarial imitation learning (GAIL) algorithm, which learns from demonstrated trajectories, and proposed GA2IL, which learns from both demonstrations and additional human rewards, for AUV path following. We evaluated GAIL and GA2IL in a straight-line following task and a sinusoidal-curve following task on the Gazebo platform, extended to simulated underwater environments with our lab's AUV simulator. Both methods were compared with PPO, a classic deep reinforcement learning method that learns from a predefined reward function, and with a well-tuned PID controller. In addition, to evaluate the generalization of GAIL and GA2IL, we tested the control policies trained on the previous two tasks in a new, complex comb-scan following task and a different sinusoidal-curve following task, respectively. Our simulation results show that AUV path following with GA2IL and GAIL achieves performance similar to that of PPO and the PID controller in both tasks. Moreover, GA2IL generalizes as well as PPO and adapts better to complex and different tasks than the traditional PID controller.
Pages: 11