Real-time Obstacle Avoidance for AUV Based on Reinforcement Learning and Dynamic Window Approach

被引:1
作者
Shen, Yue [1 ]
Xu, Han [1 ]
Wang, Dianrui [1 ]
Zhang, Yixiao [1 ]
Yan, Tianhong [2 ]
He, Bo [1 ]
机构
[1] Ocean Univ China, Sch Informat Sci & Engn, Qingdao, Peoples R China
[2] China Jiliang Univ, Sch Mech Elect Engn, Hangzhou, Peoples R China
来源
GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST | 2020年
关键词
autonomous underwater vehicle; obstacle avoidance; dynamic window approach; Q-learning;
D O I
10.1109/IEEECONF38699.2020.9389357
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
As an important tool for exploring the ocean, autonomous underwater vehicle (AUV) plays an irreplaceable role in various marine activities. Due to the complexity and uncertainty of the marine environment, AUV is required to develop in a more intelligent direction. How to ensure that AUV avoids obstacles and reaches the target point smoothly is a key research issue of the AUV. The dynamic window approach (DWA) is adopted to AUV in this paper to achieve AUV's autonomous obstacle avoidance for static obstacles. The DWA is used to search for the optimal velocity command in its admissible velocity space by maximizing the objective function, however, the weights of its objective function are constant, which makes AUV lack flexibility in complex environments, and even unable to avoid obstacles. To address the above problem, reinforcement learning is introduced to optimize DWA. Q-learning, a reinforcement learning algorithm, is used to learn the weights of the DWA's objective function, which enables appropriate weights can be selected in different environments and improves the applicability of DWA in the complex environment. Compared with the original DWA, the DWA combined with Q-learning is effective and suitable for complex obstacle environments.
引用
收藏
页数:4
相关论文
共 12 条
  • [1] Abubakr O. A., 2018, 11 INT S MECHATRONIC, P1, DOI DOI 10.1109/ISMA.2018.8330141
  • [2] The exploration/exploitation tradeoff in dynamic cellular genetic algorithms
    Alba, E
    Dorronsoro, B
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2005, 9 (02) : 126 - 142
  • [3] [Anonymous], 1998, Fuzzy control
  • [4] Reinforcement Learning Based Obstacle Avoidance for Autonomous Underwater Vehicle
    Bhopale, Prashant
    Kazi, Faruk
    Singh, Navdeep
    [J]. JOURNAL OF MARINE SCIENCE AND APPLICATION, 2019, 18 (02) : 228 - 238
  • [5] A knowledge-free path planning approach for smart ships based on reinforcement learning
    Chen, Chen
    Chen, Xian-Qiao
    Ma, Feng
    Zeng, Xiao-Jun
    Wang, Jin
    [J]. OCEAN ENGINEERING, 2019, 189
  • [6] Eriksen Bjorn-Olav H., 2016, 2016 IEEE Conference on Control Applications (CCA), P499, DOI 10.1109/CCA.2016.7587879
  • [7] The dynamic window approach to collision avoidance
    Fox, D
    Burgard, W
    Thrun, S
    [J]. IEEE ROBOTICS & AUTOMATION MAGAZINE, 1997, 4 (01) : 23 - 33
  • [8] Guan MY, 2018, IEEE INTL CONF IND I, P337, DOI 10.1109/INDIN.2018.8472029
  • [9] Saranrittichai P, 2013, 2013 10TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON)
  • [10] A TECHNIQUE FOR AUTONOMOUS UNDERWATER VEHICLE ROUTE PLANNING
    WARREN, CW
    [J]. IEEE JOURNAL OF OCEANIC ENGINEERING, 1990, 15 (03) : 199 - 204