Multi-Objective Reinforcement Learning for AUV Thruster Failure Recovery

被引:0
作者
Ahmadzadeh, Seyed Reza [1 ]
Kormushev, Petar [1 ]
Caldwell, Darwin G. [1 ]
机构
[1] Ist Italiano Tecnol, Dept Adv Robot, Via Morego 30, I-16163 Genoa, Italy
来源
2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL) | 2014年
关键词
DIFFERENTIAL EVOLUTION; VEHICLES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates learning approaches for discovering fault-tolerant control policies to overcome thruster failures in Autonomous Underwater Vehicles (AUV). The proposed approach is a model-based direct policy search that learns on an on-board simulated model of the vehicle. When a fault is detected and isolated the model of the AUV is reconfigured according to the new condition. To discover a set of optimal solutions a multi-objective reinforcement learning approach is employed which can deal with multiple conflicting objectives. Each optimal solution can be used to generate a trajectory that is able to navigate the AUV towards a specified target while satisfying multiple objectives. The discovered policies are executed on the robot in a closed-loop using AUV's state feedback. Unlike most existing methods which disregard the faulty thruster, our approach can also deal with partially broken thrusters to increase the persistent autonomy of the AUV. In addition, the proposed approach is applicable when the AUV either becomes under-actuated or remains redundant in the presence of a fault. We validate the proposed approach on the model of the Girona500 AUV.
引用
收藏
页码:103 / 110
页数:8
相关论文
共 30 条
  • [1] Abbass HA, 2001, IEEE C EVOL COMPUTAT, P971, DOI 10.1109/CEC.2001.934295
  • [2] Ahmadzadeh S. R., 2013, 6 INT WORKSH EV REIN
  • [3] Ahmadzadeh S. R., 2014, ROB AUT ICRA 2014 IE, P6522
  • [4] Geometric control for autonomous underwater vehicles: overcoming a thruster failure
    Andonian, Michael
    Cazzaro, Dario
    Invernizzi, Luca
    Chyba, Monique
    Grammatico, Sergio
    [J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 7051 - 7056
  • [5] [Anonymous], 2001, INT S UNM UNT SUBM T
  • [6] [Anonymous], 2008, P 25 INT C MACHINE L, DOI DOI 10.1145/1390156.1390162
  • [7] Antonelli G, 2003, SPRINGER TRAC ADV RO, V1, P109
  • [8] Babu B, 2005, P INT S 58 ANN SESS
  • [9] Babu BV, 2003, IEEE C EVOL COMPUTAT, P2696
  • [10] Barto A.G., 2011, AAAI