Additional planning with multiple objectives for reinforcement learning

被引:11
|
作者
Pan, Anqi [1 ,2 ]
Xu, Wenjun [3 ,4 ]
Wang, Lei [5 ]
Ren, Hongliang [3 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai 201620, Peoples R China
[2] Donghua Univ, Engn Res Ctr Digitized Text & Fash Technol, Minist Educ, Shanghai 201620, Peoples R China
[3] Natl Univ Singapore, Dept Biomed Engn, Singapore 117583, Singapore
[4] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[5] Tongji Univ, Sch Elect & Informat Engn, Shanghai 201804, Peoples R China
关键词
Reinforcement learning; Multi-objective; Robotic control; ALGORITHM;
D O I
10.1016/j.knosys.2019.105392
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most control tasks have multiple objectives that need to be achieved simultaneously, while the reward definition is the weighted combination of all objects to determine one optimal policy. This configuration has a limitation in exploration flexibility and presents difficulty in reaching a satisfied terminate condition. Although some multi-objective reinforcement learning (MORL) methods have been presented recently, they concentrate on obtaining a set of compromising options rather than one best-performed strategy. On the other hand, the existing policy-improve methods have rarely emphasized on solving multiple objectives circumstances. Inspired by the enhanced policy search methods, an additional planning technique with multiple objectives for reinforcement learning is proposed in this paper, which is denoted as RLAP-MOP. This method provides opportunities to evaluate parallel requirements at the same time and suggests several optimal feasible actions to improve long-term performance further. Meanwhile, the short-term planning adopted in this paper has advantages in maintaining safe trajectories and building more accurate approximate models, which contributes to accelerating the training program. For comparison, an RLAP with single-objective optimization is also introduced in theoretical and experimental studies. The proposed techniques are investigated on a multi-objective cartpole environment and a soft robotic palpation task. The superiorities in the improved return values and learning stability prove that the multiple objectives based additional planning is a promising assistant to improve reinforcement learning. (c) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Adaptive Objective Selection for Correlated Objectives in Multi-Objective Reinforcement Learning
    Brys, Tim
    Van Moffaert, Kristof
    Nowe, Ann
    Taylor, Matthew E.
    AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 1349 - 1350
  • [42] Artificial Development by Reinforcement Learning Can Benefit From Multiple Motivations
    Palm, Guenther
    Schwenker, Friedhelm
    FRONTIERS IN ROBOTICS AND AI, 2019, 6
  • [43] The Robotic Arm Velocity Planning Based on Reinforcement Learning
    Huang, Hao-Hsuan
    Cheng, Chih-Kai
    Chen, Yi-Hung
    Tsai, Hung-Yin
    INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2023, 24 (09) : 1707 - 1721
  • [44] Deep Reinforcement Learning for UAV Intelligent Mission Planning
    Yue, Longfei
    Yang, Rennong
    Zhang, Ying
    Yu, Lixin
    Wang, Zhuangzhuang
    COMPLEXITY, 2022, 2022
  • [45] Path planning of virtual human by using reinforcement learning
    He, Yue-Sheng
    Tang, Yuan-Yan
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 987 - 992
  • [46] Robot path planning algorithm based on reinforcement learning
    Zhang F.
    Li N.
    Yuan R.
    Fu Y.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2018, 46 (12): : 65 - 70
  • [47] Reinforcement learning for accelerated automatic treatment planning optimization
    Anjo, Eva
    Rocha, Humberto
    Dias, Joana
    RADIOTHERAPY AND ONCOLOGY, 2024, 194 : S4435 - S4437
  • [48] Route Planning and Power Management for PHEVs With Reinforcement Learning
    Zhang, Qian
    Wu, Kui
    Shi, Yang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (05) : 4751 - 4762
  • [49] The Robotic Arm Velocity Planning Based on Reinforcement Learning
    Hao-Hsuan Huang
    Chih-Kai Cheng
    Yi-Hung Chen
    Hung-Yin Tsai
    International Journal of Precision Engineering and Manufacturing, 2023, 24 : 1707 - 1721
  • [50] Path Planning for Autonomous Balloon Navigation with Reinforcement Learning
    He, Yingzhe
    Guo, Kai
    Wang, Chisheng
    Fu, Keyi
    Zheng, Jiehao
    ELECTRONICS, 2025, 14 (01):