Efficient Elitist Cooperative Evolutionary Algorithm for Multi-Objective Reinforcement Learning

Cited by: 5
Authors
Zhou, Dan [1]
Du, Jiqing [1]
Arai, Sachiyo [1]
Affiliation
[1] Chiba Univ, Grad Sch Sci & Engn, Dept Urban Environm Syst, Div Earth & Environm Sci, Chiba 2638522, Japan
Keywords
Pareto optimization; Statistics; Social factors; Underwater vehicles; Measurement; Q-learning; Uncertainty; Reinforcement learning; Multi-objective reinforcement learning; efficient; cooperative; Pareto front; elite archive; Genetic algorithm
DOI
10.1109/ACCESS.2023.3272115
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Sequential decision-making problems with multiple objectives are known as multi-objective reinforcement learning problems. In these scenarios, decision-makers require a complete Pareto front that consists of Pareto optimal solutions. Such a front enables decision-makers to understand the relationship between objectives and make informed decisions from a broad range of solutions. However, existing methods may be unable to search for solutions in concave regions of the Pareto front or may lack global optimization ability, leading to incomplete Pareto fronts. To address this issue, we propose an efficient elitist cooperative evolutionary algorithm that maintains both an evolving population and an elite archive. The elite archive uses cooperative operations with various genetic operators to guide the evolving population, resulting in efficient searches for Pareto optimal solutions. The experimental results on submarine treasure hunting benchmarks demonstrate the effectiveness of the proposed method in solving various multi-objective reinforcement learning problems. The method provides decision-makers with a set of trade-off solutions between travel time and treasure amount, enabling them to make flexible and informed decisions based on their preferences. Therefore, the proposed method has the potential to be a useful tool for real-world applications.
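The abstract refers to an elite archive of Pareto optimal solutions trading off travel time against treasure amount. As a rough illustration only, the following is a minimal sketch of a generic non-dominated archive update, not the authors' algorithm: it omits the evolving population, the cooperative operations, and the genetic operators. The names `dominates` and `update_archive` and the sample (time, treasure) values are hypothetical, and the sketch assumes travel time is minimised and treasure is maximised.

```python
from typing import List, Tuple

# Assumed encoding: (travel_time, treasure); minimise time, maximise treasure.
Point = Tuple[float, float]


def dominates(a: Point, b: Point) -> bool:
    """True if a is no worse than b on both objectives and strictly better on one."""
    no_worse = a[0] <= b[0] and a[1] >= b[1]
    strictly_better = a[0] < b[0] or a[1] > b[1]
    return no_worse and strictly_better


def update_archive(archive: List[Point], candidate: Point) -> List[Point]:
    """Offer one candidate to the archive and return the resulting archive."""
    # Reject duplicates and candidates dominated by an archived solution.
    if candidate in archive or any(dominates(kept, candidate) for kept in archive):
        return archive
    # Otherwise drop every archived solution the candidate dominates, then add it.
    survivors = [kept for kept in archive if not dominates(candidate, kept)]
    return survivors + [candidate]


# Hypothetical (time, treasure) outcomes, not taken from the paper.
population = [(1, 1.0), (3, 2.0), (5, 3.0), (4, 1.5), (5, 2.5), (7, 3.0)]
archive: List[Point] = []
for point in population:
    archive = update_archive(archive, point)
print(sorted(archive))
```

With these made-up values, the three candidates that take longer without collecting more treasure than an archived solution are rejected, leaving only mutually non-dominated trade-offs in the archive.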
Pages: 43128-43139
Number of pages: 12